Hi Developers,
Is there any way to get the word count of all topics in the SDL repository for a particular period of time, e.g. the last year?
Thanks
Roopesh N
Hi Roopesh,
As always, it helps to know which product version you are asking about, as that could open up more options. So I will assume the latest fielded release, Knowledge Center 2016SP1 Content Manager (12.0.1).
Typically there is an IWrite* plugin configured, which you can see in Web Client > Settings > Write Plugin XML configuration. It is called ISHSYSWORDCOUNT and is responsible for filling the language-level FISHWORDCOUNT field, which is present on every content object (so Maps, Topics, Libraries,...).
The implementation of the plugin is a balance between performance and accuracy. It doesn't do full language segmentation. It simply knows how to treat XML and counts space characters (as word delimiters), so it is a simple word count. (KC2016SP1 - IWriteMetadataAndBlobPlugin - BlobWordCount)
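To give a feel for what a space-based count means, here is a minimal Python sketch. It is not the plugin's actual code; the function name and the sample topic are made up for illustration, and it splits on any whitespace rather than literally counting space characters.

```python
import xml.etree.ElementTree as ET


def simple_word_count(xml_text: str) -> int:
    """Rough word count: strip the markup, then split on whitespace.

    Illustration only, not the ISHSYSWORDCOUNT implementation.
    """
    root = ET.fromstring(xml_text)
    # Join text fragments with a space so words in adjacent elements
    # (e.g. a title followed by a paragraph) do not run together.
    text = " ".join(root.itertext())
    return len(text.split())


# Hypothetical sample topic
sample = (
    "<topic><title>Install the product</title>"
    "<body><p>Run the <b>setup</b> wizard.</p></body></topic>"
)
print(simple_word_count(sample))  # prints 7
```

A count like this says little for languages that do not separate words with spaces, which is why it is only a rough indicator.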
So now you know whether and where a word count is stored on every content object (so Maps, Topics, Libraries,...). Next is deciding what "time period" means. I would suggest using the language-level field FISHLASTMODIFIEDON, which is updated every time a new blob is submitted into the repository.
Something worth thinking about is whether you want to do this only on the latest version or on all versions per object, and whether you want it for one language (e.g. the source language) or for all languages. As said, the word count is space-based, so it is not valuable for all languages.
Those fields can be added to some Client Tools list views, but this probably wouldn't meet your aggregation expectations. Adding them to Web Client CSV reports also still leaves a gap with your suggested "Last one year", as these are typically scoped per publication export.
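Once you have the metadata out of the repository by whatever means, the aggregation itself is simple. Below is a rough Python sketch of the choices above (period, latest version or all versions, one language or all). Only FISHWORDCOUNT and FISHLASTMODIFIEDON are real field names; the keys "id", "version" and "language", the ISO date format, the integer version numbers and the sample rows are all assumptions for illustration.

```python
from datetime import datetime


def total_word_count(rows, since, language=None, latest_only=True):
    """Sum FISHWORDCOUNT over rows modified on or after `since`.

    rows: dicts with hypothetical keys "id", "version", "language",
    plus FISHLASTMODIFIEDON (assumed ISO date) and FISHWORDCOUNT.
    """
    selected = []
    for row in rows:
        modified = datetime.fromisoformat(row["FISHLASTMODIFIEDON"])
        if modified < since:
            continue  # outside the requested period
        if language is not None and row["language"] != language:
            continue  # restrict to one language, e.g. the source language
        selected.append(row)

    if latest_only:
        # Keep only the highest version per object and language
        latest = {}
        for row in selected:
            key = (row["id"], row["language"])
            if key not in latest or row["version"] > latest[key]["version"]:
                latest[key] = row
        selected = list(latest.values())

    return sum(int(row["FISHWORDCOUNT"]) for row in selected)


# Made-up sample data
rows = [
    {"id": "A", "version": 1, "language": "en",
     "FISHLASTMODIFIEDON": "2016-01-10", "FISHWORDCOUNT": 100},
    {"id": "A", "version": 2, "language": "en",
     "FISHLASTMODIFIEDON": "2016-09-01", "FISHWORDCOUNT": 120},
    {"id": "A", "version": 2, "language": "de",
     "FISHLASTMODIFIEDON": "2016-09-05", "FISHWORDCOUNT": 110},
    {"id": "B", "version": 1, "language": "en",
     "FISHLASTMODIFIEDON": "2014-03-01", "FISHWORDCOUNT": 500},
]
since = datetime(2016, 1, 1)
print(total_word_count(rows, since, language="en"))                     # 120
print(total_word_count(rows, since, language="en", latest_only=False))  # 220
print(total_word_count(rows, since))                                    # 230
```

Note how much the total changes depending on whether you count every version or only the latest one, so it is worth settling that before reporting a number.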
Then we have the API
I hope this gets you started.
Best wishes,
Dave