How Asian characters are counted with "Use word-based tokenization for Asian languages"

When "Use word-based tokenization for Asian languages" is selected, will it count each Asian character as a character and count each English word as a word? And when this option is not selected (default), will it count each character (including English words and numbers) as a character?

Can I rerun the analysis report for an existing project? I'd like to rerun the report as "Use word-based tokenization for Asian languages" and see how the word count changes.

Parents Reply Children
No Data