This is a known issue and work has been done on it. The solution it turned out was to use a custom tokenizer for CJK languages. It hasn’t yet been merged into the mainline. I’ll get it done this week
Also, yes, you should use zh
instead of cn
.
This is a known issue and work has been done on it. The solution it turned out was to use a custom tokenizer for CJK languages. It hasn’t yet been merged into the mainline. I’ll get it done this week
Also, yes, you should use zh
instead of cn
.