How to use custom tokenizer with bulk loader

example

#1

Hi,

I cannot find any doc about bulk loader and custom tokenizer?
Can I use it together?

Thank you.


(Daniel Mai) #2

Using custom tokenizers during the bulk load is currently not possible since the custom tokenizers are set when the Alpha runs. You can bulk load your data and then run the Alpha to load the custom tokenizer to index with the live cluster.


#3

Hmm, i discuss it in this issue, and manishrjain responded me that it should be able in bulk loader because I have OOM issue with building large index at runtime.

Should I create new issue in github as a feature request?


(Manish R Jain) #4

Hey @selmeci, if you can add an issue in Github, that’d be great. Looks like a small change from our side.