In our experience, using e-discovery software that does not properly tokenize CJK characters to find responsive documents not only can miss key documents but also result in up to 50
percent false - positive
identification, which leads to excessive review costs.