You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The JS string is actually iterated in UTF-16 Code Units, which are entire characters for English, most alphabetic text, common Chinese characters; but not Emojis and rare Chinese characters.
elasticlunr-rs/src/inverted_index.rs
Lines 40 to 42 in 29d97e4
During index building,
elasticlunr-rs
iterates over the token&str
's content in Unicode Scalar Values.While the JS library does it in this way:
The JS string is actually iterated in UTF-16 Code Units, which are entire characters for English, most alphabetic text, common Chinese characters; but not Emojis and rare Chinese characters.
Related issue with mdBook.
The text was updated successfully, but these errors were encountered: