Releases: DoodleBears/split-lang
Releases · DoodleBears/split-lang
v2.0.5
v2.0.4
v2.0.3
v2.0.2
v2.0.1
Summary
- fix(splitter): when merge across punctuation on section stage the lang of substring inside is wrong
- fix(splitter): when merge across digit on section stage the substring inside section is not merged
Full Changelog: v2.0.0...v2.0.1
v2.0.0
Summary
- Improve stability when dealing with long and short text (by separate the splitting logic based on language SET {ZH_JA, KO, OTHERS}
- add parameter for whether to merge over, (default is
True
, when set toFalse
, this part will been separated in result substring)- newline
- digit
- punctuation (allow to set no need to merge punctuation
List[str]
)
What's Changed
- Close #23 by @DoodleBears in #24
Full Changelog: v1.4.1...v2.0.0
v1.4.1
Full Changelog: v1.4.0...v1.4.1
v1.3.9
v1.3.8
v1.3.7
Summary
- fix(detector): avoid divide by zero since word frequency can be zero (if text does not exist in that language)
Full Changelog: v1.3.6...v1.3.7