benchmarkstt.segmentation.core module¶
%%{init: {'theme': 'base', 'themeVariables': { 'primaryColor': '#e7f2fa', 'lineColor': '#2980B9' }}}%%
classDiagram
Simple
Segmenter <|-- Simple
class Simple {
<<iterable>>
text: str
pattern='[\\n\\t\\s]+'
normalizer=None
}
Core segmenters, each segmenter must be Iterable returning a Item
-
class
benchmarkstt.segmentation.core.
Simple
(text: str, pattern='[\\n\\t\\s]+', normalizer=None)[source]¶ Bases:
benchmarkstt.segmentation.Segmenter
Simplest case, split into words by white space