benchmarkstt.segmentation.core module

Core segmenters, each segmenter must be Iterable returning a Item

class benchmarkstt.segmentation.core.Simple(text: str, pattern='[\n\t\s]+', normalizer=None)[source]

Bases: benchmarkstt.segmentation.Base

Simplest case, split into words by white space