@Bindable(prefix="PreprocessingPipeline") public class CompletePreprocessingPipeline extends BasicPreprocessingPipeline
Tokenizer.tokenize(PreprocessingContext)
CaseNormalizer.normalize(PreprocessingContext)
LanguageModelStemmer.stem(PreprocessingContext)
StopListMarker.mark(PreprocessingContext)
PhraseExtractor.extractPhrases(PreprocessingContext)
LabelFilterProcessor.process(PreprocessingContext)
DocumentAssigner.assign(PreprocessingContext)
Modifier and Type | Field and Description |
---|---|
DocumentAssigner |
documentAssigner
Document assigner used by the algorithm, contains bindable attributes.
|
LabelFilterProcessor |
labelFilterProcessor
Label filter processor used by the algorithm, contains bindable attributes.
|
PhraseExtractor |
phraseExtractor
Phrase extractor used by the algorithm, contains bindable attributes.
|
caseNormalizer, languageModelStemmer, lexicalDataFactory, stemmerFactory, stopListMarker, tokenizer, tokenizerFactory
Constructor and Description |
---|
CompletePreprocessingPipeline() |
Modifier and Type | Method and Description |
---|---|
PreprocessingContext |
preprocess(List<Document> documents,
String query,
LanguageCode language)
Performs preprocessing on the provided list of documents.
|
public final PhraseExtractor phraseExtractor
public final LabelFilterProcessor labelFilterProcessor
public final DocumentAssigner documentAssigner
public PreprocessingContext preprocess(List<Document> documents, String query, LanguageCode language)
BasicPreprocessingPipeline
PreprocessingContext
.preprocess
in interface IPreprocessingPipeline
preprocess
in class BasicPreprocessingPipeline