Text cleaner
main
-
nlper.text_cleaner.__init__.main(text: str)
Executes the text cleaning pipeline.
- Parameters
text (str) – Text to clean
application
-
class nlper.text_cleaner.application.Application(text)
Text cleaner application. On construction, it initializes the object with the cleaning utils.
- Parameters
text (str) – Text to clean
-
clean_text() → None
Calls the cleaning operations on the given text.
-
lemmatize_text() → None
Calls the text lemmatization procedure using the cleaning utils.
-
remove_characters_and_hide_numbers() → str
Removes special characters and hides numbers using the cleaning utils.
Removed characters include HTML and non-text characters.
Hidden numbers include different number formats, dates, and times.
- Returns
Text without special characters and with hidden numbers
- Return type
str
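The behaviour described for remove_characters_and_hide_numbers() could look roughly like the standalone sketch below. It is not nlper's implementation: the regular expressions, the processing order, and the NUM placeholder token are all assumptions chosen for illustration, using only the Python standard library.

```python
import html
import re

# Hypothetical placeholder; the token nlper substitutes for numbers is not documented here.
NUMBER_TOKEN = "NUM"

# Illustrative patterns only; nlper's own patterns may differ.
TAG_RE = re.compile(r"<[^>]+>")                               # HTML tags
DATE_RE = re.compile(r"\b\d{1,4}[./-]\d{1,2}[./-]\d{1,4}\b")  # 12/03/2021, 2021-03-12
TIME_RE = re.compile(r"\b\d{1,2}:\d{2}(?::\d{2})?\b")         # 14:30, 14:30:05
NUMBER_RE = re.compile(r"\b\d+(?:[.,]\d+)*\b")                # 42, 1,250.50
NON_TEXT_RE = re.compile(r"[^\w\s]")                          # punctuation and symbols


def remove_characters_and_hide_numbers(text: str) -> str:
    # Drop tags first, then decode entities such as &amp;.
    text = html.unescape(TAG_RE.sub(" ", text))
    # Dates and times go before plain numbers so each collapses to a single token.
    for pattern in (DATE_RE, TIME_RE, NUMBER_RE):
        text = pattern.sub(f" {NUMBER_TOKEN} ", text)
    # Remove whatever is neither a word character nor whitespace.
    text = NON_TEXT_RE.sub(" ", text)
    # Collapse the runs of spaces left behind by the substitutions.
    return " ".join(text.split())


cleaned = remove_characters_and_hide_numbers(
    "<p>Paid 1,250.50 on 12/03/2021 at 14:30 &amp; left!</p>"
)
print(cleaned)  # Paid NUM on NUM at NUM left
```

Matching dates and times before plain numbers matters in this sketch: otherwise 12/03/2021 would be split into three separate number tokens.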
-
run() → None
Executes the text cleaning process.
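Because run(), clean_text() and lemmatize_text() all return None, the class presumably keeps the text as object state and updates it in place. The sketch below illustrates that pattern with a hypothetical stand-in class. The attribute name text, the steps argument, the call order (run() delegating to clean_text()), and the toy dictionary lemmatizer are assumptions, not nlper's actual code.

```python
import re
from typing import Callable, List

# Toy lookup table standing in for a real lemmatizer (assumption, for illustration only).
TOY_LEMMAS = {"cats": "cat", "were": "be", "running": "run"}


def strip_punctuation(text: str) -> str:
    # Replace anything that is neither a word character nor whitespace.
    return re.sub(r"[^\w\s]", " ", text)


def toy_lemmatize(text: str) -> str:
    # Map each whitespace-separated word to its lemma when the table knows it.
    return " ".join(TOY_LEMMAS.get(word, word) for word in text.split())


class CleanerSketch:
    """Hypothetical stand-in that mirrors only the documented method names."""

    def __init__(self, text: str, steps: List[Callable[[str], str]]):
        self.text = text    # state updated in place by the methods below
        self.steps = steps  # cleaning operations, applied in order

    def clean_text(self) -> None:
        for step in self.steps:
            self.text = step(self.text)

    def run(self) -> None:
        self.clean_text()


app = CleanerSketch("The cats were running!", [strip_punctuation, str.lower, toy_lemmatize])
app.run()
print(app.text)  # the cat be run
```

Passing the steps in as callables keeps the sketch short. The documented Application takes only text, so its cleaning steps are presumably fixed inside the class.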