Downloaders - processors¶
-
MordinezNLP.downloaders.Processors.
gzip_to_text_data_processor
(data: _io.BytesIO) → str¶ Function can be used together with downloaders to covnert BytesIO to GZIP and unpack it to str.
- Parameters
data (BytesIO) – input data which comes from downlaoder class/function
- Returns
parsed input
- Return type
str
-
MordinezNLP.downloaders.Processors.
pdf_data_processor
(data: _io.BytesIO) → str¶ Function can be used together with downloaders to convert BytesIO from PDF files to str.
- Parameters
data (BytesIO) – input data which comes from downloader class/function
- Returns
- parsed input, more informations about parsing PDFs can be found in method
MordinezNLP.parsers.process_pdf
- Return type
str
-
MordinezNLP.downloaders.Processors.
text_data_processor
(data: _io.BytesIO) → str¶ Function can be used together with downloaders to convert BytesIO from text data to str.
- Parameters
data (BytesIO) – input data which comes from downloader class/function
- Returns
parsed input
- Return type
str