Downloaders - processors

MordinezNLP.downloaders.Processors.gzip_to_text_data_processor(data: _io.BytesIO) → str

Function can be used together with downloaders to decompress GZIP data held in a BytesIO object and decode it to str.

Parameters

data (BytesIO) – input data which comes from a downloader class/function

Returns

parsed input

Return type

str
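
The behaviour described above can be illustrated with the standard library alone. The following is a minimal sketch, not the library's actual implementation: the helper name gzip_bytes_to_text and the UTF-8 default encoding are assumptions made for the example.

```python
import gzip
import io


def gzip_bytes_to_text(data: io.BytesIO, encoding: str = "utf-8") -> str:
    """Hypothetical stand-in for a GZIP processor (not the MordinezNLP code)."""
    # Take the compressed bytes from the buffer, decompress, then decode to str.
    # The encoding is an assumption; the real processor may use a different one.
    return gzip.decompress(data.getvalue()).decode(encoding)


# Build a small GZIP payload in memory, similar to what a downloader might return.
gzip_payload = io.BytesIO(gzip.compress("hello, world\n".encode("utf-8")))
gzip_text = gzip_bytes_to_text(gzip_payload)
print(gzip_text)
```

In real use, gzip_to_text_data_processor itself would be applied to the BytesIO object produced by a downloader.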

MordinezNLP.downloaders.Processors.pdf_data_processor(data: _io.BytesIO) → str

Function can be used together with downloaders to convert BytesIO from PDF files to str.

Parameters

data (BytesIO) – input data which comes from a downloader class/function

Returns

parsed input; more information about parsing PDFs can be found in MordinezNLP.parsers.process_pdf

Return type

str

MordinezNLP.downloaders.Processors.text_data_processor(data: _io.BytesIO) → str

Function can be used together with downloaders to convert BytesIO from text data to str.

Parameters

data (BytesIO) – input data which comes from a downloader class/function

Returns

parsed input

Return type

str
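
As a rough illustration of the BytesIO-to-str conversion, here is a minimal standard-library sketch. The helper name bytes_to_text is hypothetical and the UTF-8 encoding is an assumption; the source does not state which encoding text_data_processor uses.

```python
import io


def bytes_to_text(data: io.BytesIO, encoding: str = "utf-8") -> str:
    """Hypothetical stand-in for a plain-text processor (not the MordinezNLP code)."""
    # Decode the whole buffer in one step; the encoding is an assumption.
    return data.getvalue().decode(encoding)


# Simulate the raw bytes a downloader might return for a text resource.
raw_buffer = io.BytesIO("zażółć gęślą jaźń".encode("utf-8"))
decoded_text = bytes_to_text(raw_buffer)
print(decoded_text)
```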