The Document to Text module is a data module that can read a document and extract all its textual content.

You can use this module to transform any document of PDF, DOCX, TXT into text.

The Document to Text module has one input and one output:

  • Input: One input of type Document
  • Output: Text, the output is the text inside the document you provided