Software Alternatives & Reviews

Ask HN: How to extract information from mutiple (unstructured text) documents?

Apache Tika Apache OpenNLP Hacker News Search
  1. Apache Tika toolkit detects and extracts metadata and text from different file types.
    Pricing:
    • Open Source

    #App Reviews #Customer Feedback #Marketing Tools 15 social mentions

  2. Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.
    Pricing:
    • Open Source

    #NLP And Text Analytics #Spreadsheets #Natural Language Processing 3 social mentions

Discuss: Ask HN: How to extract information from mutiple (unstructured text) documents?

Log in or Post with