Unstructured Core Library
unstructured library is designed to help preprocess and structure unstructured text documents for use in downstream machine learning tasks. Examples of documents that can be processed
unstructured library include PDFs, XML and HTML documents.
Instructions on how to install the
unstructuredlibrary on your system.
- Unstructured API Services
Access all the power of
unstructured-apior learn to host it locally.
- Core Functionality
Learn more about the core partitioning, chunking, cleaning, and staging functionality within the Unstructured library.
Connect to your favorite data storage platforms for an effortless batch processing of your files.
Learn more about how metadata is tracked in the
Examples of other types of workflows within the
We make it easy for you to connect your output with other popular ML services.
- Best Practices
Learn best practices to optimize document information extraction using