You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+29-20Lines changed: 29 additions & 20 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,9 +3,8 @@
3
3
4
4
# Unstract
5
5
6
-
## No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
6
+
## The Data Layer for your Agentic Workflows—Automate Document-based workflows with close to 100% accuracy!
7
7
8
-
##
9
8
10
9

Prompt Studio's primary reason for existence is so you can develop the necessary prompts for document data extraction super efficiently. It is a purpose-built environment that makes this not just easy for you—but, lot of fun! The document sample, its variants, the prompts you're developing, outputs from different LLMs, the schema you're developing, costing details of the extraction and various tools that let you measure the effectiveness of your prompts are just a click away and easily accessible. Prompt Studio is designed for effective and high speed development and iteration of prompts for document data extraction. Welcome to IDP 2.0!
27
-
25
+
Prompt Studio is a purpose-built environment that supercharges your schema definition efforts. Compare outputs from different LLMs side-by-side, keep tab on costs while you develop generic prompts that work across wide-ranging document variations. And when you're ready, launch extraction APIs with a single click.
Once you've used Prompt Studio to define your schema, Unstract makes it easy to integrate into your existing workflows. Simply choose the integration type that best fits your environment:
32
+
33
+
| Integration Type | Description | Best For | Documentation |
| 🖥️ **MCP Servers**| Run Unstract as an MCP Server to provide structured data extraction to Agents or LLMs in your ecosystem. | Developers building **Agentic/LLM apps/tools** that speak MCP. |[Unstract MCP Server Docs](https://docs.unstract.com/unstract/unstract_platform/mcp/unstract_platform_mcp_server/)|
36
+
| 🌐 **API Deployments**| Turn any document into JSON with an API call. Deploy any Prompt Studio project as a REST API endpoint with a single click. | Teams needing **programmatic access** in apps, services, or custom tooling. |[API Deployment Docs](https://docs.unstract.com/unstract/unstract_platform/api_deployment/unstract_api_deployment_intro/)|
37
+
| ⚙️ **ETL Pipelines**| Embed Unstract directly into your ETL jobs to transform unstructured data before loading it into your warehouse / database. |**Engineering and Data engineering teams** that need to batch process documents into clean JSON. |[ETL Pipelines Docs](https://docs.unstract.com/unstract/unstract_platform/etl_pipeline/unstract_etl_pipeline_intro/)|
38
+
| 🧩 **n8n Nodes**| Use Unstract as ready-made nodes in n8n workflows for drag-and-drop automation. |**Low-code users** and **ops teams** automating workflows. |[Unstract n8n Nodes Docs](https://docs.unstract.com/unstract/unstract_platform/api_deployment/unstract_api_deployment_n8n_custom_node/)|
39
+
40
+
## ☁️ Getting Started (Cloud / Enterprise)
32
41
33
-
Automate critical business processes that involve complex documents with a human in the loop. Go beyond RPA with the power of Large Language Models.
42
+
The easy-peasy way to try Unstract is to [sign up for a **14-day free trial**](https://unstract.com/start-for-free/). Give Unstract a spin now!
34
43
35
-
🌟 **Step 1**: Add documents to no-code Prompt Studio and do prompt engineering to extract required fields <br>
36
-
🌟 **Step 2**: Configure Prompt Studio project as API deployment or configure input source and output destination for ETL Pipeline<br>
37
-
🌟 **Step 3**: Deploy Workflows as unstructured data APIs or unstructured data ETL Pipelines!
44
+
Unstract Cloud also comes with some really awesome features that give serious accuracy boosts to agentic/LLM-powered document-centric workflows in the enterprise.
38
45
39
-

46
+
| Feature | Description | Documentation |
47
+
|---------|-------------|---------------|
48
+
| 🧪 **LLMChallenge**| Uses two Large Language Models to ensure trustworthy output. You either get the right response or no response at all. |[Docs](https://docs.unstract.com/unstract/unstract_platform/features/llm_challenge/llm_challenge_intro/)|
49
+
| ⚡ **SinglePass Extraction**| Reduces LLM token usage by up to **8x**, dramatically cutting costs. |[Docs](https://docs.unstract.com/unstract/editions/cloud_edition/#singlepass-extraction)|
50
+
| 📉 **SummarizedExtraction**| Reduces LLM token usage by up to **6x**, saving costs while keeping accuracy. |[Docs](https://docs.unstract.com/unstract/unstract_platform/features/summarized_extraction/summarized_extraction_intro/)|
51
+
| 👀 **Human-In-The-Loop**| Side-by-side comparison of extracted value and source document, with highlighting for human review and tweaking. |[Docs](https://docs.unstract.com/unstract/unstract_platform/human_quality_review/human_quality_review_intro/)|
52
+
| 🔐 **SSO Support**| Enterprise-ready authentication options for seamless onboarding and off-boarding. |[Docs](https://docs.unstract.com/unstract/editions/cloud_edition/#enterprise-features)|
53
+
54
+
## ⏩ Quick Start Guide
55
+
56
+
Unstract comes well documented. You can get introduced to the [basics of Unstract](https://docs.unstract.com/unstract/), and [learn how to connect](https://docs.unstract.com/unstract/unstract_platform/setup_accounts/whats_needed) various systems like LLMs, Vector Databases, Embedding Models and Text Extractors to it. The easiest way to wet your feet is to go through our [Quick Start Guide](https://docs.unstract.com/unstract/unstract_platform/quick_start) where you actually get to do some prompt engineering in Prompt Studio and launch an API to structure varied credit card statements!
40
57
41
-
## 🚀 Getting started
58
+
## 🚀 Getting started (self-hosted)
42
59
43
60
### System Requirements
44
61
45
-
- 8GB RAM (recommended)
62
+
- 8GB RAM (minimum)
46
63
47
64
### Prerequisites
48
65
@@ -57,7 +74,6 @@ Next, either download a release or clone this repo and do the following:
57
74
✅ Now visit [http://frontend.unstract.localhost](http://frontend.unstract.localhost) in your browser <br>
58
75
✅ Use username and password `unstract` to login
59
76
60
-
61
77
That's all there is to it!
62
78
63
79
Follow [these steps](backend/README.md#authentication) to change the default username and password.
@@ -93,10 +109,6 @@ Unstract supports a wide range of file formats for document processing:
93
109
|| TIFF | Tagged Image File Format |
94
110
|| WEBP | Web Picture Format |
95
111
96
-
## ⏩ Quick Start Guide
97
-
98
-
Unstract comes well documented. You can get introduced to the [basics of Unstract](https://docs.unstract.com/unstract/), and [learn how to connect](https://docs.unstract.com/unstract/unstract_platform/setup_accounts/whats_needed) various systems like LLMs, Vector Databases, Embedding Models and Text Extractors to it. The easiest way to wet your feet is to go through our [Quick Start Guide](https://docs.unstract.com/unstract/unstract_platform/quick_start) where you actually get to do some prompt engineering in Prompt Studio and launch an API to structure varied credit card statements!
99
-
100
112
## 🤝 Ecosystem support
101
113
102
114
### LLM Providers
@@ -113,7 +125,6 @@ Unstract comes well documented. You can get introduced to the [basics of Unstrac
113
125
| <imgsrc="docs/assets/3rd_party/anyscale.png"width="32"/> | Anyscale | ✅ Working |
114
126
| <imgsrc="docs/assets/3rd_party/mistral_ai.png"width="32"/> | Mistral AI | ✅ Working |
115
127
116
-
117
128
### Vector Databases
118
129
119
130
|| Provider | Status |
@@ -124,8 +135,6 @@ Unstract comes well documented. You can get introduced to the [basics of Unstrac
124
135
|<imgsrc="docs/assets/3rd_party/postgres.png"width="32"/>| PostgreSQL | ✅ Working |
125
136
|<imgsrc="docs/assets/3rd_party/milvus.png"width="32"/>| Milvus | ✅ Working |
0 commit comments