All Three Products are 100% Open-Source — Star us on GitHub

Three AI Products.
Extract. Crawl. Reason.

GramoPro.ai is a fully open-source AI data stack — document intelligence, web extraction, and knowledge graph RAG — built for developers and enterprises alike.

100K+ Invoices Processed Monthly
99.4% Extraction Accuracy
GdoczAI · Document Intelligence · Open-Source
GcrawlAI · Web Crawler · Open-Source
GRagAI · Knowledge Graph RAG · Open-Source
Our Products

Three Open-Source Tools, One Intelligence Stack

From raw documents to web data to connected knowledge graphs — all fully open-source, MIT licensed, and self-hostable.

📄 Document Intelligence

GdoczAI Open-Source

gdocz.gramopro.ai

AI-powered Intelligent Document Processing that extracts structured data from invoices, contracts, and forms with 99.4% accuracy, with zero templates and zero training required. Free and open-source under the MIT license.

Multi-Model OCR · Table Extraction · HITL Review · SAP / ERP Push · 100K+ Docs/Month · ⭐ MIT License
gdocz.gramopro.ai · Invoice Extractor
📄 invoice_BA2847.pdf · Processing…
EXTRACTED FIELDS
Invoice No: BA-2847-2025
Vendor: Batik Air Sdn Bhd
Amount: MYR 48,250.00
GST: 6%
Due Date: 15 Apr 2025
Status: ✓ Verified
Confidence: 99.4%
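
As a rough illustration of how a downstream system might consume fields like the ones in this demo panel, here is a minimal Python sketch. The `ExtractedInvoice` record, the `parse_amount` and `needs_human_review` helpers, and the 0.95 review threshold are all assumptions made for this example; they are not GdoczAI's actual schema or API.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class ExtractedInvoice:
    """Hypothetical record mirroring the fields shown in the demo panel."""
    invoice_no: str
    vendor: str
    currency: str
    amount: Decimal
    gst_rate: Decimal
    due_date: str
    confidence: float


def parse_amount(text: str) -> tuple[str, Decimal]:
    """Split a string like 'MYR 48,250.00' into (currency, amount)."""
    currency, number = text.split(maxsplit=1)
    return currency, Decimal(number.replace(",", ""))


def needs_human_review(invoice: ExtractedInvoice, threshold: float = 0.95) -> bool:
    """Route low-confidence extractions to a reviewer (threshold is an assumption)."""
    return invoice.confidence < threshold


currency, amount = parse_amount("MYR 48,250.00")
invoice = ExtractedInvoice(
    invoice_no="BA-2847-2025",
    vendor="Batik Air Sdn Bhd",
    currency=currency,
    amount=amount,
    gst_rate=Decimal("0.06"),
    due_date="2025-04-15",
    confidence=0.994,
)
print(invoice.amount, needs_human_review(invoice))
```

Keeping the amount as a `Decimal` rather than a float avoids rounding surprises when the value is later pushed to an accounting or ERP system.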
🕸️ Web Intelligence

GcrawlAI Open-Source

gcrawl.gramopro.ai

Open-source AI web crawler that converts any website into clean Markdown, structured JSON, or screenshots. Purpose-built for LLM pipelines, RAG, and enterprise automation. Self-host or use our cloud API.

Playwright Engine · JS Rendering · Proxy Rotation · LLM-Ready Output · Full-Page Screenshots · ⭐ MIT License
gcrawl.gramopro.ai · Crawler
URL: https://example.com/docs
Output formats: Markdown · JSON · Screenshot
# Documentation Overview

## Getting Started
Install via npm:
`npm install gcrawl-sdk`

Links found: 247
Images: 38
SEO metadata: complete
Links: 247 · Time: 1.2s · Coverage: 98%
🧠 Knowledge Graph RAG

GRagAI Open-Source

grag.gramopro.ai

Open-source Knowledge Graph-powered RAG that connects your documents, data, and enterprise knowledge into a multi-hop reasoning engine, going far beyond simple vector search. Deploy on-premises or self-host.

Knowledge Graphs · Entity Linking · Multi-Hop Reasoning · Source Grounding · LLM Agnostic · ⭐ MIT License
grag.gramopro.ai · Knowledge Graph
💬 Which Batik Air invoices exceeded MYR 40,000 in Q1 2025?
Batik Air · Invoices · Q1 2025 · MYR 40K+
Graph RAG Answer
Found 3 invoices from Batik Air in Q1 2025 exceeding MYR 40,000: BA-2847 (48,250), BA-2901 (52,100), BA-2963 (41,800).
Sources: invoice_BA2847.pdf · invoice_BA2901.pdf · invoice_BA2963.pdf
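
To make the idea of a multi-hop query concrete, the sketch below walks a tiny in-memory graph in two hops: vendor to invoices, then invoice to amount and issue date. It is a toy illustration, not GRagAI's implementation. The three BA-* invoice numbers and amounts are taken from the demo answer above; the issue dates and the `EXAMPLE-LOW` invoice are invented so the filter has something to exclude, and the relation names are hypothetical.

```python
from datetime import date

# Triples of (subject, relation, object). Invoice numbers and amounts for the
# three BA-* invoices come from the demo answer; every date, and the whole
# EXAMPLE-LOW invoice, is invented for illustration.
TRIPLES = [
    ("Batik Air", "issued", "BA-2847"),
    ("Batik Air", "issued", "BA-2901"),
    ("Batik Air", "issued", "BA-2963"),
    ("Batik Air", "issued", "EXAMPLE-LOW"),
    ("BA-2847", "amount_myr", 48250),
    ("BA-2901", "amount_myr", 52100),
    ("BA-2963", "amount_myr", 41800),
    ("EXAMPLE-LOW", "amount_myr", 12000),
    ("BA-2847", "issued_on", date(2025, 3, 16)),
    ("BA-2901", "issued_on", date(2025, 2, 3)),
    ("BA-2963", "issued_on", date(2025, 1, 20)),
    ("EXAMPLE-LOW", "issued_on", date(2025, 1, 5)),
]


def objects(subject, relation):
    """Follow one edge type out of a node."""
    return [o for s, r, o in TRIPLES if s == subject and r == relation]


def invoices_over(vendor, min_amount, start, end):
    """Hop 1: vendor -> invoices. Hop 2: invoice -> amount and issue date."""
    hits = []
    for invoice in objects(vendor, "issued"):
        amount = objects(invoice, "amount_myr")[0]
        issued_on = objects(invoice, "issued_on")[0]
        if amount > min_amount and start <= issued_on <= end:
            hits.append((invoice, amount))
    return sorted(hits)


result = invoices_over("Batik Air", 40000, date(2025, 1, 1), date(2025, 3, 31))
print(result)
```

A pure vector search would have to find one chunk that happens to mention the vendor, the amount, and the quarter together; the graph traversal instead joins facts that may live in separate documents.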
Open-Source

Built in the Open.
Free Forever.

All three GramoPro.ai products are fully open-source under the MIT license. Inspect the code, self-host on your infrastructure, contribute features, or build your own products on top — no vendor lock-in, ever.

GdoczAI ⭐ MIT License

Intelligent Document Processing engine. Extract structured data from any document type — invoices, contracts, forms — with zero templates. Self-hostable, API-first, enterprise-ready.

Python · Java · MIT License
GcrawlAI ⭐ MIT License

Open-source AI web crawler. Turn any website into LLM-ready Markdown or JSON. Playwright-powered, proxy-aware, and built to compete with Firecrawl and ScrapeGraphAI.

Python · JavaScript · MIT License
GRagAI ⭐ MIT License

Open-source Knowledge Graph RAG engine. Build entity graphs from your data and enable multi-hop reasoning over enterprise knowledge. LLM-agnostic and fully self-hostable.

Python · TypeScript · MIT License
Capabilities

Everything Your AI Data Stack Needs

Across all three products, GramoPro.ai delivers enterprise-grade capabilities for every stage of your AI data pipeline.

GdoczAI 🔍

Zero-Template Extraction

Extracts fields from any document layout — invoices, contracts, receipts — without pre-training or templates. Works on first upload.

GcrawlAI 🕸️

Full-Site Crawling

Crawl entire websites with JS rendering, dynamic content support, and proxy rotation. Get clean Markdown or structured JSON in seconds.

GRagAI 🧠

Knowledge Graph Reasoning

Build entity-relationship graphs from your documents and data. Answer multi-hop questions across connected knowledge — not just vector search.

GdoczAI 📊

Table & Form Intelligence

Accurately parse complex multi-row tables, nested forms, and handwritten fields from PDFs, scans, and images with visual grounding.

GcrawlAI 📸

Screenshots & SEO Metadata

Capture full-page screenshots and extract SEO metadata — title, meta description, headings, structured data — all in one crawl.
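
For a sense of what extracting SEO metadata from a page involves, here is a minimal sketch using only Python's standard-library `html.parser`. It is not GcrawlAI's code, and the `SeoMetadataParser` class name and the sample HTML are made up for this example; a production crawler would also need JS rendering, which this sketch does not attempt.

```python
from html.parser import HTMLParser


class SeoMetadataParser(HTMLParser):
    """Collects the title, meta description, h1-h3 headings, and a link count."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self.headings = []
        self.link_count = 0
        self._current = None  # tag whose text is currently being collected
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = attrs.get("content") or ""
        elif tag == "a" and "href" in attrs:
            self.link_count += 1
        elif tag in ("title", "h1", "h2", "h3"):
            self._current = tag
            self._buffer = []

    def handle_data(self, data):
        if self._current:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._current:
            text = "".join(self._buffer).strip()
            if tag == "title":
                self.title = text
            else:
                self.headings.append((tag, text))
            self._current = None


# Sample page invented for this example.
html = """<html><head><title>Documentation Overview</title>
<meta name="description" content="Example docs page">
</head><body><h1>Documentation Overview</h1>
<h2>Getting Started</h2><a href="/install">Install</a></body></html>"""

parser = SeoMetadataParser()
parser.feed(html)
print(parser.title, parser.description, parser.headings, parser.link_count)
```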

GRagAI 🔗

Entity Linking & Deduplication

Automatically links related entities across documents and data sources. Resolves duplicates and builds a unified, queryable knowledge graph.
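
One simple way to picture deduplication is to reduce each raw mention to a canonical key and group mentions that share it. The sketch below does that for company names. It is a deliberately naive illustration, not GRagAI's entity-resolution method: the `canonical_key` and `merge_entities` helpers and the suffix list are assumptions, and "Example Cargo Ltd" is a made-up vendor.

```python
import re
from collections import defaultdict

# Legal-form suffixes to ignore when comparing names (an assumed, partial list).
LEGAL_SUFFIXES = {"sdn", "bhd", "ltd", "inc", "llc", "plc"}


def canonical_key(name: str) -> str:
    """Lowercase, strip punctuation, and drop legal suffixes."""
    tokens = re.findall(r"[a-z0-9]+", name.lower())
    return " ".join(t for t in tokens if t not in LEGAL_SUFFIXES)


def merge_entities(mentions):
    """Group raw mentions that share a canonical key."""
    groups = defaultdict(list)
    for mention in mentions:
        groups[canonical_key(mention)].append(mention)
    return dict(groups)


mentions = [
    "Batik Air Sdn Bhd",
    "BATIK AIR SDN. BHD.",
    "Batik Air",
    "Example Cargo Ltd",  # hypothetical vendor
]
merged = merge_entities(mentions)
print(merged)
```

Exact-key matching like this misses typos and abbreviations, so real systems typically layer fuzzy or embedding-based matching on top of it.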

All

REST API First

All three products are API-first. Clean REST APIs, webhooks, and SDKs for seamless integration into any automation stack.

All 🔒

Enterprise Security

HIPAA-compliant document processing, audit trails, role-based access controls, and data residency for regulated industries.

All 🔄

500+ Integrations

Push data to SAP, Salesforce, QuickBooks, NetSuite, or any of 500+ platforms via API, webhook, Zapier, or native connectors.

Industries

Built for Every Sector

From aviation to fintech — powering intelligent automation for enterprise teams worldwide.

✈️ Aviation
🏦 Finance
🛡️ Insurance
🏥 Healthcare
📦 Logistics
⚖️ Legal
🛒 E-Commerce
🤖 AI / LLM Dev
🏗️ Manufacturing
🎓 Education
How It Works

From Raw Data to Structured Intelligence

Free, Open-Source, and Enterprise-Ready.

All three products are MIT licensed and self-hostable. Star us on GitHub, deploy on your own infrastructure, or get started instantly with our cloud platform.