Overview
WhizoAI can parse and extract content from various document formats, including PDFs, Word documents (DOCX), Excel files, and images (via OCR).
Supported Formats
PDF Documents
Extract text, tables, and images from PDF files
Word Documents
Parse DOCX and DOC files and extract formatted content
Images (OCR)
Extract text from PNG, JPG, and TIFF images using OCR
PDF Parsing
Basic PDF Extraction
Extract Text with Layout Preservation
Table Extraction from PDFs
Image Extraction from PDFs
Word Document Parsing
DOCX Extraction
Extract Document Metadata
OCR (Optical Character Recognition)
Extract Text from Images
Multi-Language OCR
Supported OCR Languages
| Language | Code | Language | Code |
|---|---|---|---|
| English | eng | Spanish | spa |
| French | fra | German | deu |
| Italian | ita | Portuguese | por |
| Chinese (Simplified) | chi_sim | Japanese | jpn |
| Korean | kor | Arabic | ara |
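The language codes above can be validated on the client before a request is sent. The following is a minimal sketch: the dictionary mirrors the table, while `resolve_language_codes` is a hypothetical helper and not part of any WhizoAI SDK. How the codes are then passed to the API is not shown in this section, so the sketch stops at producing the list of codes.

```python
# Mirror of the "Supported OCR Languages" table above.
OCR_LANGUAGES = {
    "English": "eng",
    "Spanish": "spa",
    "French": "fra",
    "German": "deu",
    "Italian": "ita",
    "Portuguese": "por",
    "Chinese (Simplified)": "chi_sim",
    "Japanese": "jpn",
    "Korean": "kor",
    "Arabic": "ara",
}


def resolve_language_codes(names):
    """Map language names to OCR codes (hypothetical helper).

    Raises ValueError if any name is missing from the table, so an
    unsupported language is caught before any request is made.
    """
    unknown = [name for name in names if name not in OCR_LANGUAGES]
    if unknown:
        raise ValueError(f"Unsupported OCR language(s): {', '.join(unknown)}")
    return [OCR_LANGUAGES[name] for name in names]


print(resolve_language_codes(["English", "Japanese"]))
```

Failing early on an unknown name avoids spending OCR credits on a request that cannot succeed.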
Excel & Spreadsheet Parsing
Parse Excel Files
Convert to CSV
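The shape of WhizoAI's spreadsheet output is not shown in this section. Assuming parsed sheet data arrives as a list of rows, where each row is a list of cell values, a conversion to CSV can be sketched with Python's standard `csv` module. `rows_to_csv` is a hypothetical helper name.

```python
import csv
import io


def rows_to_csv(rows):
    """Serialize a list of rows (lists of cell values) to CSV text.

    Assumes the parsed sheet is a plain list of lists; this shape is an
    assumption, not a documented WhizoAI response format.
    """
    buffer = io.StringIO()
    # csv.writer quotes cells that contain commas, quotes, or newlines.
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()


sheet = [["Item", "Price"], ["Widget, large", "9.50"]]
print(rows_to_csv(sheet))
```

Using `csv.writer` rather than joining cells with commas keeps values that themselves contain commas intact.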
Batch Document Processing
Process Multiple Documents
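One way to structure client-side batch processing is to run several parse calls concurrently and collect failures separately, so that one bad file does not abort the whole batch. The sketch below uses Python's standard `concurrent.futures`. `process_batch` and `fake_parse` are hypothetical names, and `fake_parse` is only a stand-in for whatever function actually submits a document to WhizoAI.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def process_batch(paths, parse_one, max_workers=4):
    """Run parse_one over every path concurrently.

    Returns (results, errors): two dicts keyed by path. A failure in one
    document is recorded in errors and does not stop the others.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(parse_one, path): path for path in paths}
        for future in as_completed(futures):
            path = futures[future]
            try:
                results[path] = future.result()
            except Exception as exc:  # record the failure, keep going
                errors[path] = str(exc)
    return results, errors


def fake_parse(path):
    """Stand-in for a real parse call (assumption, not the WhizoAI API)."""
    if not path.endswith((".pdf", ".docx")):
        raise ValueError(f"unsupported file type: {path}")
    return {"path": path, "text": f"contents of {path}"}


results, errors = process_batch(["a.pdf", "b.docx", "c.xyz"], fake_parse)
print(sorted(results), errors)
```

Threads suit this workload because each call mostly waits on the network. `max_workers` should stay within whatever rate limits apply to the account.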
Advanced Features
PDF Page Range Selection
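Because PDF parsing is priced per page (see Credit Costs), restricting a request to the pages that matter reduces cost. The request syntax for page ranges is not shown in this section. The sketch below assumes a common convention: 1-based pages, comma-separated, with dashes for inclusive ranges, such as `1-3,5`. `parse_page_range` is a hypothetical client-side helper that expands and validates such a string.

```python
def parse_page_range(spec, total_pages):
    """Expand a range string such as "1-3,5" into a sorted page list.

    The syntax (1-based, comma-separated, inclusive dash ranges) is an
    assumed convention, not a documented WhizoAI format.
    """
    pages = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        if start < 1 or end > total_pages or start > end:
            raise ValueError(
                f"Invalid page range {part!r} for a {total_pages}-page document"
            )
        pages.update(range(start, end + 1))
    return sorted(pages)


print(parse_page_range("1-3, 5, 8-9", total_pages=10))
```

Validating against `total_pages` up front catches out-of-bounds requests before any credits are spent.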
Password-Protected Documents
Form Field Extraction
Extract data from PDF forms.
AI-Powered Document Analysis
Structured Data Extraction from Documents
Document Classification
Error Handling
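The specific error codes WhizoAI returns are not listed in this section. A general pattern that applies to most document-parsing calls is to retry transient failures with exponential backoff and let permanent failures surface immediately. In the sketch below, `TransientParseError` and `with_retries` are hypothetical names. Real code would map the service's actual timeout or rate-limit errors onto the transient case.

```python
import time


class TransientParseError(Exception):
    """Hypothetical marker for failures worth retrying (timeouts, rate limits)."""


def with_retries(operation, max_attempts=3, base_delay=0.5, sleep=time.sleep):
    """Call operation(), retrying on TransientParseError.

    Waits base_delay, then 2x, then 4x, and so on between attempts. Any
    other exception, or the final transient failure, propagates to the
    caller.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except TransientParseError:
            if attempt == max_attempts:
                raise
            sleep(base_delay * 2 ** (attempt - 1))
```

A call would look like `with_retries(lambda: parse_document("report.pdf"))`, where `parse_document` stands for whatever function performs the request. The `sleep` parameter exists so tests can run without real delays.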
Credit Costs
| Operation | Cost |
|---|---|
| PDF Parsing (per page) | 1 credit |
| DOCX Parsing (per page) | 1 credit |
| OCR (per image) | 2 credits |
| Table Extraction | +1 credit per table |
| AI Extraction from Document | +3-6 credits (LLM cost) |
| Image Extraction from PDF | Included |
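The prices in the table can be combined into a rough estimate before submitting a job. The sketch below is a client-side calculation only. `estimate_credits` and `CreditEstimate` are hypothetical names, and the AI extraction surcharge is assumed to apply once per extraction call, which the table does not state explicitly. Because that surcharge is a range (3 to 6 credits), the result is a low and high bound rather than a single number.

```python
from dataclasses import dataclass


@dataclass
class CreditEstimate:
    """Lower and upper bound on credits for one job."""
    low: int
    high: int


def estimate_credits(pdf_pages=0, docx_pages=0, ocr_images=0,
                     tables=0, ai_extractions=0):
    """Estimate credits from the per-operation prices in the table above."""
    base = (
        pdf_pages * 1      # PDF parsing: 1 credit per page
        + docx_pages * 1   # DOCX parsing: 1 credit per page
        + ocr_images * 2   # OCR: 2 credits per image
        + tables * 1       # table extraction: +1 credit per table
    )
    # Image extraction from PDFs is included, so it adds nothing.
    # AI extraction: +3 to +6 credits, assumed per extraction call.
    return CreditEstimate(
        low=base + 3 * ai_extractions,
        high=base + 6 * ai_extractions,
    )


estimate = estimate_credits(pdf_pages=10, tables=2, ai_extractions=1)
print(f"{estimate.low}-{estimate.high} credits")  # prints "15-18 credits"
```

In this example, a 10-page PDF with two extracted tables costs 12 credits, and one AI extraction adds 3 to 6 more.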
Performance Tips
Common Use Cases
Invoice Processing
Extract invoice numbers, amounts, and line items from PDF invoices
Resume Parsing
Extract candidate information from PDF and DOCX resumes
Contract Analysis
Extract key terms, dates, and parties from legal contracts
Data Migration
Convert legacy documents to structured data formats
Receipt OCR
Extract amounts, dates, and merchant info from receipt images
Integration Examples
With AI Extraction
With Webhooks
Related Resources
AI Extraction
Extract structured data from parsed documents
Batch Processing
Process multiple documents efficiently
Webhooks
Get notifications when document parsing completes