File Upload for Structured Extraction
Upload documents for structured data extraction.
Upload Methods
Drag & Drop Interface
Fastest way to upload multiple files
Upload Area
┌─────────────────────────────────────┐
│ Drag files here │
│ │
│ or click to browse │
│ │
│ Supported: PDF, JPG, PNG, TIFF │
│ Max size: 10MB per file │
└─────────────────────────────────────┘
Features:
- Multi-file selection - Upload entire folders
- Progress tracking - See upload status
- Error handling - Retry failed uploads
- Preview thumbnails - Verify files before processing
Folder Upload
Process entire directories
Supported File Types
Document Formats
PDF Files:
- Scanned PDFs - Images converted to text
- Text PDFs - Direct text extraction
- Mixed PDFs - Both text and images
Image Formats:
- JPEG/JPG - Most common format
- PNG - High quality, transparent backgrounds
- WEBP - Modern web format
Quality Requirements
Resolution:
- Recommended: 300 DPI or higher
File Size:
- Maximum: 10MB per file
Image Quality:
- Clear text - Readable without zooming
- Good contrast - Dark text on light background
- Minimal skew - Straight, not tilted
- Complete document - All sections visible
Processing Workflow
Real-time Processing
Batch Processing
Efficient Bulk Processing
Process hundreds of documents:
- Parallel processing - Multiple files at once
- Queue management - Automatic retry for failures
- Progress tracking - Real-time status updates
- Error reporting - Detailed failure reasons
Upload Best Practices
File Preparation
Before Upload:
- Organize files - Group similar documents
- Check quality - Ensure readability
- Remove duplicates - Avoid processing same file twice
- Rename files - Use descriptive names
File Naming:
- Good: invoice_2024_001_acme_corp.pdf
- Poor: scan001.pdf
- Good: receipt_2024_01_15_starbucks.jpg
- Poor: IMG_1234.jpg
Quality Control
Image Optimization:
- Scan at 300 DPI for best results
- Use good lighting when photographing
- Keep documents flat to avoid shadows
- Crop to document edges remove backgrounds
PDF Preparation:
- Combine related pages into single PDFs
- Ensure text is selectable when possible
- Optimize file size without losing quality
- Remove password protection before upload
Troubleshooting Uploads
Common Upload Issues
File Too Large:
Error: File exceeds 10MB limit
Solution: Compress image or split PDF
Unsupported Format:
Error: .docx files not supported
Solution: Convert to PDF or image first
Upload Failed:
Error: Network timeout
Solution: Check connection, retry upload
Poor Quality:
Warning: Low resolution detected
Solution: Rescan at higher DPI
Resolution Steps
For Upload Failures:
- Check file size - Must be under 10MB
- Verify format - Use supported types only
- Test connection - Ensure stable internet
- Clear browser cache - Refresh and retry
- Contact support - If issues persist
For Processing Failures:
- Check image quality - Must be readable
- Verify schema match - Document type should match
- Review file content - Ensure complete document
- Try different format - Convert PDF to image or vice versa
Mobile Upload
Camera Integration
Direct Camera Upload:
- Take photo of document
- Auto-crop to document edges
- Enhance quality automatically
- Upload immediately for processing
Mobile Best Practices:
- Good lighting - Use natural light when possible
- Steady hands - Avoid blurry images
- Fill frame - Document should fill most of photo
- Multiple angles - Take 2-3 shots, pick best
Ready to upload your documents? Go to your project and start processing!