Question 1

How do I integrate the document parsing API?

Accepted Answer

POST a document to /api/v1/documents with the document_type parameter. The API returns a job ID for async processing or a synchronous response for small documents. Webhooks deliver results when ready. SDKs available for Python and JavaScript.

Question 2

What document types are supported?

Accepted Answer

Bank statements, tax filings (BIR, SAT, IRS), payslips, audited financial statements, government IDs, business registrations, mobile money exports, e-wallet records, utility bills, invoices, loan agreements, vehicle and property titles. 50+ supported types with custom extraction for documents outside the list.

Question 3

What is the output format?

Accepted Answer

JSON by default with a structured schema per document type. CSV and Excel exports are available for spreadsheet workflows. The full schema for each document type is published in the documentation.

Question 4

How fast is the document parsing API?

Accepted Answer

Most documents parse in under 30 seconds. Batch and async modes are available for high-volume use. Latency budgets and SLAs are configurable for production deployments.

Question 5

Can the document parsing API detect fraud?

Accepted Answer

Yes. Tampering, metadata inconsistencies, forgery patterns, and cross-document mismatches. Fraud signals are returned alongside the extracted data, calibrated per market because fraud patterns differ by country.

Question 6

How accurate is the document parsing?

Accepted Answer

Around 98 percent end-to-end accuracy on supported document types. Per-field confidence is exposed in the API response so downstream systems can route low-confidence fields to human review.

Question 7

Is the document parsing API secure?

Accepted Answer

Kita is in active engagement for ISO 27001 and SOC 2 Type II audits. Engagement letters available on request. Data residency options for the EU and other jurisdictions. Encryption in transit and at rest.

Aspect	Legacy OCR API	Kita document parsing API
Setup per document type	Weeks of template configuration	Day-one support for 50+ types
Handwriting and photos	Fails outside clean scans	Handles photo, scan, screenshot, handwritten
Output	Raw fields you have to interpret	Credit signals plus raw data
Fraud signals	Not included	Tampering, mismatches, forgery patterns
Latency	Variable; depends on form complexity	Sub-30s typical, async webhooks for batch
Maintenance	Templates break when forms change	Generalizes across format variations

Document parsing API for loan origination.

A single endpoint for any lending document. Bank statements, tax returns, IDs, audited financials. Typed, scanned, photographed, handwritten.

What is a document parsing API?

One endpoint, many document types

Vision-language, not template OCR

Output ready for your pipeline

Document parsing API vs. legacy OCR.

Built for the three lender scenarios we serve.

Plug parsing into your LOS.

Feed your decision engine.

Document verification at scale.

Kita Capture API

Common questions