Document parsing eliminates the need for data entry since it is fast and significantly accurate. With our API parsing tool, you can parse text files, images, XML files, PDF files, among other common formats used to store information.
- Benefits of Using our Document Parser for Complex Documents
- Parsing Large Files with Document Parsing API
- How to Upload Files and Parse in Asynchronous Mode in the Background
Benefits of Using our Document Parser for Complex Documents
Flexible API Pricing
Whether you are using our Java DOM parser or Python to parse the XML file, you only pay for what you use. For instance, parsing two pages may cost you two credits only, which in turn cost you $0.05 per credit. Furthermore, the pricing of credits can get as low as $0.002 if you buy more.
Secure Doc parser
Safety and security of private data are important, and a legal requirement. We know that, which is why we have hosted our API on secure Amazon AWS servers. When you connect to our APIs to upload and parse text files, PDFs, images, CSV among other files, our connection encrypts each piece of data with SSL/TLS before transmission.
Parsing Large Files with Document Parsing API
Our file parser can handle small and large files, although you need to tell the API to run large files in the background. Large files or documents with over 100 pages may time out after 25 seconds.
To avoid getting a timeout error when parsing large files in Python, Java, PHP, etc., parse them in Async mode. To do that, set “async” to “true”. Async mode parses in the background, and gives an output when parsing is complete.
How to Upload Files and Parse in Asynchronous Mode in the Background
Step 1: Request URLs Upload and Access URLs
You need a temporary URL for uploading your files, which you can get by using “/file/upload/get-presigned-url”. That will output two URLs; “presignedUrl” and “URL”, which you can use to upload and access the uploaded files respectively.
Step 2: Upload Your Files
To upload your files to “presignedUrl” that you received in Step 1, use “PUT”, and then access it using “URL” in the next step.
Step 3: Call API Method
With the file uploaded, you can call the API method with “URL” param as the input. Moreover, set “async” param to “true” to get the API method to return an output “URL” or a set of “URLs”, and corresponding “jobId”. You can use the “jobId” to check the status of that parsing job.
Step 4: Check Document Parsing Status
To check the “status” of the background job, use “/job/check” API method with “jobId” param. The API will then output its current execution status.
Step 5: Access Generated Data
Conclusion – File Parsing in Java
You may also check these guides on how to use our APIs to parse your Resume, invoices, Purchase Orders, CSVs, or scanned PDFs documents online.
Check PDF.co API functions: