In this tutorial, we will show you how to extract an email address from a PDF in JavaScript using the Web API. The image below shows the source PDF and the extracted email address output in JSON format.

Input PDF and Extracted Email Address Output

Step 1 – Source Code

To begin, open your favorite editor and save the following files:

Grab your copy of the sample PDF here.

Step 2 – Install Request

We will use the request module to upload the file. To install it, run npm install request in your terminal.

Step 3 – Add API Key

On line 25, enter your API key. You can find your API key in the dashboard here.

Enter API Key
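
Pasting the key directly into the source file is fine for a quick test, but it is easy to commit it by accident. As a minimal sketch of one alternative, the helper below reads the key from an environment variable. Both the function name getApiKey and the variable name DOC_PARSER_API_KEY are assumptions made for illustration and are not part of the sample code.

```javascript
// Hypothetical helper: read the API key from the environment instead of
// hard-coding it. The variable name DOC_PARSER_API_KEY is an assumption.
function getApiKey(env = process.env) {
  const key = (env.DOC_PARSER_API_KEY || "").trim();
  if (!key) {
    throw new Error("Set DOC_PARSER_API_KEY before running the program.");
  }
  return key;
}
```

With this in place, the value assigned on line 25 could come from getApiKey() rather than a string literal.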

Step 4 – Source and Destination Files

Next, set the source file path on line 28 and the destination file name on line 32. Besides JSON, you can also extract the text in CSV and XML formats.

Source File Path And Destination Filename
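
If you switch between output formats often, the destination file name can be derived from the source path rather than typed by hand. The following is only a sketch: the helper name destinationFor and the format-to-extension mapping are assumptions for illustration, not part of the sample code.

```javascript
const path = require("path");

// Output formats mentioned in this step; the extension mapping is illustrative.
const EXTENSIONS = { JSON: ".json", CSV: ".csv", XML: ".xml" };

// Hypothetical helper: derive a destination file name from the source PDF path.
function destinationFor(sourceFile, format = "JSON") {
  const ext = EXTENSIONS[format.toUpperCase()];
  if (!ext) {
    throw new Error(`Unsupported output format: ${format}`);
  }
  // Strip the directory and the original extension, then append the new one.
  const base = path.basename(sourceFile, path.extname(sourceFile));
  return `./${base}${ext}`;
}
```

The returned string could then be used as the destination file name on line 32.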

Step 5 – Add Template

On line 109, add the template name. The Document Parser supports templates in both JSON and YML formats.

For more information about Document Parser Template, check out this page.

Add Template
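
Because a malformed template is an easy mistake to make, it can help to check the file locally before uploading it. The sketch below is an assumption-laden illustration: loadTemplate is a hypothetical helper, it decides the format from the file extension only, and it validates JSON templates but not YML ones, since Node.js has no built-in YAML parser.

```javascript
const fs = require("fs");
const path = require("path");

// Hypothetical helper: read a template file and report its format.
function loadTemplate(templatePath) {
  const ext = path.extname(templatePath).toLowerCase();
  const text = fs.readFileSync(templatePath, "utf8");

  if (ext === ".json") {
    JSON.parse(text); // throws a SyntaxError if the template is not valid JSON
    return { format: "json", text };
  }
  if (ext === ".yml" || ext === ".yaml") {
    // Returned as-is: the content is not validated in this sketch.
    return { format: "yml", text };
  }
  throw new Error(`Unsupported template extension: ${ext}`);
}
```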

Step 6 – Run Program

To run the program, type node app.js in the terminal.
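
Once the program finishes, you may want a quick sanity check that the destination file really contains an email address. The sketch below makes no assumption about the layout of the JSON output: it walks whatever structure it is given and collects email-like strings with a deliberately loose regular expression. This is a plain regex check for verification only, not how the Document Parser itself works, and findEmails is a hypothetical helper name.

```javascript
// Simple pattern for email-like strings; deliberately loose, not RFC 5322.
const EMAIL_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;

// Walk any parsed JSON value and collect email-like substrings from its strings.
function findEmails(value, found = []) {
  if (typeof value === "string") {
    found.push(...(value.match(EMAIL_PATTERN) || []));
  } else if (value !== null && typeof value === "object") {
    // Object.values works for both arrays and plain objects.
    for (const child of Object.values(value)) {
      findEmails(child, found);
    }
  }
  return found;
}
```

To use it, read the destination file you set in Step 4, pass the result of JSON.parse to findEmails, and print the returned array.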

Step 7 – Extract Email Address Demo

Here’s a quick demo of parsing an email address from a PDF, from start to finish.

Extract Email Address Demo

In this tutorial, you learned how to extract an email address from a PDF in JavaScript using the Web API. You also learned how to set up the environment so the program runs successfully, and how the Document Parser uses a template to parse text in a PDF.