How to convert PDF to HTML from URL for PDF to HTML API in PowerShell and PDF.co Web API
How to convert PDF to HTML from URL for PDF to HTML API in PowerShell: Step By Step Tutorial
Writing of the code to convert PDF to HTML from URL in PowerShell can be done by developers of any level using PDF.co Web API. PDF to HTML API in PowerShell can be implemented with PDF.co Web API. PDF.co Web API is the Rest API that provides set of data extraction functions, tools for documents manipulation, splitting and merging of pdf files. Includes built-in OCR, images recognition, can generate and read barcodes from images, scans and pdf.
Fast application programming interfaces of PDF.co Web API for PowerShell plus the instruction and the code below will help to learn how to convert PDF to HTML from URL. Follow the instruction and copy – paste code for PowerShell into your project’s code editor. Tutorials are available along with installed PDF.co Web API if you’d like to dive deeper into the topic and the details of the API.
Our website provides free trial version of PDF.co Web API that includes source code samples to help with your PowerShell project.
On-demand (REST Web API) version:
Web API (on-demand version)
On-premise offline SDK for Windows:
60 Day Free Trial (on-premise)
ConvertPdfToHtmlFromUrl.ps1
# The authentication key (API Key). # Get your own by registering at https://app.pdf.co/documentation/api $API_KEY = "***********************************" # Direct URL of source PDF file. $SourceFileUrl = "https://bytescout-com.s3.amazonaws.com/files/demo-files/cloud-api/pdf-to-html/sample.pdf" # Comma-separated list of page indices (or ranges) to process. Leave empty for all pages. Example: '0,2-5,7-'. $Pages = "" # PDF document password. Leave empty for unprotected documents. $Password = "" # Destination HTML file name $DestinationFile = ".\result.html" # Set to $true to get simplified HTML without CSS. Default is the rich HTML keeping the document design. $PlainHtml = $false # Set to $true if your document has the column layout like a newspaper. $ColumnLayout = $false # Prepare URL for `PDF To HTML` API call $query = "https://api.pdf.co/v1/pdf/convert/to/html" # Prepare request body (will be auto-converted to JSON by Invoke-RestMethod) # See documentation: https://apidocs.pdf.co $body = @{ "name" = $(Split-Path $DestinationFile -Leaf) "password" = $Password "pages" = $Pages "simple" = $PlainHtml "columns" = $ColumnLayout "url" = $SourceFileUrl } | ConvertTo-Json try { # Execute request $response = Invoke-WebRequest -Method Post -Headers @{ "x-api-key" = $API_KEY; "Content-Type" = "application/json" } -Body $body -Uri $query $jsonResponse = $response.Content | ConvertFrom-Json if ($jsonResponse.error -eq $false) { # Get URL of generated HTML file $resultFileUrl = $jsonResponse.url; # Download HTML file Invoke-WebRequest -Headers @{ "x-api-key" = $API_KEY } -OutFile $DestinationFile -Uri $resultFileUrl Write-Host "Generated HTML file saved as `"$($DestinationFile)`" file." } else { # Display service reported error Write-Host $jsonResponse.message } } catch { # Display request error Write-Host $_.Exception }
run.bat
@echo off powershell -NoProfile -ExecutionPolicy Bypass -Command "& .\ConvertPdfToHtmlFromUrl.ps1" echo Script finished with errorlevel=%errorlevel% pause
VIDEO
ON-PREMISE OFFLINE SDK
See also:
ON-DEMAND REST WEB API
Get Your API Key
See also:
PDF-co-Web-API-PowerShell-Convert-PDF-To-HTML-From-URL.pdf