dosya.io

extract data from PDF files

What is it?

dosya.io is a web service (API) which lets you extract data from PDF files.

How it works

Make a POST request with JSON paylod. Give coordinates, get data. This is an example payload. You can play with it.
Check out the sample PDF used on this test.

{ "url": "https://s3.amazonaws.com/dosya.io/sample.pdf", "pages":{ "1": { "address": { "area": { "x": 35, "y": 150, "width": 250, "height": 30 }}, "email": { "area": { "x": 60, "y": 180, "width": 250, "height": 10 } }, "customer": { "area": { "x": 343, "y": 210, "width": 100, "height": 30 } } } } }

Interested?

Join early testers, leave your email address, get your API key!