Simon Willison on Nostr: I built a new tool: - it runs OCR against images and PDFs entirely in your browser ...
Published at
2024-03-30 18:04:07Event JSON
{
"id": "328a9927fcfed51b90548791f3ff0b41062bbd12879bce056e0ebde036255e1c",
"pubkey": "4315a187e024818492e61938093ba683dae66624d202cd43738de5b8ba198c0f",
"created_at": 1711821847,
"kind": 1,
"tags": [
[
"proxy",
"https://fedi.simonwillison.net/users/simon/statuses/112185956608395470",
"activitypub"
],
[
"L",
"pink.momostr"
],
[
"l",
"pink.momostr.activitypub:https://fedi.simonwillison.net/users/simon/statuses/112185956608395470",
"pink.momostr"
]
],
"content": "I built a new tool: https://tools.simonwillison.net/ocr - it runs OCR against images and PDFs entirely in your browser (no file upload needed) using Tesseract.js and PDF.js\n\nI wrote more about the tool and how I built it (with copious amounts of Claude 3 Opus and a little bit of ChatGPT) here: https://simonwillison.net/2024/Mar/30/ocr-pdfs-images/\nhttps://cdn.masto.host/fedisimonwillisonnet/media_attachments/files/112/185/950/303/038/617/original/496d8a233536b3e4.mp4\n",
"sig": "65ae746d650c54f70e76e280fc50b4230f0116501fdcdac2bf1243b9d5d5ca06f47f3f66659000b48572436025c2f486419ca2ebcc101f122f407ee0789313cc"
}