PDF to Word. Text extraction, honestly.

Extract readable text from any PDF into an editable .docx file. Formatting is not preserved — this is a text extraction tool, not a layout converter.

Text extraction only — layout, tables, images, and formatting are not preserved. The output is plain editable text in a .docx file. For layout-accurate conversion, a server-based service is required.
Drop your PDF here
or click to browse files from your device
— text-based PDFs only · no scanned images
✓ 100% private · ✓ no uploads · ✓ free forever
— How it works
Step 1
Add your PDF
Drop any PDF. Text is extracted page by page using PDF.js — nothing is uploaded.
Step 2
Text is extracted
All readable text is pulled from the PDF in reading order. Formatting is not preserved.
Step 3
Download a DOCX file
The text is packaged into a .docx file you can open in Word, Google Docs, or LibreOffice.

What this tool does — and what it doesn't

This tool extracts the readable text from your PDF and packages it into a .docx file. It is a text extraction tool, not a layout conversion tool. The output document will contain your text in a clean, editable format — but it will not preserve the original layout, columns, tables, images, headers, footers, or precise font styling from the PDF.

If your PDF is a scanned image (a photograph of a page with no embedded text), this tool will produce an empty document. Scanned PDFs require OCR (optical character recognition) to extract text, which is not performed here.

For PDFs that are mostly text — reports, articles, contracts, emails exported to PDF — this tool works well and produces a clean, editable DOCX. For complex layouts like brochures, multi-column documents, or heavily formatted materials, a server-side conversion service will produce better results.

Why full PDF-to-Word conversion requires a server

A PDF stores content as positioned elements — text glyphs placed at precise X/Y coordinates, not as sentences or paragraphs. To reconstruct a Word document that looks like the original, software must analyse the spatial relationships between those elements and infer structure: which lines belong to the same paragraph, which elements form a table, which text is a heading versus body copy. This analysis requires substantial computation — typically performed by tools like LibreOffice or Adobe's conversion engine running on a server.

No JavaScript library currently performs this reconstruction accurately enough to produce reliable results in the browser. This tool does what is honestly achievable client-side: it extracts the text content so you can edit it, without overpromising on layout fidelity.

— FAQ
Will the formatting from my PDF be preserved?
No. This tool extracts text only. The output DOCX will contain your text in a plain format — no columns, tables, images, or precise font styling. If layout preservation matters, you need a server-side conversion service.
Will it work on scanned PDFs?
No. Scanned PDFs are images of pages — they contain no embedded text for the tool to extract. The output will be empty or minimal. You need an OCR tool first to make the text machine-readable.
What PDFs work best?
PDFs that were originally created digitally — exported from Word, Google Docs, or other applications — contain embedded text and work well. The more text-focused the document (reports, contracts, articles), the better the result.
Is my file uploaded to a server?
No. Text extraction runs in your browser using PDF.js. Your file never leaves your device.
Can I edit the output in Google Docs?
Yes. The output is a standard .docx file that can be opened in Microsoft Word, Google Docs (via upload), LibreOffice, or any other word processor.
— Ready when you are

Open the toolkit. Keep your files.

Browse all tools