Document Processing
Process contracts, invoices, and reports without uploading them anywhere. Built-in skills extract, merge, and convert PDF, Word, Excel, and PowerPoint files on your machine.
Impact
What changes
Without Lapu AI
A team lead spends 30 minutes uploading PDFs to an online extractor, copying text, pasting into a Word doc, and reformatting. Sensitive contracts go through a third-party web tool.
With Lapu AI
Lapu AI extracts text from all PDFs, merges the content, and generates the summary document — all locally, in under 5 minutes. No uploads, no third-party tools.
The challenge
Processing documents — extracting text from PDFs, converting formats, merging files — usually means uploading to a web service, installing single-purpose tools, or writing custom scripts. For sensitive documents like contracts, financial reports, or HR files, uploading to third-party services is a non-starter.
How Lapu AI solves this
Lapu AI includes built-in document skills for PDF, DOCX, XLSX, and PPTX files. The agent can extract text from PDFs, merge multiple PDFs into one, create and edit Word documents, work with spreadsheet data, and handle presentations. These skills run as typed operations through the Skills Runtime — no external services, no file uploads, no API keys.
Document processing runs locally through built-in skills. Files are not sent to third-party services.
Workflow
How it works
Point the agent at your documents
Tell Lapu AI which files to process. The agent reads the file metadata and uses the appropriate skill — PDF, DOCX, XLSX, or PPTX — based on the file type.
Describe the operation
Ask for what you need in plain language: 'Extract the text from these 5 invoices and combine them into a single summary document.' The agent activates the right skill and runs the operation.
Review and save the output
The agent produces the result — extracted text, merged PDF, new DOCX, or reformatted data — and writes it to your chosen location. You approve the file write before it happens.
Try it yourself
What you would type
Copy any of these into Lapu AI to get started immediately.
>Extract the text from all PDF files in ~/invoices/ and save each as a .txt file with the same name.
>Merge these 3 PDF contracts into a single document in the order they are listed.
>Read the data from quarterly-report.xlsx and create a summary document in Word format.
Ready to try this workflow?
Download Lapu AI and run it on your own machine. Free to start.
Download for freeFAQ
Common questions
Which document formats are supported?
Lapu AI has built-in skills for PDF (text extraction and merging), DOCX (creation and text extraction), XLSX (spreadsheet operations), and PPTX (presentation operations). For other formats, the agent can use shell tools or Python scripts via sandbox execution.
Are my documents sent to the cloud?
Document processing runs locally through built-in skills. The files themselves are not uploaded anywhere. When the agent needs AI reasoning (e.g., to summarize extracted text), relevant context is sent to AI model providers — but the raw documents stay on your machine.
Can it handle scanned PDFs?
The built-in PDF skill extracts text from text-based PDFs. For scanned/image-based PDFs, the agent can use OCR tools if they are installed on your machine (like Tesseract), running them via the Shell tool.
Explore more
Related use cases
Desktop Automation
Stop copy-pasting between apps. The agent sees your screen, clicks buttons, fills forms, and moves data between applications — so you do not have to.
See how it worksDataData Processing
Describe the transformation you need in plain English. The agent writes and runs the Python script, shell pipeline, or Node job — you just approve and get clean output.
See how it works
