tabula

Extract tables from PDF files. More information: https://tabula.technology.

Extract all tables from a PDF to a CSV file:
tabula -o {{file.csv}} {{file.pdf}}
Extract all tables from a PDF to a JSON file:
tabula --format JSON -o {{file.json}} {{file.pdf}}
Extract tables from pages 1, 2, 3, and 6 of a PDF:
tabula --pages {{1-3,6}} {{file.pdf}}
Extract tables from page 1 of a PDF, guessing which portion of the page to examine:
tabula --guess --pages {{1}} {{file.pdf}}
Extract all tables from a PDF, using ruling lines to determine cell boundaries (lattice mode; replaces the deprecated `--spreadsheet`):
tabula --lattice {{file.pdf}}
Extract all tables from a PDF, using blank space to determine cell boundaries (stream mode; replaces the deprecated `--no-spreadsheet`):
tabula --stream {{file.pdf}}
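
Convert every PDF in a directory to a CSV with the same base name. This is a minimal sketch built on the `-o` example above: the helper names `csv_name_for` and `convert_all` are hypothetical (not part of tabula), and it assumes `tabula` is on your `PATH`:

```shell
# Hypothetical helper: map "report.pdf" to "report.csv".
csv_name_for() {
  printf '%s\n' "${1%.pdf}.csv"
}

# Hypothetical helper: run tabula once per PDF found in the given directory.
convert_all() {
  for pdf in "$1"/*.pdf; do
    # Skip the unexpanded glob pattern when the directory holds no PDFs.
    [ -e "$pdf" ] || continue
    tabula -o "$(csv_name_for "$pdf")" "$pdf"
  done
}
```

Call it as `convert_all {{path/to/directory}}`; each CSV is written next to its source PDF.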

License and Disclaimer

The content on this page is copyright © 2014—present the tldr-pages team and contributors.
This page is used with permission under the Creative Commons Attribution 4.0 International License.

While we do attempt to make sure content is accurate, there isn't a warranty of any kind.