pup
Command-line HTML parsing tool. More information: https://github.com/ericchiang/pup.
- 
Transform a raw HTML file into a cleaned, indented, and colored format:
 cat {{index.html}} | pup --color
- 
Filter HTML by element tag name:
 cat {{index.html}} | pup '{{tag}}'
- 
Filter HTML by id:
 cat {{index.html}} | pup '{{div#id}}'
- 
Filter HTML by attribute value:
 cat {{index.html}} | pup '{{input[type="text"]}}'
- 
Print all text from the filtered HTML elements and their children:
 cat {{index.html}} | pup '{{div}} text{}'
- 
Print HTML as JSON:
 cat {{index.html}} | pup '{{div}} json{}'
License and Disclaimer
The content on this page is copyright © 2014—present the tldr-pages team and contributors.This page is used with permission under Creative Commons Attribution 4.0 International License.
While we do attempt to make sure content is accurate, there isn't a warranty of any kind.