文档:https://pypi.org/project/html2text/
安装:
pip install html2text
Option | Description |
---|
–version | Show program’s version number and exit |
-h, --help | Show this help message and exit |
–ignore-links | Don’t include any formatting for links |
–escape-all | Escape all special characters. Output is less readable, but avoids corner case formatting issues. |
–reference-links | Use reference links instead of links to create markdown |
–mark-code | Mark preformatted and code blocks with [code]…[/code] |
>>> import html2text
>>>
>>> print(html2text.html2text("Zed's dead baby, Zed's dead.
"))
**Zed's** dead baby, _Zed's_ dead.
>>> import html2text
>>>
>>> h = html2text.HTML2Text()
>>> # Ignore converting links from HTML
>>> h.ignore_links = True
>>> print h.handle("Hello, world!")
Hello, world!>>> print(h.handle("
Hello, world!"))Hello, world!>>> # Don't Ignore links anymore, I like links
>>> h.ignore_links = False
>>> print(h.handle("
Hello, world!"))
Hello, [world](http://earth.google.com/)!