[BangPypers] HTML to doc conversion

Anand B Pillai anandpillai at letterboxes.org
Wed Sep 30 10:45:55 CEST 2015


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Wednesday 30 September 2015 01:56 PM, Sandeep wrote:
> Hello Bangpypers,
> 
> I am working on a project where in I need to convert a HTML page
> which has the formatting information provided by CSS and also
> images in it into a word document.

This is not a Python specific problem.
> 
> I have searched for such packages but was not able to find a
> solution

Try pandoc [1]. See [2] for examples.

There is also a very nice package in PHP called dompdf [3] which
converts complex HTML with CSS etc into nicely formatted PDF. I have
used it to develop training material successfully taking the path from
markdown->html->pdf .

> 
> Can someone advise me on this problem
> 
> Thanks in advance Sandeep 
> _______________________________________________ BangPypers mailing
> list BangPypers at python.org 
> https://mail.python.org/mailman/listinfo/bangpypers
> 

[1]: http://pandoc.org
[2]: http://pandoc.org/demos.html
[3]: https://github.com/dompdf/dompdf

- -- 
Regards,

- --Anand

- ----------------------------
Software Architect/Consultant
anandpillai at letterboxes.org

Cell: +919880078014
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iQEcBAEBAgAGBQJWC6E+AAoJEHKU2n17CpvDetYIAKPpDbs0W+OUhhxu6rC7Fuw/
fMgf8UJQcX1ar1b/ytLU7KQDFaG3iqtZ1JQkVNBAZEp/w1b+asoEnVqvOBMGkOQs
lGW8JPhcpZQVDEdhEPkhgXE1FweHE7V/ud+rg0KMmrgdq11D0KrO+pbBxNbnXEtd
QpzyqY7ZpSmttbOuuC/aG+dwEy9PTCCO5/HKd9ZsL8wXKBcj/8azhSAQ/jTDbCNe
CENAGUgnS/Ijnz1XWZOg5dS59pTdDPRcI6aIj64AM62xpOM/KC8e/7Qu5obJb1WC
cfWuUhduDF7oqmIGd5QpbUwZwomp4K5aeVen9TdyUXYLpZbRxEvyz+WzBaoC0js=
=pM1x
-----END PGP SIGNATURE-----


More information about the BangPypers mailing list