Giter VIP home page Giter VIP logo

html-to-docx's Introduction

html-to-docx

NPM Version

html-to-docx is a js library for converting HTML documents to DOCX format supported by Microsoft Word 2007+, LibreOffice Writer, Google Docs, WPS Writer etc.

It was inspired by html-docx-js project but mitigates the problem of documents generated being non-compatiable with word processors like Google Docs and libreOffice Writer that doesn't support altchunks feature.

html-to-docx earlier used to use libtidy to clean up the html before parsing, but had to remove it since it was causing so many dependency issues due to node-gyp.

Disclaimer

Even though there is an instance of html-to-docx running in production, please ensure that it covers all the cases that you might be encountering usually, since this is not a complete solution.

Currently it doesn't work with browser directly, but it was tested against React.

Installation

Use the npm to install foobar.

npm install html-to-docx

Usage

await HTMLtoDOCX(htmlString, headerHTMLString, documentOptions, footerHTMLString)

full fledged examples can be found under example/

Parameters

  • htmlString <String> clean html string equivalent of document content.
  • headerHTMLString <String> clean html string equivalent of header. Defaults to <p></p> if header flag is true.
  • documentOptions <?Object>
    • orientation <"portrait"|"landscape"> defines the general orientation of the document. Defaults to portrait.
    • pageSize <?Object> Defaults to U.S. letter portrait orientation.
      • width <Number> width of the page for all pages in this section in TWIP. Defaults to 12240. Maximum 31680. Supports equivalent measurement in pixel, cm or inch.
      • height <Number> height of the page for all pages in this section in TWIP. Defaults to 15840. Maximum 31680. Supports equivalent measurement in pixel, cm or inch.
    • margins <?Object>
      • top <Number> distance between the top of the text margins for the main document and the top of the page for all pages in this section in TWIP. Defaults to 1440. Supports equivalent measurement in pixel, cm or inch.
      • right <Number> distance between the right edge of the page and the right edge of the text extents for this document in TWIP. Defaults to 1800. Supports equivalent measurement in pixel, cm or inch.
      • bottom <Number> distance between the bottom of text margins for the document and the bottom of the page in TWIP. Defaults to 1440. Supports equivalent measurement in pixel, cm or inch.
      • left <Number> distance between the left edge of the page and the left edge of the text extents for this document in TWIP. Defaults to 1800. Supports equivalent measurement in pixel, cm or inch.
      • header <Number> distance from the top edge of the page to the top edge of the header in TWIP. Defaults to 720. Supports equivalent measurement in pixel, cm or inch.
      • footer <Number> distance from the bottom edge of the page to the bottom edge of the footer in TWIP. Defaults to 720. Supports equivalent measurement in pixel, cm or inch.
      • gutter <Number> amount of extra space added to the specified margin, above any existing margin values. This setting is typically used when a document is being created for binding in TWIP. Defaults to 0. Supports equivalent measurement in pixel, cm or inch.
    • title <?String> title of the document.
    • subject <?String> subject of the document.
    • creator <?String> creator of the document. Defaults to html-to-docx
    • keywords <?Array<String>> keywords associated with the document. Defaults to ['html-to-docx'].
    • description <?String> description of the document.
    • lastModifiedBy <?String> last modifier of the document. Defaults to html-to-docx.
    • revision <?Number> revision of the document. Defaults to 1.
    • createdAt <?Date> time of creation of the document. Defaults to current time.
    • modifiedAt <?Date> time of last modification of the document. Defaults to current time.
    • headerType <"default"|"first"|"even"> type of header. Defaults to default.
    • header <?Boolean> flag to enable header. Defaults to false.
    • footerType <"default"|"first"|"even"> type of footer. Defaults to default.
    • footer <?Boolean> flag to enable footer. Defaults to false.
    • font <?String> font name to be used. Defaults to Times New Roman.
    • fontSize <?Number> size of font in HIP(Half of point). Defaults to 22. Supports equivalent measure in pt.
    • complexScriptFontSize <?Number> size of complex script font in HIP(Half of point). Defaults to 22. Supports equivalent measure in pt.
    • table <?Object>
      • row <?Object>
        • cantSplit <?Boolean> flag to allow table row to split across pages. Defaults to false.
    • pageNumber <?Boolean> flag to enable page number in footer. Defaults to false. Page number works only if footer flag is set as true.
    • skipFirstHeaderFooter <?Boolean> flag to skip first page header and footer. Defaults to false.
    • lineNumber <?Boolean> flag to enable line numbering. Defaults to false.
    • lineNumberOptions <?Object>
      • start <Number> start of the numbering - 1. Defaults to 0.
      • countBy <Number> skip numbering in how many lines in between + 1. Defaults to 1.
      • restart <"continuous"|"newPage"|"newSection"> numbering restart strategy. Defaults to continuous.
    • numbering <?Object>
      • defaultOrderedListStyleType <?String> default ordered list style type. Defaults to decimal.
    • decodeUnicode <?Boolean> flag to enable unicode decoding of header, body and footer. Defaults to false.
    • lang <?String> language localization code for spell checker to work properly. Defaults to en-US.
  • footerHTMLString <String> clean html string equivalent of footer. Defaults to <p></p> if footer flag is true.

Returns

<Promise<Buffer|Blob>>

Notes

Currently page break can be implemented by having div with classname "page-break" or style "page-break-after" despite the values of the "page-break-after", and contents inside the div element will be ignored. <div class="page-break" style="page-break-after: always;"></div>

CSS list-style-type for <ol> element are now supported. Just do something like this in the HTML:

  <ol style="list-style-type:lower-alpha;">
    <li>List item</li>
    ...
  </ol>

List of supported list-style-types:

  • upper-alpha, will result in A. List item
  • lower-alpha, will result in a. List item
  • upper-roman, will result in I. List item
  • lower-roman, will result in i. List item
  • lower-alpha-bracket-end, will result in a) List item
  • decimal-bracket-end, will result in 1) List item
  • decimal-bracket, will result in (1) List item
  • decimal, (the default) will result in 1. List item

Also you could add attribute data-start="n" to start the numbering from the n-th.

<ol data-start="2"> will start the numbering from ( B. b. II. ii. 2. )

Font family doesnt work consistently for all word processor softwares

  • Word Desktop work as intended
  • LibreOffice ignores the fontTable.xml file, and finds a font by itself
  • Word Online ignores the fontTable.xml file, and finds closest font in their font library

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to branch new branches off of develop for contribution.

Support

"Buy Me A Coffee"

License

MIT

Contributors

Made with contrib.rocks.

html-to-docx's People

Contributors

amrita-syn avatar braun1928 avatar charuthab avatar erenard avatar hlerebours avatar juralio-james avatar kurukururuu avatar nicolasiscoding avatar privateomega avatar robinminso avatar taro-shono avatar tasola avatar zedtux avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

html-to-docx's Issues

How to use with webpack / react

Hi! I'm seeing Uncaught TypeError: fs.readFileSync is not a function. Any suggestions? Have tried importing from dist in addition to the normal package.

Export CSS style

Hello, @privateOmega

I would like to thank you for the last issue you resolved to me.

I have a question how i can also export the CSS style.

What should i add to also export CSS style ?


import HTMLtoDOCX from 'html-to-docx';
import { save } from 'save-file';
import "./App.css";

const htmlString = `<!DOCTYPE html>
<html lang="en">
    <head>
        <meta charset="UTF-8" />
        <title>Document</title>
    </head>
    <body>
        <div className="style">    
        <p> Hello </p>
        </div>
    </body>
</html>`;

const downloadFile = async () => {
  const fileBuffer = await HTMLtoDOCX(htmlString, null, {
    table: { row: { cantSplit: true } },
    footer: true,
    pageNumber: true,
  });

  await save(fileBuffer, 'trial.docx')
};

function App() {
  return (
    <div className="App">
      <header className="App-header">
        <button onClick={downloadFile}>HELLO</button>
      </header>
    </div>
  );
}

export default App;


When margins are set - MS Word Error

Hi, default margins are bad for me, so i use this

margins: {
  top: '1cm',
  right: '1.5cm',
  bottom: '2cm',
  left: '2.5cm',
},

but
MS Word shows an Error

image
it is said: "Open the file with reserve copies converter / file is corrupted"

only Google Docs could open
help pls!

How to set custom list style/numbering

I have a multi-level list, I want to set custom numbering to each level, can you tell me how that can be done ?
image
This is what I get from library if I add a span in each list to display number.
Thanks :)

Is it possible to provide template.docx like pandoc?

This project is fantastic. It is small and simple. I used to use pandoc to convert html to docx. In Pandoc, I can provide a template.docx with some default format for title(Font size 18, blue), author(Font size 10, black), headings, etc.. After submitting a new html file, the output docx will be automatically formated. Is it possible for this project to provide this kind of function?

Problems when deploying to Heroku

We are getting the following error when deploying the app to Heroku. It works fine in development (Amazon Cloud 9).

Full log from Heroku:

-----> Node.js app detected
       
-----> Creating runtime environment
       
       NPM_CONFIG_LOGLEVEL=error
       NODE_ENV=production
       NODE_MODULES_CACHE=true
       NODE_VERBOSE=false
       
-----> Installing binaries
       engines.node (package.json):  unspecified
       engines.npm (package.json):   unspecified (use default)
       
       Resolving node version 12.x...
       Downloading and installing node 12.20.0...
       Using default npm version: 6.14.8
       
-----> Restoring cache
       Cached directories were not restored due to a change in version of node, npm, yarn or stack
       Module installation may take longer for this build
       
-----> Installing dependencies
       Installing node modules
       npm ERR! code ENOENT
       npm ERR! syscall open
       npm ERR! path /tmp/build_4xxx1_/node_modules/html-to-docx/package.json
       npm ERR! errno -2
       npm ERR! enoent ENOENT: no such file or directory, open '/tmp/build_4xxx1_/node_modules/html-to-docx/package.json'
       npm ERR! enoent This is related to npm not being able to find a file.
       npm ERR! enoent 
       
       npm ERR! A complete log of this run can be found in:
       npm ERR!     /tmp/npmcache.nxxxp/_logs/2020-12-xxx7Z-debug.log
-----> Build failed
       
       We're sorry this build is failing! You can troubleshoot common issues here:
       https://devcenter.heroku.com/articles/troubleshooting-node-deploys
       
       Some possible problems:
       
       - Node version not specified in package.json
         https://devcenter.heroku.com/articles/nodejs-support#specifying-a-node-js-version
       
       Love,
       Heroku
       
 !     Push rejected, failed to compile Node.js app.
 !     Push failed

Broken list text

  • List text
  • gets broken into multiple paragraph if it contains any formatting element due to latest version changes.

    Related to new list parser built for supporting nested lists.

    Image Issue with Tables

    Hello Team

    I hope you are doing well. I have used your NPM library html-to-docx for converting html templates to docx files. I am facing some issues with inserting images into them.
    I have tried multiple ways of inserting images inside html.

    1. Absolute URL - Links to an external image that is hosted on another website
    2. Relative URL - Links to an image that is hosted within the website.
      The above two methods didn't work for me. Then I tried with the base64 image data source and it worked for me.
      Now, when I am using base64 images within html template, I am facing some issues with the tables. The table border disappears and I am not getting the docx generated correctly.
      While if I do not insert any image to the html template, then the tables are working fine. I am using the official MS office to view the docs.

    I have shared two sample docx files one with image and one without image.

    The sample source code to insert an image -
    <table style="width: 100%; border: none; border-spacing: 0; border-collapse: collapse;" border="0" cellspacing="0" cellpadding="0"> <tr style="border-collapse: collapse; border-spacing: 0;"> <td style="padding: 0px; margin: 0px; text-align: left; border-collapse: collapse; border-spacing: 0;" align="left"> <h1 style="color: #161950; font-size: 35px; font-weight: bold;">Tesla, Inc.</h1> <p style="color: #6565aa; font-size: 18px; font-family: 'Helvetica'; line-height: 17px; margin: 5px 0 20px; letter-spacing: 0px;">Company Snapshot</p> </td> <td style="padding: 0px; margin: 0px; text-align: right; border-collapse: collapse; border-spacing: 0;" align="right"> <img src="" /> </tr> </table>

    Can you please look into this issue and why there are table border issues when I am trying to insert an image in the html template.

    Looking forward to hearing from you.

    Snapshot-without-image.docx
    Snapshot-with-image.docx

    Images don't render when inside certain tags

    Hello, I'm very excited to find this project and see that it gets around the altchunks problem. Should this library support the output of base64 images that are in the HTML? I gave it a quick test and it did not appear to output any images. I just wanted to make sure I'm not doing something wrong. If we do not support images yet here, consider this a feature request and/or offer of help (if you could point me in the right direction to the relevant code).

    Div style is not reflecting

    Hi,
    I was trying to align text to the right in a div but looks like styling from <div> is not reflecting.

    I used the example you have

    <div style="text-align:right;background-color:#f3f3f3">
                <p>Taken from wikipedia</p>
                <img
                    src="data:image/png;base64, iVBORw0KGgoAAAANSUhEUgAAAAUAAAAFCAYAAACNbyblAAAAHElEQVQI12P4//8/w38GIAXDIBKE0DHxgljNBAAO9TXL0Y4OHwAAAABJRU5ErkJggg=="
                    alt="Red dot"
                />
            </div>
    

    Expected result:
    Taken from wikipedia and the image should be on the right side.

    Actual result:
    No impact.

    I tried to put the styling in the <p> and its working fine for the text.

    Ex:

    <div>
                <p  style="text-align:right;background-color:#f3f3f3">Taken from wikipedia</p>
                <img
                    src="data:image/png;base64, iVBORw0KGgoAAAANSUhEUgAAAAUAAAAFCAYAAACNbyblAAAAHElEQVQI12P4//8/w38GIAXDIBKE0DHxgljNBAAO9TXL0Y4OHwAAAABJRU5ErkJggg=="
                    alt="Red dot"
                />
            </div>
    

    Ideally I would like to have one style for the contents inside the div.

    Will this project work in browser?

    Searched NPM and found this project. Tried in browser but gives me error about fs. Is this project for node only or I miss some settings...

    Skip header/footer or different header/footer on first page

    Hello,
    First of all, it is a great package but I ran into one issue when I am setting headerType to "First or even", it is not working at all. It only works when set to "default" option. Also is there any setting where we can skip header/footer on the first page or show different header/footer on first page.
    I have installed this package and it contains file in minified version so I am not able to find the code that you have added in PR.

    Any help would be appreciated. Thanks

    node-pre-gyp error

    Does anyone know what is causing this error?

    Only seems to be when I try to install this package, all others are fine.

    Output below:

    node-pre-gyp install --fallback-to-build

    node-pre-gyp WARN Using needle for node-pre-gyp https download
    node-pre-gyp WARN Tried to download(404): https://github.com/gagern/node-libtidy/releases/download/v0.4.1/node-v72-darwin-x64.tar.gz
    node-pre-gyp WARN Pre-built binaries not found for [email protected] and [email protected] (node-v72 ABI, unknown) (falling back to source compile with node-gyp)
    No receipt for 'com.apple.pkg.CLTools_Executables' found at '/'.

    No receipt for 'com.apple.pkg.DeveloperToolsCLILeo' found at '/'.

    No receipt for 'com.apple.pkg.DeveloperToolsCLI' found at '/'.

    gyp: No Xcode or CLT version detected!
    gyp ERR! configure error
    gyp ERR! stack Error: gyp failed with exit code: 1
    gyp ERR! stack at ChildProcess.onCpExit (/usr/local/lib/node_modules/npm/node_modules/node-gyp/lib/configure.js:351:16)
    gyp ERR! stack at ChildProcess.emit (events.js:315:20)
    gyp ERR! stack at Process.ChildProcess._handle.onexit (internal/child_process.js:275:12)
    gyp ERR! System Darwin 19.6.0
    gyp ERR! command "/usr/local/bin/node" "/usr/local/lib/node_modules/npm/node_modules/node-gyp/bin/node-gyp.js" "configure" "--fallback-to-build" "--module=/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/lib/tidy.node" "--module_name=tidy" "--module_path=/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/lib" "--napi_version=6" "--node_abi_napi=napi" "--napi_build_version=0" "--node_napi_label=node-v72"
    gyp ERR! cwd /Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated
    gyp ERR! node -v v12.18.3
    gyp ERR! node-gyp -v v5.1.0
    gyp ERR! not ok
    node-pre-gyp ERR! build error
    node-pre-gyp ERR! stack Error: Failed to execute '/usr/local/bin/node /usr/local/lib/node_modules/npm/node_modules/node-gyp/bin/node-gyp.js configure --fallback-to-build --module=/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/lib/tidy.node --module_name=tidy --module_path=/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/lib --napi_version=6 --node_abi_napi=napi --napi_build_version=0 --node_napi_label=node-v72' (1)
    node-pre-gyp ERR! stack at ChildProcess. (/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/node_modules/node-pre-gyp/lib/util/compile.js:83:29)
    node-pre-gyp ERR! stack at ChildProcess.emit (events.js:315:20)
    node-pre-gyp ERR! stack at maybeClose (internal/child_process.js:1021:16)
    node-pre-gyp ERR! stack at Process.ChildProcess._handle.onexit (internal/child_process.js:286:5)
    node-pre-gyp ERR! System Darwin 19.6.0
    node-pre-gyp ERR! command "/usr/local/bin/node" "/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/node_modules/.bin/node-pre-gyp" "install" "--fallback-to-build"
    node-pre-gyp ERR! cwd /Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated
    node-pre-gyp ERR! node -v v12.18.3
    node-pre-gyp ERR! node-pre-gyp -v v0.14.0
    node-pre-gyp ERR! not ok
    Failed to execute '/usr/local/bin/node /usr/local/lib/node_modules/npm/node_modules/node-gyp/bin/node-gyp.js configure --fallback-to-build --module=/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/lib/tidy.node --module_name=tidy --module_path=/Users/antnewman/Desktop/Projects/doc-translator/node_modules/libtidy-updated/lib --napi_version=6 --node_abi_napi=napi --napi_build_version=0 --node_napi_label=node-v72' (1)
    npm WARN [email protected] No description
    npm WARN [email protected] No repository field.

    npm ERR! code ELIFECYCLE
    npm ERR! errno 1
    npm ERR! [email protected] install: node-pre-gyp install --fallback-to-build
    npm ERR! Exit status 1
    npm ERR!
    npm ERR! Failed at the [email protected] install script.
    npm ERR! This is probably not a problem with npm. There is likely additional logging output above.

    Exporting HTML with MathML included

    I am trying to export HTML which includes MathML elements but the equations are not appearing in the docx file.

    <math xmlns="http://www.w3.org/1998/Math/MathML">...</math>

    The mathML elements are enclosed by <p>'s But that didn't seem to make a difference.

    Google Docs does not open the generated file

    Hi! Thank you for this project and keep up the great work.

    I generated a hello world docx file with this library but Google Docs simply fails to open it.

    image

    If I try to open it with Google Docs, it shows a blank page.

    Html with <script> is not working.

    I am tying to generate word doc from html string. My html string contains script tag. html-to-docx is not rendering the html with the executed script. I am providing the sample for reference below,

    let str2 = '<!DOCTYPE html><html><body><h1>The script element</h1><p id="demo"></p><script>document.getElementById("demo").innerHTML = "Hello JavaScript!";</script></body></html>'
    const fileBuffer = await HTMLtoDOCX(str2, null, {});
    fs.writeFile(filePath, fileBuffer, (error) => {
         if (error) {
             console.log('Docx file creation failed');
          }
          console.log('Docx file created successfully');
    });
    

    How to get it working?

    Getting a strange dependency error

    Failed to compile.
    
    /Users/jvu/dev/athena/node_modules/libtidy-updated/node_modules/node-pre-gyp/lib/info.js
    Module not found: Can't resolve 'aws-sdk' in '/Users/jvu/dev/athena/node_modules/libtidy-updated/node_modules/node-pre-gyp/lib'
    

    I have 2 issues with this library in terms of dependencies. First of, working locally works just fine and works perfectly, but when injected in our build process it runs to several problems. We have a NODE application that I have this installed in which causes some conflict in Node version due to a deep dependency of node-pre-gyp which basically manages node of that particular machine. That deep dependency is derived from libtidy-updated which it seems like you use to validate and clean up HTML string. I had to fork this repo and remove the filter and it worked perfectly. So I'm unsure how you want to address this, maybe have HTML cleanup a responsibility for the user OR have it be optional. Secondly, because of these dependencies, this code DOES NOT work on browser since it's requiring node and npm. Maybe there's a different Tidy/Minifier filter you can find or maybe include detailed instructions on how to manually tidy up HTML for the user and have it work that way.

    Right to left text

    What is the best way to set up right to left text (e.g. Arabic text)?

    Leading and Trailing spaces

    Hi there,

    Thanks for the library, this is a great time saver.

    I'm running a bit into an issue. I have a paragraph <p> that gets converted properly but I would like to indent the first line of the paragraph by 3 spaces.

    My first attempt was using a margin-left which doesn't look to be supported. I then tried to go the dumb way and use cheerio to add three space characters after the <p> which works in the HTML but it looks like html-to-docx is trimming the text

    Any pointers on how to go about this?

    Thanks

    How to remove header and footer from first page of DOCX (html to docx) ?

    Hello,

    We are generating DOCX report into our application for that we used html to docx , html-docx-js node packages and it gives us expected result.
    Now, we need to add cover page into first page docx so automatically header/footer will be started from second page but we are not getting any way any attribute which help us to do same.
    Please provide some suggestion to fulfill requirement.

    Thanks,
    Jyoti

    How to remove table border?

    Hi,

    I saw you recommended using a table with a border="1" attribute #2 (comment), and was hoping I could use a table without borders, but it doesn't work. Not sure if it's a bug or just a feature that doesn't exist.

    I have the following markup

    <table class="images" width="100%" border="0" style="border:0; width:100%;">
        <tr>
            <td width="33%" style="width:33.33%">
                <img src="" width="100%" style="width:100%" />
            </td>
            <td width="33%" style="width:33.33%">
                <img src="" width="100%" style="width:100%" />
            </td>
            <td width="33%" style="width:33.33%">
                <img src="" width="100%" style="width:100%" />
            </td>
        </tr>
    </table>
    

    And get the following result:

    image

    FYI - my end goal is to have 3 images side by side and I couldn't get this working unless I used a table. The script always added paragraphs after the images even without line breaks in between them.

    Custom page size?

    Thanks for this extremely useful library!

    I think a useful (and hopefully trivial) improvement would be to support custom paper sizes as a property in the documentOptions parameter. I think the maximum page size supported my Microsoft Word is 22"x22". Perhaps instead of passing orientation, one could pass size.height and size.width in any supported unit.

    Unhandled Rejection (TypeError): fs.writeFile is not a function

    Hello,
    @privateOmega @amrita-syn

    when i try to use the library in a react js component i have this error

    Unhandled Rejection (TypeError): fs.writeFile is not a function

    When i replace const fs = require('fs'); with import fs from 'fs';
    I have this error
    Unhandled Rejection (TypeError): fs__WEBPACK_IMPORTED_MODULE_2___default.a.writeFile is not a function

    This is my code :

    import React from 'react';
    
    
    // FIXME: Incase you have the npm package
     //import HTMLtoDOCX from 'html-to-docx';
     
     import HTMLtoDOCX from 'html-to-docx';
     //import fs from 'fs';
    
     const fs = require('fs');
    
    const Export = props => {
     
    
      const filePath = "./example.docx";
       const htmlString = `<!DOCTYPE html>
       <html lang="en">
           <head>
               <meta charset="UTF-8" />
               <title>Document</title>
           </head>
           <body>
           <p>Taken from wikipedia</p>
           </body>
     </html>`;
    
    
      function exportdoc(){
        console.log(htmlString);
    
      (async () => {
        const fileBuffer = await HTMLtoDOCX(htmlString, null, {
          table: { row: { cantSplit: true } },
          footer: true,
          pageNumber: true,
        });
      
        fs.writeFile(filePath, fileBuffer, (error) => {
          if (error) {
            console.log('Docx file creation failed');
            return;
          }
          console.log('Docx file created successfully');
        });
        fs.end();
      })();
    }
        
      return (
        
        <div>
    <button onClick={exportdoc}>export</button>
    
       
        
    
         <div>
       
      
     </div>
    
        </div>
      );
    
    }
    export default Export;
    
    

    Html raw content to docx - unable to render base64 image to docx and formatting content issue

    @privateOmega , previously we used html-docx-js to generate docx.
    In which we provide htmlContent in payload and pass it to package. Please go through the attachment where you will get 3 files html-docx-js output , html-to-docx output file and html content input file. Please let us know what we are doing wrong we are expecting the same result from html-to-docx package also.
    Any help would be appreciated.

    Thank you !

    html-docx-js output.zip

    First page Blank

    It is creating docx from HTML string but making first page blank and starting with page 2 in docx. Can you please look into that ?

    typescript definition

    declare module 'html-to-docx'
    {
        interface Margins
        {
            /**
             * top <Number> distance between the top of the text margins for the main document and the top of the page for all pages in this section in TWIP.
             * Defaults to 1440. Supports equivalent measurement in pixel, cm or inch.
             */
            top: number;
    
            /**
             * right <Number> distance between the right edge of the page and the right edge of the text extents for this document in TWIP.
             * Defaults to 1800. Supports equivalent measurement in pixel, cm or inch.
             */
            right: number;
    
            /**
             * bottom <Number> distance between the bottom of text margins for the document and the bottom of the page in TWIP.
             * Defaults to 1440. Supports equivalent measurement in pixel, cm or inch.
             */
            bottom: number;
    
            /**
             * left <Number> distance between the left edge of the page and the left edge of the text extents for this document in TWIP.
             * Defaults to 1800. Supports equivalent measurement in pixel, cm or inch.
             */
            left: number;
    
            /**
             * header <Number> distance from the top edge of the page to the top edge of the header in TWIP.
             * Defaults to 720. Supports equivalent measurement in pixel, cm or inch.
             */
            header: number;
    
            /**
             * footer <Number> distance from the bottom edge of the page to the bottom edge of the footer in TWIP.
             * Defaults to 720. Supports equivalent measurement in pixel, cm or inch.
             */
            footer: number;
    
            /**
             * gutter <Number> amount of extra space added to the specified margin, above any existing margin values. This setting is typically used when a document is being created for binding in TWIP.
             * Defaults to 0. Supports equivalent measurement in pixel, cm or inch.
             */
            gutter: number;
        }
    
        interface Row
        {
            /**
             * cantSplit <?Boolean> flag to allow table row to split across pages. Defaults to false.
             */
            cantSplit?: boolean;
        }
    
        interface Table
        {
            row?: Row;
        }
    
        export interface DocumentOptions
        {
            /**
             * orientation <"portrait"|"landscape"> defines the general orientation of the document. Defaults to portrait.
             */
            orientation?: "portrait" | "landscape";
    
            margins?: Margins
    
            /**
             * title <?String> title of the document.
             */
            title?: string;
    
            /**
             * subject <?String> subject of the document.
             */
            subject?: string;
    
            /**
             * creator <?String> creator of the document. Defaults to html-to-docx
             */
            creator?: string;
    
            /**
             * keywords <?Array<String>> keywords associated with the document. Defaults to ['html-to-docx'].
             */
            keywords?: string[];
    
            /**
             * description <?String> description of the document.
             */
            description?: string;
    
            /**
             * lastModifiedBy <?String> last modifier of the document. Defaults to html-to-docx.
             */
            lastModifiedBy?: string;
    
            /**
             * revision <?Number> revision of the document. Defaults to 1.
             */
            revision?: number;
    
            /**
             * createdAt <?Date> time of creation of the document. Defaults to current time.
             */
            createdAt?: Date;
    
            /**
             * modifiedAt <?Date> time of last modification of the document. Defaults to current time.
             */
            modifiedAt?: Date;
    
            /**
             * headerType <"default"|"first"|"even"> type of header. Defaults to default.
             */
            headerType?: "default" | "first" | "even";
    
            /**
             * header <?Boolean> flag to enable header. Defaults to false.
             */
            header?: boolean;
    
            /**
             * footerType <"default"|"first"|"even"> type of footer. Defaults to default.
             */
            footerType?: "default" | "first" | "even";
    
            /**
             * footer <?Boolean> flag to enable footer. Defaults to false.
             */
            footer?: boolean;
    
            /**
             * font <?String> font name to be used. Defaults to Times New Roman.
             */
            font?: string;
    
            /**
             * fontSize <?Number> size of font in HIP(Half of point). Defaults to 22. Supports equivalent measure in pt.
             */
            fontSize?: number;
    
            /**
             * complexScriptFontSize <?Number> size of complex script font in HIP(Half of point). Defaults to 22. Supports equivalent measure in pt.
             */
            complexScriptFontSize?: number;
    
            table?: Table;
    
            /**
             * pageNumber <?Boolean> flag to enable page number in footer. Defaults to false. Page number works only if footer flag is set as true.
             */
            pageNumber?: boolean;
    
            /**
             * skipFirstHeaderFooter <?Boolean> flag to skip first page header and footer. Defaults to false.
             */
            skipFirstHeaderFooter?: boolean;
        }
    
        /**
         * @param htmlString <String> clean html string equivalent of document content.
         * @param headerHTMLString <String> clean html string equivalent of header. Defaults to <p></p> if header flag is true.
         * @param documentOptions <DocumentOptions>
         * @param footerHTMLString <String> clean html string equivalent of footer. Defaults to <p></p> if footer flag is true.
         * @constructor
         * @private
         */
       export default function HTMLtoDOCX(htmlString: string, headerHTMLString: string, documentOptions: DocumentOptions, footerHTMLString?: string): Promise<Buffer | Blob>;
    }
    
    
    

    CSS style is not being applied

    I have added some styles into the html. But generated docx file doesn't include the expected style.

    const htmlString = `<!DOCTYPE html>
    <html lang="en">
        <head>
            <meta charset="UTF-8" />
            <title>Document</title>
          <style> 
    	  #tbldemo { font-family: "Trebuchet MS", "Arial", "Helvetica", "sans-serif"; font-size: 14px; border-collapse: collapse; width: 100%;} 
    	  #tbldemo td, #tbldemo th { border: 1px solid #cdd2db; padding: 8px;} 
    	  #tbldemo th { padding-top: 12px; padding-bottom: 12px; text-align: left; background-color: #e4eaf4; color: balck;} 
          </style>		
        </head>
        <body>
            <div>
                <p>Taken from wikpedia</p>
                <img
                    src="data:image/png;base64, iVBORw0KGgoAAAANSUhEUgAAAAUAAAAFCAYAAACNbyblAAAAHElEQVQI12P4//8/w38GIAXDIBKE0DHxgljNBAAO9TXL0Y4OHwAAAABJRU5ErkJggg=="
                    alt="Red dot"
                />
            </div>
            <p>
                <strong>
                    Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make
                    a type specimen book.
                </strong>
                <i>It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged.</i>
                <u>It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages,</u>and more recently with desktop publishing software
                <span style="color: hsl(0, 75%, 60%);"> like Aldus PageMaker </span>including versions of Lorem Ipsum.
                <span style="background-color: hsl(0, 75%, 60%);">Where does it come from? Contrary to popular belief, Lorem Ipsum is not simply random text.</span>
                It has roots in a piece of classical Latin literature from 45 BC, making it over 2000 years old.
            </p>
            <ul style="list-style-type: circle;">
                <li>Unordered list element</li>
            </ul>
            <br>
            <ol style="list-style-type: decimal;">
                <li>Ordered list element</li>
            </ol>
            <div class="page-break" style="page-break-after: always"></div>
            <ul>
                <li>
                    <a href="https://en.wikipedia.org/wiki/Coffee">
                        <strong>
                            <u>Coffee</u>
                        </strong>
                    </a>
                </li>
                <li>Tea
                    <ol>
                        <li>Black tea
                            <ol>
                                <li>Srilankan <strong>Tea</strong>
                                    <ul>
                                        <li>Uva <b>Tea</b></li>
                                    </ul>
                                </li>
                                <li>Assam Tea</li>
                            </ol>
                        </li>
                        <li>Green tea</li>
                    </ol>
                </li>
                <li>Milk
                    <ol>
                        <li>Cow Milk</li>
                        <li>Soy Milk</li>
                    </ol>
                </li>
            </ul>
            <br>
            <table id="tbldemo">
                <tr>
                    <th>Country</th>
                    <th>Capital</th>
                </tr>
                <tr>
                    <td>India</td>
                    <td>New Delhi</td>
                </tr>
                <tr>
                    <td>United States of America</td>
                    <td>Washington DC</td>
                </tr>
            </table>
        </body>
    </html>`; 
    

    This is how it's looks like.
    image

    What could be the issue?

    List inside table not working

    If we add list inside a table, then the result not showing the contents of list tag. Is this known issue? or we need to do any workaround to achieve this? Any suggestions would be helpful.

    Links does not render in non-block elements

    • version: 1.1.34
    • node: 15.6

    When experimenting with links I noticed that links that are in non block elements disappear. I saw #30 where links work but following example does not work for me:

    <div>
    	<!-- Working examples: -->
    	<h6>This is heading 6 <a href="www.google.se">heading link</a></h6>
    	<div>
    		<a href="www.google.se">Content</a>
    	</div>
    
    	<!-- non working examples -->
    	<div>
    		<!-- This one does not show the "link" at all -->
    		<span>some styled text <a href="www.google.se">link</a> </span>
    		<!-- This one shows the text but no link -->
    		<b>some styled text <a href="www.google.se">link</a> </b>
    	</div>
    </div>

    Recommend Projects

    • React photo React

      A declarative, efficient, and flexible JavaScript library for building user interfaces.

    • Vue.js photo Vue.js

      ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

    • Typescript photo Typescript

      TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

    • TensorFlow photo TensorFlow

      An Open Source Machine Learning Framework for Everyone

    • Django photo Django

      The Web framework for perfectionists with deadlines.

    • D3 photo D3

      Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

    Recommend Topics

    • javascript

      JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

    • web

      Some thing interesting about web. New door for the world.

    • server

      A server is a program made to process requests and deliver data to clients.

    • Machine learning

      Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

    • Game

      Some thing interesting about game, make everyone happy.

    Recommend Org

    • Facebook photo Facebook

      We are working to build community through open source technology. NB: members must have two-factor auth.

    • Microsoft photo Microsoft

      Open source projects and samples from Microsoft.

    • Google photo Google

      Google โค๏ธ Open Source for everyone.

    • D3 photo D3

      Data-Driven Documents codes.