Pdf.js: Can I use pdf.js in node.js to extract the pdf file's data like image, text(or json), fonts ..., and save them in a folder?

Created on 15 Nov 2016  ·  8Comments  ·  Source: mozilla/pdf.js

I just want to convert a pdf file to a folder that include text/json, image, fonts, and render them in browser By myself.

Are there relevant solutions?

Most helpful comment

You can find it here

https://github.com/zeddysoft/pdf-processor/blob/master/handler.js

Don't forget to star the repo :)

All 8 comments

Hi @yurydelendik , thanks for the link you shared, but i noticed that the script just converts each page of the pdf to an svg, how about one that extracts the images from all the pages, something like what this does.

Hi @zeddysoft , I am looking for a way to extract images from a pdf page too. Have you found something ?

Yes i have

Haha nice, and can you share it ? :) Because i tried many things... but everything failed x)

You can find it here

https://github.com/zeddysoft/pdf-processor/blob/master/handler.js

Don't forget to star the repo :)

Thank you, i will check that ! And i won't forget the star ;)

Hi @zeddysoft Could you explain how to use that repo

Was this page helpful?
0 / 5 - 0 ratings

Related issues

anggikolo11 picture anggikolo11  ·  3Comments

AlexP3 picture AlexP3  ·  3Comments

azetutu picture azetutu  ·  4Comments

xingxiaoyiyio picture xingxiaoyiyio  ·  3Comments

jigskpatel picture jigskpatel  ·  3Comments