Openlibrary: Better integration of project gutenberg's material

Created on 26 Oct 2019  ·  4Comments  ·  Source: internetarchive/openlibrary

Right now, the 'text's are in a txt or img format. That's difficult to integrate into the Open Library. I would like to recommend contacting Project Gutenberg to create complementary pdf's for their html texts to incorporate into the Open Library.
project gutenberg: https://archive.org/details/gutenberg/downloader/index.php?tab=collection
example: https://archive.org/details/cookeryanddining29728gut
ideal html of example to convert to pdf: http://www.gutenberg.org/files/29728/29728-h/29728-h.htm
example location on open library: https://openlibrary.org/search?q=apicius&author_key=OL4645453A&m=edit&mode=ebooks&has_fulltext=true

Data Import Triage 3 Feature Request

Most helpful comment

They also distribute texts in HTML and ePUB formats. I would have thought that ePub which is supported natively by online bookreaders would be much more desirable than PDF.

All 4 comments

They also distribute texts in HTML and ePUB formats. I would have thought that ePub which is supported natively by online bookreaders would be much more desirable than PDF.

I prefer PDF, because while the epub format's for tablets, PDF's for both tablets and computers. Also, it doesn't capture the formatting well enough to always use.

I think this is going to have to be solved by the IA Product team (they know about it), tagging @bfalling and closing.

@mekarpeles Awesome. I should've closed this sooner. Thanks!

Was this page helpful?
0 / 5 - 0 ratings