Grants:IEG/Public Domain Textbook Import

From Meta, a Wikimedia project coordination wiki

status: withdrawn

Individual Engagement Grants
Individual Engagement Grants
Review grant submissions
review
grant submissions
Visit IdeaLab submissions
visit
IdeaLab submissions
eligibility and selection criteria

project:

Public Domain Textbook Import


project contact:

taosubmarines(_AT_)mail.ru

participants:


grantees: User:Herper.gr


summary:

Working on https://en.wikibooks.org/w/index.php?title=Snakes_of_Europe, this makes me wonder how to work with a single scanned pdf page, how to import the OCR text from archive.org, extract plates, tables or diagrams as an image, and ensure approximate layout is the same on a heavily modified pdf/odt output (printed ebook)





2014 round 1

Project idea[edit]

What is the problem you're trying to solve?[edit]

The book, The Snakes Of Europe is a 100 year old public domain text. There is no modern day comparable open access/source field-guide/reference/text-book, that has photos to assist species identification of the subject matter, and information. My idea is to enable easy cross-referencing of external scans (pdf/image/...) (ie from archive.org) and of external OCR sources, and to look at ways to improve import of OCR (https://en.wikibooks.org/wiki/Snakes_of_Europe/Definition_and_Classification has various tables incorporated that have not been pasted legibly, whereas https://en.wikibooks.org/wiki/Snakes_of_Europe/Habits is a fairly simple copy and paste, fully legible, even without correct formatting).

What is your solution?[edit]

I would like to develop an extension for firefox and chrome to have external pages (a pdf page from archive.org) on a top side panel and be able to work with OCR text on a lower side panel, so as to be able to import (copy) a piece of the pdf with say an image or a table of interest. With this concept I would be able to paste selected tables to the 'Definition_and_Classification' article above, and have them appear as images. I could also copy in image plates, and taxobox type features, of individual species easily. In this extension i could also look at incorporating addtional utilities such as GOCR, to work in only this extension.

Project goals[edit]


Ready to create the rest of your proposal?
Use the button below just once to create the remaining sections you'll need!


Part 2: The Project Plan[edit]

Project plan[edit]

Scope[edit]

Activities[edit]

Budget[edit]

Total amount requested[edit]

Budget breakdown[edit]

Intended impact[edit]

Target audience[edit]

Community engagement[edit]

Fit with strategy[edit]

Sustainability[edit]

Measures of success[edit]

Need target-setting tips?

Participant(s)[edit]

Discussion[edit]

Community Notification[edit]

Please paste a link below to where the relevant communities have been notified of this proposal, and to any other relevant community discussions. Need notification tips?

Endorsements[edit]

Do you think this project should be selected for an Individual Engagement Grant? Please add your name and rationale for endorsing this project in the list below. Other feedback, questions or concerns from community members are also highly valued, but please post them on the talk page of this proposal.

  • Community member: add your name and rationale here.