Community Wishlist Survey 2020/Wikisource/Improve workflow for uploading books to Wikisource


Improve workflow for uploading books to Wikisource

Problem:

Uploading books to Wikisource is difficult.

In the current workflow you need to upload the file to Commons, then go to Wikisource and create the Index page (and you need to know the exact URL). The files need to be DJVU, a format that keeps the scan and the text in separate layers. This is important for tools like Match & Split (if the file is a PDF, this tool doesn't work).
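To illustrate the "exact URL" step, here is a minimal sketch of how an Index page URL could be derived from a Commons file name. The function name `index_page_url` and the `lang` parameter are hypothetical, and the sketch assumes the `Index:` namespace prefix is accepted on the target Wikisource (some wikis display a localized namespace name instead).

```python
from urllib.parse import quote


def index_page_url(commons_filename: str, lang: str = "en") -> str:
    """Build the URL of the Index page for a file already on Commons.

    Assumption: the Index page title is "Index:" followed by the exact
    Commons file name, with spaces written as underscores.
    """
    title = "Index:" + commons_filename.strip().replace(" ", "_")
    # Percent-encode anything unusual, but keep ':' and '/' readable.
    return f"https://{lang}.wikisource.org/wiki/{quote(title, safe=':/')}"


print(index_page_url("My Book.djvu"))
```

Having to do this by hand is exactly the kind of step a new user cannot be expected to know.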

More importantly, the current workflow (especially for library uploads) involves the Internet Archive and the well-known IA-Upload tool. This tool is now fundamental for many libraries and uploaders, but it has several issues.

Since Internet Archive stopped creating DJVU files from its scans, the international community has struggled to find a way of automatically creating a DJVU for upload to Commons and then Wikisource.

This has created a situation where libraries love Internet Archive and want to use it, but then get stuck because they don't know how to create a DJVU for Wikisource, and the IA-Upload tool is buggy and fails often.

Summary

- The IA-Upload tool is buggy and often fails when creating DJVU files.
- Match & Split (M&S) doesn't work with PDF files.
- Users do not expect to upload to Commons when transferring files from Internet Archive to Wikisource.
- Upload to Internet Archive is an important feature, especially for GLAMs (e.g. libraries).

Who would benefit:

- all Wikisource communities, especially new users
- new GLAMs (libraries and archives), who at the moment have a hard time coping with the Wiki ecosystem.

Proposed solution:

Improve the IA-Upload tool: https://tools.wmflabs.org/ia-upload/commons/init

The tool should be able to create good-quality DJVU files from Archive files, and should not fail as often as it does now.

It should also hide the Commons upload phase from the end user. The user should be able to upload a file to Internet Archive, and then use the ID of the file to directly create the Index page on Wikisource. We could have an "Advanced mode" that shows all the steps for experienced users, and a "Standard" one that makes things simpler.
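As a rough sketch of what the "Standard" mode could do behind the scenes, the function below takes an Internet Archive ID plus the item's file list and works out whether a DJVU must be generated and which Index page to create. Everything here is an assumption for illustration: `plan_upload` is a hypothetical helper, the file list is assumed to be a list of dicts with a `name` key (the shape the Internet Archive metadata API is understood to return), and naming the Commons file after the ID is just one possible convention, not how IA-Upload actually works.

```python
def plan_upload(identifier: str, files: list[dict]) -> dict:
    """Decide which steps are needed to go from an IA item to an Index page.

    `files` is assumed to look like [{"name": "book.pdf"}, ...].
    """
    names = [f.get("name", "") for f in files]
    djvu = next((n for n in names if n.lower().endswith(".djvu")), None)
    pdf = next((n for n in names if n.lower().endswith(".pdf")), None)

    if djvu:
        source, needs_conversion = djvu, False
    elif pdf:
        # No DJVU in the item: one has to be generated before upload.
        source, needs_conversion = pdf, True
    else:
        raise ValueError(f"No DJVU or PDF file found for {identifier!r}")

    commons_file = f"{identifier}.djvu"  # hypothetical naming convention
    return {
        "source_file": source,
        "needs_conversion": needs_conversion,
        "commons_file": commons_file,
        "index_title": f"Index:{commons_file}",
    }


# Example with a made-up item ID and file list.
plan = plan_upload(
    "examplebook00auth",
    [{"name": "examplebook00auth.pdf"}, {"name": "examplebook00auth_meta.xml"}],
)
print(plan["needs_conversion"], plan["index_title"])
```

In "Advanced mode" the same plan could simply be shown to the user step by step, while "Standard" mode would run it without asking.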

More comments:
Phabricator tickets: related: phab:T154413
Proposer: originally proposed by Aubrey (talk) in 2017 - re-proposed by Candalua (talk) 16:15, 6 November 2019 (UTC)

Discussion

Voting

Support Consulnico (talk) 11:32, 21 November 2019 (UTC)
Support MartinPoulter (talk) 14:21, 21 November 2019 (UTC)
Support Sadads (talk) 21:37, 21 November 2019 (UTC)
Support Viticulum (talk) 22:06, 21 November 2019 (UTC)
Support Libcub (talk) 08:19, 22 November 2019 (UTC)
Support Jahl de Vautban (talk) 09:17, 22 November 2019 (UTC)
Support Alan ^Talk 12:41, 22 November 2019 (UTC)
Support Bodhisattwa (talk) 14:47, 22 November 2019 (UTC)
Support Alf7e (talk) 16:44, 22 November 2019 (UTC)
Support Emptyfear (talk) 17:16, 23 November 2019 (UTC)
Support VIGNERON * ^discut. 10:08, 24 November 2019 (UTC)
Support Liuxinyu970226 (talk) 10:31, 24 November 2019 (UTC)
Support Marta Arosio (WMIT) (talk) 12:43, 25 November 2019 (UTC)
Support Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:20, 25 November 2019 (UTC)
Support Shev123 (talk) 14:21, 25 November 2019 (UTC)
Support Blue Rasberry (talk) 15:32, 25 November 2019 (UTC)
Support –MJL ‐Talk‐^☖ 15:44, 25 November 2019 (UTC)
Support 游魂 16:43, 25 November 2019 (UTC)
Support Slager (talk) 17:56, 25 November 2019 (UTC)
Support DraconicDark (talk) 21:10, 25 November 2019 (UTC)
Support Geonuch (talk) 01:35, 26 November 2019 (UTC)
Support Risker (talk) 05:04, 26 November 2019 (UTC)
Support Hsarrazin (talk) 14:29, 26 November 2019 (UTC)
Support Francesca Lissoni (WMIT) (talk) 09:12, 27 November 2019 (UTC)
Support--Francesca Ussani (WMIT) (talk) 09:27, 27 November 2019 (UTC)
Support GioRan (talk) 11:56, 27 November 2019 (UTC)
Support Acélan (talk) 13:15, 27 November 2019 (UTC)
Support Pyb (talk) 18:04, 27 November 2019 (UTC)
Support Toto256 (talk) 22:42, 27 November 2019 (UTC)
Support Wellparp (talk) 19:09, 28 November 2019 (UTC)
Support Marajozkee (talk) 14:44, 29 November 2019 (UTC)
Support Gurtej Chauhan (talk) 03:49, 1 December 2019 (UTC)
Support Candalua (talk) 16:36, 1 December 2019 (UTC)
Support सुबोध कुलकर्णी (talk) 12:04, 2 December 2019 (UTC)
Support YES, PLEASE Sannita - not just another it.wiki sysop 13:10, 2 December 2019 (UTC)
Support This is why I'm not contributing to Wikisource. Trizek ^{from FR} 13:37, 2 December 2019 (UTC)
Support Novak Watchmen (talk) 17:53, 2 December 2019 (UTC)