Indic-TechCom/Requests/IWCC2018

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search

Wikisource community in India had an Indic Wikisource Community consultation(IWCC) programme in Kolkata on November 24th and 25th. It was organized by CIS-A2K. Indic-Tech com was requested to take part in this event and help the Wikisource community in finxing the broken gadgets, creating new Java scripts which will help in Wikisource editing, creating new tools etc. During this event Indic-Tech com launched an OCR tool which was created User:Jayprakash12345. The following requests were made by the Wikisource community during the event.

Wscontest tool[edit]

To prepare a tool which will help show the number of pages proofread, validated etc by a user in a particular time. This tools will help our community to have a Proofread-a-thon and see and who has contributed more for the Proofread-a-thon. This tool will also help the community members to look at their own contribution and bost up there energy and contribute more to Wikisource.

Wiki source stat[edit]

The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section.

To create a Wikisource stats page for Indic Wikisource communities. This page should contain the stats of Page namespace, Main namespace and their subcategories like not proofread, proofread, validated, with the scan, without the scan and etc. This list should be sortable and this tool should be only for Indic languages. Currently, we have a tool which is running based on the number of articles in categories like to proofread, validates and etc but this tool is but manually and it's for all the languages.

@Jayantanth, Titodutta, Bodhisattwa, and Ananth subray:, Yes check.svg https://tools.wmflabs.org/indic-wsstats/ --Jayprakash >>> Talk 05:37, 26 December 2018 (UTC)
  • Looks awesome Jay. Please change the © symbol to "Created and maintained by". -- Tito Dutta (talk) 06:59, 26 December 2018 (UTC)
Tito Dutta Ji, Changed--Jayprakash >>> Talk 07:52, 26 December 2018 (UTC)
Thank you. -- Tito Dutta (talk) 08:36, 26 December 2018 (UTC)
Awesome. Jayantanth (talk) 17:16, 26 December 2018 (UTC)

The above discussion is preserved as an archive. Please do not modify it. Subsequent comments should be made in a new section.

hOCR[edit]

The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section.

hOCR is a format for representing OCR output, which includes the layout information, text and style information. It embeds this information invisibly in standard HTML format. Previous OCR formats, the recognized text, and OCR-related information co-exist in the same file and when you upload the books to commons and create the INdex page on Wikisource the OCRed text will be avaliable. Which reduces the community members effort in running the OCR. To get this tool working tesseract should be updated from 3.0 to 4.0.

@Jayantanth, Titodutta, Bodhisattwa, and Ananth subray:, X mark.svg Not done I have talked with Tpt See phab:T208711#4780257. There is no need to change in tools's repository. The problem is that Toolsforge provides us Ubuntu 14.04 But tesseract-ocr 4.0 will need ubuntu 18.04. We can't do anything from our side. The WMF Cloud team will provide us Debian Jessie after some time (around 4-6 months). So closing it as not done. because It needs to run only one command. Which Tpt will take care of. If You need it any cost right now. then we have to host this tool at third-party hosting. Which may cost 800 Rs per month. And I think 1st option will better to wait 4-6 month. Thanks--Jayprakash >>> Talk 11:34, 21 December 2018 (UTC)
Thanks @Jayprakash12345:, for your effort. I can under stand the the main issue is. As of now really we don't need it urgently, because we have two option now for OCR ( Google Drive OCR and Fusion OCR). If google have stopped this service, it will required urgently. Lets wait for WMF Cloud service to update Ubuntu 18 at their server. Thanks again. Jayantanth (talk) 16:40, 21 December 2018 (UTC)

The above discussion is preserved as an archive. Please do not modify it. Subsequent comments should be made in a new section.

Android app for reading Wikisource books[edit]

To create an Andriod app which will help the readers to download the books or read online from the Wikisource website. This will increase the number of readers in every langauges.

Image Crop[edit]

The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section.

This is a tool which will help all the people to add the images in the actual text. When this tool is enabled it will allow you to select the part of the page which you want. Currently, this is having some issue and not able to use it properly.

@Jayantanth, Titodutta, Bodhisattwa, and Ananth subray:, Yes check.svg wikisource:kn:ಸದಸ್ಯ:Jayprakash12345/Cropimage.js, I have made some other changes as well like remove percentage encoding and change its position from advance to the main tool. Thanks :)--Jayprakash >>> Talk 15:30, 21 December 2018 (UTC)

The above discussion is preserved as an archive. Please do not modify it. Subsequent comments should be made in a new section.

Qr code generator[edit]

This is a request to create a tool which will generate a QR code of HD quality of each book on Wikisource. Which can be given to different libraries.

Symbol wait.svg Work in Progress--Jayprakash >>> Talk 11:18, 21 December 2018 (UTC)

Voice typing button[edit]

Similar to OCR, Voice typing is working completely fine on a Documents. Community members requested to integrate this tool on Wikisource os that it will save a lot of energy and time of the volunteers.