Indic-TechCom/Tools/IndicOCR

Indic-OCR is a tool that lets the Indic community run OCR on images on Wikisource. The tool is available at https://indic-ocr.toolforge.org/

What it does

It converts images to text for Wikisource.

Wikisources that benefit

Currently, Google OCR does not work for the following four Indic-language Wikisources:

  1. Malayalam Wikisource
  2. Oriya Wikisource
  3. Gujarati Wikisource
  4. Tamil Wikisource


Demo of IndicOCR userscript




Automatically

You can run OCR directly on a wiki page while editing. See the demo for an example.

Installation

Add the following code to your common.js page on your local wiki.

mw.loader.load('//meta.wikimedia.org/w/index.php?title=User:Indic-TechCom/Script/IndicOCR.js&action=raw&ctype=text/javascript');

If you also want an extra button in the Visual Editor, add the following code to the same common.js page.

mw.loader.load('//meta.wikimedia.org/w/index.php?title=User:Indic-TechCom/Script/OCR4VE.js&action=raw&ctype=text/javascript');
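If you prefer to load the script only where OCR is typically needed, the call above can be wrapped in a namespace check. The following is a minimal sketch, not part of IndicOCR itself: the helper name loadIndicOCRIfPageNamespace is hypothetical, and it assumes that your Wikisource reports the canonical namespace name "Page" for proofreading pages and that the script is not needed elsewhere.

```javascript
// URL copied from the installation instructions above.
var INDIC_OCR_URL = '//meta.wikimedia.org/w/index.php?title=User:Indic-TechCom/Script/IndicOCR.js&action=raw&ctype=text/javascript';

// Hypothetical helper (not provided by IndicOCR): loads the userscript
// only when the current page is in the "Page" namespace.
// Returns true if the script was requested, false otherwise.
function loadIndicOCRIfPageNamespace(mwObject) {
    // Assumption: the proofreading namespace has the canonical name "Page".
    if (mwObject.config.get('wgCanonicalNamespace') !== 'Page') {
        return false;
    }
    mwObject.loader.load(INDIC_OCR_URL);
    return true;
}

// On a wiki, the global `mw` object is available in common.js.
if (typeof mw !== 'undefined') {
    loadIndicOCRIfPageNamespace(mw);
}
```

If you are unsure whether the namespace check matches your wiki, the unconditional mw.loader.load call from the installation step is the simpler choice.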