Indic-TechCom/Tools/IndicOCR

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search

Indic-OCR is a tool for Indic community to OCR the Images on Wikisource. The tool URL is https://tools.wmflabs.org/indic-ocr/

What it do[edit]

It converts Image to text for Wikisource.

Beneficial Wikisources[edit]

Currently Google OCR is not working for 6 Indic languages, which is following

  1. Malayalam Wikisource
  2. Oriya Wikisource
  3. Gujrati Wikisource
  4. Tamil Wikisource

Automatically[edit]

Demo of IndicOCR userscirpt

You can OCR hand to hand on wiki page. See demo for that.

Installation

Add the following code to your local wiki common.js page.

mw.loader.load('//meta.wikimedia.org/w/index.php?title=User:Indic-TechCom/Script/IndicOCR.js&action=raw&ctype=text/javascript');