Jump to content

Community Resources and Partnerships/India Rapid Project/Project LipiSetu - A Research Project for Digitizing and Decoding Sanskrit Manuscripts - 2

From Meta, a Wikimedia project coordination wiki
statusNot funded
Project LipiSetu - A Research Project for Digitizing and Decoding Sanskrit Manuscripts - 2
proposed start date2026-06-01
proposed end date2026-11-30
amount requested (INR)500000 INR
amount requested (USD)5000 USD
applicant• NehalDaveND
organization (if applicable)• N/A

Applicant

[edit]
Main Wikimedia username. (required)

NehalDaveND

Organization

N/A

If you are a group or organization leader, board member, president, executive director, or staff member at any Wikimedia group, affiliate, or Wikimedia Foundation, you are required to self-identify and present all roles. (required)

N/A

Describe all relevant roles with the name of the group or organization and description of the role. (required)

N/A

Project

[edit]
1. Please state the title of your proposal. This will also be the Meta-Wiki page title.

Project LipiSetu - A Research Project for Digitizing and Decoding Sanskrit Manuscripts - 2

2. and 3. Proposed start and end dates for the proposal.

2026-06-01 - 2026-11-30

4. Where will this proposal be implemented? (required)

India

5. Are your activities part of a Wikimedia movement campaign, project, or event? If so, please select the relevant project or campaign. (required)

Other (please specify) Wiki Loves Manuscripts https://meta.wikimedia.org/wiki/Wikisource_Loves_Manuscripts

6. What is the change you are trying to bring? What are the main challenges or problems you are trying to solve? Describe this change or challenges, as well as main approaches to achieve it. (required)

Countless manuscripts in Sanskrit and other Indian languages remain largely inaccessible, with very few concerted efforts dedicated to their digitization and interpretation. My objective is to curate these manuscripts and transcribe them into modern scripts—converting, for example, old Devanagari or Sharda into their contemporary counterparts.

While this is an incredibly time-intensive endeavor—where decoding a single manuscript can span years—I am fully committed to contributing to the Sanskrit Wikisource project. This work directly serves the Free Knowledge Movement, which seeks to democratize access to knowledge long sequestered in private collections or traditional archives.

Through [my previous project 1|https://meta.wikimedia.org/wiki/Grants:Programs/Wikimedia_Community_Fund/Rapid_Fund/Digitization_of_manuscripts_and_other_material_for_Sanskrit_Wikisource_and_Wikipedia_(ID:_21976965))] and [project 2|[1]], it became evident that merely digitizing images of these manuscripts is insufficient for a meaningful contribution. True utility lies in transcribing and decoding these scripts into accessible, modern formats, ensuring this ancient knowledge is genuinely available to the public.

7. What are the planned activities? (required) Please provide a list of main activities. You can also add a link to the public page for your project where details about your project can be found. Alternatively, you can upload a timeline document. When the activities include partnerships, include details about your partners and planned partnerships.

Main Approaches and Planned Activities:

Approach 1: Phased Manuscript Preparation and Digitization

  • Activity: Sourcing and initial assessment of physical rare books and manuscripts.
  • Activity: Engaging a preservation specialist on a short-term, task-basis to safely handle, clean, and prepare fragile hard copies before they are processed.
  • Activity: Contracting a dedicated scanning technician (on a need-basis) to systematically capture high-resolution images of the prepared manuscripts.

Approach 2: Digital Asset Management and Core Decoding

  • Activity: Establishing a structured data management system to organize the physical inventory and the newly created digital files (soft copies).
  • Activity (My Primary Focus): Step-by-step transcription and decoding of the archaic scripts (such as Sharda or old Devanagari) into modern, machine-readable Devanagari.
  • Activity: Formatting and uploading the finalized, decoded text to the Sanskrit Wikisource platform to make it publicly accessible.

Rationale: Why this approach?I have chosen a modular, task-based management strategy rather than hiring full-time staff. Decoding ancient manuscripts is a highly time-consuming process that moves at its own pace. By dividing the labor and bringing in specialized help strictly for preservation and scanning when a batch of manuscripts is ready, I ensure that funds are used efficiently—paying only for active work.

This step-by-step workflow guarantees the physical safety of the rare books by using preservation experts, while entirely freeing my time to focus on project management and my core expertise: decoding the complex texts. This model ensures high-quality output at every stage without creating unsustainable financial overhead.

I can't give any timeline, but I will work in phases. 

Phase 1 - Choose books for preservation and scanning.

Phase 2 - Digitalization and data management.

Phase 3 - Decoding and uploading in wikisourse. 


This will be as per situations. 


8. Describe your team. Please provide their roles, Wikimedia Usernames and other details. (required) Include more details of the team, including their roles, usernames, Wikimedia group, and whether they are salaried, volunteers, consultants/contractors, etc.

Last time I told some wikipedian and other new users, but they were not able to give time for such time-taking and hardwork. So only Nitesh Upadhyay will help and I will take help from helper on per session time.

Organizing project and deconding - Nehal Dave

Helping in Manuscript hard words - Nitesh Upadhyay

Scanning of Manuscripts and rare books - Any normal person, who does not have much computer knowledge but can help in scaning work. I will make him arrange hard copy in proper way.

Digitalization and data management - I will decode and give him to upload. He also make 

9. Who are the target participants and from which community? How will you engage participants before and during the activities? How will you follow up with participants after the activities? (required)

Sanskrit Scholars and Sanskrit lover public

9.1. If your project includes in-person activities, are there any international participants travelling to India for them? (required)
No
9.1.1. List all countries of participation. (required)


9.2. Will the project be transferring funds to any international participants? (required)
No
9.2.1. List all international participants receiving funding and their countries. (required)

N/A

10. Does your project involve work with children or youth? (required)
No
10.1. Please provide a link to your Youth Safety Policy. (required) If the proposal indicates direct contact with children or youth, you are required to outline compliance with international and local laws for working with children and youth, and provide a youth safety policy aligned with these laws. Read more here.

N/A

11. How did you discuss the idea of your project with your community members and/or any relevant groups? Please describe steps taken and provide links to any on-wiki community discussion(s) about the proposal. (required) You need to inform the community and/or group, discuss the project with them, and involve them in planning this proposal. You also need to align the activities with other projects happening in the planned area of implementation to ensure collaboration within the community.

[[2]|here]



12. Does your proposal aim to work to bridge any of the content knowledge gaps (Knowledge Inequity)? Select one option that most apply to your work. (required)

Language

13. Does your proposal include any of these areas or thematic focus? Select one option that most applies to your work. (required)

Culture, heritage or GLAM

14. Will your work focus on involving participants from any underrepresented communities? Select one option that most apply to your work. (required)

Digital Access

15. In what ways do you think your proposal most contributes to the Movement Strategy 2030 recommendations. Select one that most applies. (required)

Innovate in Free Knowledge

Metrics

[edit]
17. What do you hope to learn from your work in this project or proposal? (required)

1. Tackling Highly Complex Sanskrit and Ancient ScriptsBuilding upon the foundational expertise gained in my previous project, I am now equipped to tackle much more complex and heavily degraded manuscripts. My deepened understanding of historical script variations (like archaic Devanagari and Sharda) and Indian philosophy allows me to decode deeply layered texts with greater academic rigor and accuracy for Sanskrit Wikisource.

2. Executing High-Volume, Precision TranscriptionHaving refined my decoding techniques, this phase focuses on applying those skills at a higher volume and with greater precision. Moving past the initial learning curve, I can now efficiently transcribe archaic scripts into modern Unicode Devanagari at a faster pace, while successfully recovering critical text from partially damaged or severely aged folios using the methodologies I have established.

3. Streamlining Modular Project ManagementI will apply the project management experience I gained previously to streamline my "phase-based" workflow. By highly optimizing how I coordinate with preservation specialists and scanning technicians on a task-basis, I can process a larger physical inventory of rare books. This streamlined management ensures the physical handling happens seamlessly, allowing me to maintain uninterrupted focus on the core decoding work.

4. Scaling Contributions to the Free Knowledge MovementMy goal now is to significantly scale my tangible impact on the Wikimedia community. Transitioning from my initial uploads, I aim to provide a continuous, high-volume stream of decoded texts to Sanskrit Wikisource. This sustained effort will directly accelerate the availability of previously sequestered academic and religious texts for public and scholarly use.

5. Applying Advanced Digital Humanities StandardsHaving mastered the best practices of digital preservation and text encoding, I will now apply these standards systematically to a larger dataset. Every manuscript decoded in this phase will be processed with advanced metadata and precise structural formatting from the very beginning, ensuring maximum discoverability and seamless integration into global digital archives.

6. Efficiently Resolving Textual and Preservation ChallengesI no longer just expect to encounter challenges like illegible scripts or incomplete folios; I now have the targeted problem-solving experience to overcome them swiftly. In this phase, I will utilize my established network of scholars and my refined technical approach to quickly resolve textual ambiguities, ensuring that difficult manuscripts do not stall the transcription pipeline.

18. What are your Wikimedia project targets in numbers (metrics)? (required)
Number of participants, editors, and organizers
Other Metrics Target Optional description
Number of participants 1 Nitesh Upadhyay, but he is not much active in wikipedia, but I tought him something. He will help sometimes and other times may take fees.
Number of editors 10 I will teach this decoding methods to Ranchi University students. I hope 10 editors may learn this.
Number of organizers 1 Nehal Dave
Number of content contributions to Wikimedia projects
Wikimedia project Number of content created or improved
Wikipedia
Wikimedia Commons
Wikidata
Wiktionary
Wikisource 30
Wikimedia Incubator
Translatewiki
MediaWiki
Wikiquote
Wikivoyage
Wikibooks
Wikiversity
Wikinews
Wikispecies
Wikifunctions or Abstract Wikipedia
Optional description for content contributions.

10 Manuscripts and 20 Rare Books.

19. Do you have any other project targets in numbers (metrics)? (optional)

No

Main Open Metrics Data
Main Open Metrics Description Target
N/A N/A N/A
N/A N/A N/A
N/A N/A N/A
N/A N/A N/A
N/A N/A N/A
20. What tools would you use to measure each metrics? Please refer to the guide for a list of tools. You can also write that you are not sure and need support. (required)

Measuring Decoding Effort and Complexity

In evaluating the transcription of ancient Sanskrit manuscripts, rigid quantitative metrics—such as basic page or paragraph counts—fail to capture the true scope of the work. To accurately reflect the demands of this project, I utilize a qualitative evaluation model focused on cognitive load and scholarly effort:

1. Assessing Character-Level Cognitive LoadDeciphering archaic scripts is rarely straightforward and requires deep paleographic analysis. Rather than using mechanical time-trackers, I measure the mental exertion and specialized focus required to accurately identify complex, often degraded characters. The true metric is not the clock time, but the cognitive effort necessary to resolve script ambiguities.

2. Evaluating Segmental Decoding EffortTranscribing continuous text demands sustained concentration. I assess the mental bandwidth required to comprehend and accurately transcribe complete lines or paragraphs. This approach measures the overall scholarly exertion and mental focus needed to process complex textual segments, providing a much more accurate picture of the workload than simply counting paragraphs.

3. Quantifying Overall Mental StrainThe core of manuscript preservation is the intense cognitive effort required to interpret ancient texts. By acknowledging the mental strain and fatigue inherent in continuous, high-level transcription, I can better pace the workflow. This ensures that the intense focus required for decoding archaic scripts is managed sustainably throughout the project lifecycle.

4. Sustaining Contextual FidelityTrue decoding is not merely rapid transliteration; it requires preserving the original meaning. I evaluate the mental resources dedicated to maintaining strict contextual accuracy. This guarantees that my focus remains on the faithful, high-quality representation of the text's historical depth, prioritizing absolute fidelity over sheer speed.

5. Contextualizing Accuracy and Error RatesWhen evaluating transcription accuracy, I analyze any error rates directly through the lens of cognitive fatigue and textual complexity. Mistakes in this highly specialized field are often indicators of severely degraded source material or the mental strain of prolonged focus. By reviewing accuracy in the context of the effort expended, I can continuously refine my workflow to maintain the highest scholarly standards.

Budget

[edit]
21. Please upload your budget for this proposal or indicate the link to it. (required)
22. What is the amount you are requesting for this proposal? Please provide the amount in Indian Rupees. (required)

500000 INR

22.1. Convert the amount requested into USD using the Oanda converter. This is done to help you assess the USD equivalent of the requested amount. Your request should be between 500 - 5,000 USD. (required)

5000 USD

By submitting this proposal request you agree with the Institutional Partner Privacy Policy, Application Privacy Statements, WMF Friendly Space Policy and Universal Code of Conduct.

Yes


This is an automatically generated Meta-Wiki page. The page was copied from Fluxx, the web service of Wikimedia Foundation Funds, where the user has submitted their application. Please do not make any changes to this page because all changes will be removed after the next update. Use the discussion page for your feedback. The page was created by CR-FluxxBot.