Community Resources and Partnerships/India Rapid Project/Project LipiSetu - A Research Project for Digitizing and Decoding Sanskrit Manuscripts
Applicant
[edit]- Main Wikimedia username. (required)
NehalDaveND
- Organization
N/A
- If you are a group or organization leader, board member, president, executive director, or staff member at any Wikimedia group, affiliate, or Wikimedia Foundation, you are required to self-identify and present all roles. (required)
N/A
- Describe all relevant roles with the name of the group or organization and description of the role. (required)
N/A
Project
[edit]- 1. Please state the title of your proposal. This will also be the Meta-Wiki page title.
Project LipiSetu - A Research Project for Digitizing and Decoding Sanskrit Manuscripts
- 2. and 3. Proposed start and end dates for the proposal.
2025-08-01 - 2026-01-31
- 4. Where will this proposal be implemented? (required)
India
- 5. Are your activities part of a Wikimedia movement campaign, project, or event? If so, please select the relevant project or campaign. (required)
Other (please specify) Wikisource Loves Manuscripts
- 6. What is the change you are trying to bring? What are the main challenges or problems you are trying to solve? Describe this change or challenges, as well as main approaches to achieve it. (required)
There are many manuscripts in Sanskrit and other Indian languages, but very few efforts have been made to digitize and decode them. My goal is to collect these manuscripts and transcribe them into modern scripts—such as converting old Devanagari to modern Devanagari, or Sharda script to modern Devanagari.
This is a highly time-consuming process, and decoding a single manuscript could take years. However, I am committed to contributing to the Sanskrit Wikisource project. This effort could benefit the Free Knowledge Movement, which aims to make knowledge that has long been confined to libraries or owned by individuals more accessible to the public.
I previously worked on a project (you can view it here) and realized that simply capturing images of the manuscripts is not enough to contribute meaningfully to Sanskrit Wikisource. It became clear that decoding the script and making it accessible in modern formats is essential for it to be useful to the general public.
What is the change you are trying to bring?
I’m working to digitize and decode ancient Sanskrit and other Indian language manuscripts. The key change I want to bring is making these manuscripts accessible in modern digital formats by transcribing old scripts—like Devanagari and Sharda—into Unicode-compatible Devanagari. This effort will help ensure that valuable texts, which have long been confined to libraries or in private collections, are freely available to the public, scholars, and anyone interested in ancient knowledge.
I aim to contribute to Sanskrit Wikisource, a platform dedicated to freely sharing Sanskrit texts. By decoding and digitizing these manuscripts, I hope to support the Free Knowledge Movement and make these rich texts widely accessible.
What are the main challenges or problems you are trying to solve?
- Lack of Decoded and Accessible Manuscripts:There are many manuscripts, but very few have been decoded and made available in modern, accessible formats. Most efforts so far have only focused on capturing images of these manuscripts, which doesn’t help the public. Without proper transcription and decoding into modern scripts, the texts remain locked in their old formats, limiting their accessibility.
- Time-Consuming Process:Decoding ancient scripts is a highly labor-intensive task. Each manuscript could take years to decode, especially when the texts are written in less common or outdated scripts like Sharda. This makes the process slow, requiring careful attention to detail and patience.
- Limited Access to Knowledge:Many ancient manuscripts are currently locked away in physical form—either in libraries or private collections. Without digital access, much of this knowledge remains out of reach for the general public. The lack of digitization makes it harder for scholars, students, and even curious readers to study and benefit from these important texts.
- 7. What are the planned activities? (required) Please provide a list of main activities. You can also add a link to the public page for your project where details about your project can be found. Alternatively, you can upload a timeline document. When the activities include partnerships, include details about your partners and planned partnerships.
Main Activities for the Project:
- 8. Describe your team. Please provide their roles, Wikimedia Usernames and other details. (required) Include more details of the team, including their roles, usernames, Wikimedia group, and whether they are salaried, volunteers, consultants/contractors, etc.
Planner - Dr. Nehal Dave (NehalDaveND)
Coordinators - 1 - LYU Jayati, Coordinator - 2 - LYUkavyat3008Coordinator - 3 - LYU khushibagade
- 9. Who are the target participants and from which community? How will you engage participants before and during the activities? How will you follow up with participants after the activities? (required)
How My Time Will Be Occupied:
[edit]In this case, I’ll be spending a significant portion of my time personally decoding the Sanskrit manuscripts. This involves carefully studying the ancient scripts, deciphering each character, and ensuring accurate transcription. The process is meticulous and requires focused attention, as each text is rich in meaning and historical context.
I’ll be utilizing various tools, including dictionaries, reference materials, and advanced technology, to assist in decoding the text. I’ll also need to spend time verifying the accuracy of the transcription, cross-referencing with existing translations, and ensuring that the final result is faithful to the original text.
Additionally, I will be dedicating time to reviewing my own work, making revisions where necessary, and continuously refining the decoded text to ensure clarity and precision. This is a time-intensive process, demanding both patience and expertise.
I’ll also be handling administrative tasks, such as tracking the progress of the decoding process, documenting key findings, and making sure that each step of the project aligns with the original objectives.
Why an Honorarium Should Be Paid to Me:
[edit]Decoding Sanskrit manuscripts is a complex and specialized task that requires both deep knowledge of the language and the ability to understand ancient scripts. The effort involved in decoding, transcribing, and verifying the accuracy of each manuscript is significant and time-consuming.
The process requires careful research, expert analysis, and the constant revision of work to ensure the highest level of accuracy. As I am solely responsible for both the intellectual and technical aspects of this task, the time and effort invested in decoding these texts cannot be underestimated.
Given the depth of expertise required and the level of dedication necessary to ensure the accuracy and quality of the transcriptions, an honorarium is a fair recognition of the work I am doing. It reflects the value of my time, knowledge, and the detailed work involved in decoding these important historical manuscripts.
- 9.1. If your project includes in-person activities, are there any international participants travelling to India for them? (required)
- No
- 9.1.1. List all countries of participation. (required)
- 9.2. Will the project be transferring funds to any international participants? (required)
- No
- 9.2.1. List all international participants receiving funding and their countries. (required)
N/A
- 10. Does your project involve work with children or youth? (required)
- Yes
- 10.1. Please provide a link to your Youth Safety Policy. (required) If the proposal indicates direct contact with children or youth, you are required to outline compliance with international and local laws for working with children and youth, and provide a youth safety policy aligned with these laws. Read more here.
If I choose second option, I will teach Graduate or Post-Graduate students. We are not going to organize any residential program. Also, Bhagwat Vidyapeeth is equipped with CCTV cameras, the norms of no ragging etc. of UGC are already in place, so this workshop will be organized in a safe environment.
- 11. How did you discuss the idea of your project with your community members and/or any relevant groups? Please describe steps taken and provide links to any on-wiki community discussion(s) about the proposal. (required) You need to inform the community and/or group, discuss the project with them, and involve them in planning this proposal. You also need to align the activities with other projects happening in the planned area of implementation to ensure collaboration within the community.
I have invited idea from community. But cordinators and other old contributors will try to help me.
- 12. Does your proposal aim to work to bridge any of the content knowledge gaps (Knowledge Inequity)? Select one option that most apply to your work. (required)
Language
- 13. Does your proposal include any of these areas or thematic focus? Select one option that most applies to your work. (required)
Culture, heritage or GLAM
- 14. Will your work focus on involving participants from any underrepresented communities? Select one option that most apply to your work. (required)
Digital Access
- 15. In what ways do you think your proposal most contributes to the Movement Strategy 2030 recommendations. Select one that most applies. (required)
Innovate in Free Knowledge
Metrics
[edit]- 17. What do you hope to learn from your work in this project or proposal? (required)
1. Deepen My Knowledge of Sanskrit and Ancient Scripts
[edit]- While I already have some understanding of Sanskrit, this project will allow me to further enhance my expertise in ancient scripts and the nuances of Sanskrit texts. So I can contribute to Sanskrit Wikisource with greater proficiency.
- I’ll gain in-depth knowledge of historical variations in scripts like Devanagari, Sharda, and other ancient Indian writing systems.
- I’ll also improve my understanding of textual analysis and paleography (the study of ancient handwriting), which will deepen my academic grounding in the field.
2. Improve My Decoding and Transcription Skills
[edit]- This project will provide me with hands-on experience in decoding and transcribing complex ancient manuscripts. I expect to sharpen my skills in recognizing old scripts, translating them accurately into modern Devanagari Unicode, and ensuring fidelity to the original text.
- I will also learn how to efficiently deal with the challenges of partially damaged or unclear manuscripts, figuring out the best techniques for restoration and preservation.
3. Enhance My Project Management and Collaboration Skills
[edit]- If I collaborate with other Sanskrit scholars and Students, I’ll improve my ability to work with a team, manage diverse contributors, and guide others through complex tasks.
- I’ll gain experience in organizing workshops, teaching specialized knowledge, and creating learning resources that make it easier for others to engage with ancient scripts.
- Managing the project will help me hone my organizational skills, as I will be overseeing multiple tasks, ensuring deadlines are met, and adjusting strategies as needed.
4. Contribute to the Free Knowledge Movement
[edit]- One of the key things I hope to learn is how my efforts can have a tangible impact on the broader Wikimedia community. I want to see how the decoding and transcription of these manuscripts will contribute to the growth of Sanskrit Wikisource and increase access to knowledge that has long been locked in physical manuscripts.
- This will give me insight into the long-term impact of contributing to the Free Knowledge Movement and how digital archives can preserve and share cultural heritage for future generations.
5. Learn Best Practices in Digital Humanities
[edit]- As I work with ancient manuscripts and digitize them, I hope to learn more about digital preservation, text encoding, and other techniques in the digital humanities field. This will include understanding the best practices for preserving ancient content while ensuring it is accessible to modern audiences.
- I hope to gain insights into creating high-quality metadata and how it facilitates the discoverability and accessibility of digitized texts online.
6. Develop Insights into the Challenges of Preserving Ancient Texts
[edit]- I expect to encounter numerous challenges in preserving and transcribing ancient texts, whether it’s due to incomplete or damaged manuscripts or difficulties in interpreting ancient scripts. I hope to learn problem-solving techniques to tackle these issues effectively.
- Additionally, I’ll learn how to collaborate with others, whether they’re fellow transcribers, scholars, or members of the Wikimedia community, to share knowledge and support each other in overcoming obstacles.
- 18. What are your Wikimedia project targets in numbers (metrics)? (required)
| Other Metrics | Target | Optional description |
|---|---|---|
| Number of participants | 10 | This is very hard process, so I may find very few people for the contribution. If no-one will eligible for editing, I will do all the decoding myself. But I have contected Bhagwat Vidyapith, so I will get some students. After one workshop, I will choose some of them and start my work with them. |
| Number of editors | 10 | it will depend on script and work. So we should count per palm leaf and not per edit. |
| Number of organizers | 1 | This one organization has two sub-organizations (Colledges). B.Ed. College and Sanskrit College. |
| Wikimedia project | Number of content created or improved |
|---|---|
| Wikipedia | |
| Wikimedia Commons | |
| Wikidata | |
| Wiktionary | |
| Wikisource | 100 |
| Wikimedia Incubator | |
| Translatewiki | |
| MediaWiki | |
| Wikiquote | |
| Wikivoyage | |
| Wikibooks | |
| Wikiversity | |
| Wikinews | |
| Wikispecies | |
| Wikifunctions or Abstract Wikipedia |
- Optional description for content contributions.
100+ Pages.
Decoding Sanskrit manuscripts requires a significant amount of time, effort, and specialized expertise, especially when dealing with rare and complex texts in their oldest versions. The process is not just about transcribing the text but involves thorough research, careful analysis, and critical revision to ensure accuracy. Sanskrit, being a classical language, is already challenging, and working with the ancient versions of manuscripts makes the task even more intricate.
The decoding process should rightfully incur the highest portion of the budget, as it forms the core of the entire project. If the primary work—the decoding of the manuscript—does not receive adequate financial support, then the project loses its essence and value. I am not simply decoding and transcribing; I am also ensuring that this valuable historical and cultural heritage is made available to the public for free.
Unlike many projects where individuals decode manuscripts, publish them, and generate profit, our goal is to share this knowledge freely with the community. This is a courageous and noble endeavor, but it also requires a considerable investment of time, resources, and funds. The work involved is substantial, and while many similar projects charge for the finished product, our mission is to offer it at no cost to society.
The budget constraints should not diminish the importance of this project. While in the past, I have completed similar tasks without charging any fees, this particular project is far more demanding and time-intensive. Given the scale of the task and the dedication required, I will inevitably need the support of others to assist with various aspects of the work. This will require financial resources.
I understand that I am requesting funding, but it is essential to recognize that this project demands both time and expertise. I am fully committed to completing it as quickly as possible, with minimal errors, and on time. However, to achieve this, it is crucial to secure the necessary funds to cover the costs of the work.
- 19. Do you have any other project targets in numbers (metrics)? (optional)
No
| Main Open Metrics | Description | Target |
|---|---|---|
| N/A | N/A | N/A |
| N/A | N/A | N/A |
| N/A | N/A | N/A |
| N/A | N/A | N/A |
| N/A | N/A | N/A |
- 20. What tools would you use to measure each metrics? Please refer to the guide for a list of tools. You can also write that you are not sure and need support. (required)
When decoding Sanskrit manuscripts, it’s important to focus on the effort and time required to complete the task, rather than simply counting the pages or paragraphs. Here’s how I would explain measuring the effort and time for each metric:
Character Recognition Time: Identifying each character in a Sanskrit manuscript is not always straightforward, especially given the complexity of the script. Rather than using time-tracking tools, we should focus on the mental time and effort spent on recognizing each character. This can be measured by evaluating how long it takes to decode a character and what cognitive load it requires. The time taken isn’t just a number—it reflects the cognitive effort involved in identifying and understanding the script.
Line/Paragraph Decoding Time: Similarly, decoding an entire line or paragraph requires more than just counting time. It's about assessing how long it takes, considering the mental effort and focus required to understand and transcribe the content accurately. So, while we won’t use a time tracker, we should still measure how much effort is put into decoding each line/paragraph based on its complexity. The challenge is to gauge the overall time and mental exertion to decode that segment.
Effort or Cognitive Load: Decoding a manuscript involves significant cognitive effort—this is the mental strain that one experiences while interpreting and transcribing the manuscript. This will help us measure how taxing the task feels to the person doing the decoding, highlighting how much effort is involved.
Contextual Accuracy: Ensuring the decoded text is contextually accurate is also essential. To check this, we’ll focus on the accuracy of the decoding process and the mental resources needed to maintain that accuracy. It's not just about speed but the mental effort required to ensure the text is decoded faithfully to its original meaning.
Decoding Accuracy: For assessing accuracy, we should focus on the error rate—but again, this is linked to the effort involved. If there are errors, they may be a result of mental fatigue or a lack of focus due to the complexity of the text. This can be tracked by reviewing the decoded text for mistakes, but it should also be seen in the context of how much effort went into each decoding task.
Budget
[edit]- 21. Please upload your budget for this proposal or indicate the link to it. (required)
- 22. What is the amount you are requesting for this proposal? Please provide the amount in Indian Rupees. (required)
270600 INR
- 22.1. Convert the amount requested into USD using the Oanda converter. This is done to help you assess the USD equivalent of the requested amount. Your request should be between 500 - 5,000 USD. (required)
3300 USD
- By submitting this proposal request you agree with the Institutional Partner Privacy Policy, Application Privacy Statements, WMF Friendly Space Policy and Universal Code of Conduct.
Yes
This is an automatically generated Meta-Wiki page. The page was copied from Fluxx, the web service of Wikimedia Foundation Funds, where the user has submitted their application. Please do not make any changes to this page because all changes will be removed after the next update. Use the discussion page for your feedback. The page was created by CR-FluxxBot.