Grants:Programs/Wikimedia Community Fund/Rapid Fund/WikiTutur 2.0:Expanding Open-Access Lexicographic Materials for Languages in Western Indonesia (ID: 22733637)/Final Report
Application type: Standard application
Part 1: Project and impact
1. Describe the implemented activities and results achieved. Additionally, share which approaches were most effective in supporting you to achieve the results. (required)
WikiTutur 2.0 was carried out in two formats: Lokakarya (workshops) and Kopdar (meetups and edit-a-thons). The workshops took place from October 2024 to April 2025. The Kopdar sessions were held between November 2024 and April 2025 as follow-up activities to improve recordings and entries, and to engage new participants who had not had the chance to attend the workshops.
We conducted in-person workshops in 9 different cities:
- Padang (22 October 2024), in collaboration with Komunitas Wikimedia Padang, the West Sumatra Language Office, and the Wikimedia Community User Group Malaysia.
- Bandar Lampung (17 November 2024), in collaboration with Komunitas Wikimedia Bandar Lampung and Sekubal Unila.
- Malang (23 November 2024), in collaboration with KlubWiki Universitas Brawijaya, Faculty of Cultural Studies at Universitas Brawijaya, and Perdukunan Tengger.
- Palembang (23 November 2024), in collaboration with Komunitas Wikimedia Palembang and Dewan Pembinaan Adat Kota Palembang.
- Marabahan (1 December 2024), in collaboration with Komunitas Wikimedia Banjar and Komunitas Perahu Kata.
- Jakarta (23 January 2025), in collaboration with Lembaga Kebudayaan Betawi.
- Tanjungpinang (15 February 2025), in collaboration with Komunitas Kalamelayu and the Wikimedia Community User Group Malaysia.
- Pontianak (15 March 2025), in collaboration with Komunitas Wikimedia Pontianak, the Communication Science Student Association (HIMAKOM) of Universitas Tanjungpura, and the Wikimedia Community User Group Malaysia.
- Jambi (27 April 2025), in collaboration with Wikimedia volunteers in Jambi, Forum Penggiat Literasi Jambi (FPLJ), and the Wikimedia Community User Group Malaysia.
The workshops began with presentations introducing the Indonesian Wiktionary, Lingua Libre, Wikimedia Commons, and WikiTutur. Participants then collaboratively compiled a list of vocabulary in the targeted language, followed by guided recording sessions using the Lingua Libre tool with native speakers. At the end of the event, participants were trained to write and edit the Indonesian Wiktionary entries by embedding the Lingua Libre audio files into the lemma pages.
WikiTutur 2.0 Kopdar (Meetups and edit-a-thons) were held five times with the Jakarta community:
- 16 November 2024
- 22 December 2024
- 20 April 2025
- 29 April 2025
- 30 April 2025 (Hybrid)
The success of WikiTutur 2.0 came from collaboration between the Wikimedia Jakarta Community and various local Wikimedia communities across Indonesia. Local communities played an essential role as organizers in preparing venues and event logistics, and in serving as bridges to reach participants. We also collaborated with language-related communities, organizations, and institutions outside of Wikimedia, who supported vocabulary collection and provided native speakers, an essential element in language preservation.
In addition, our collaboration with the Wikimedia Community User Group Malaysia has been invaluable. They contributed as workshop trainers and shared their experiences in developing WikiKata, a similar project in Malaysia. This mutual exchange has allowed us to support one another in expanding the Wiktionary project.
2. Documentation of your impact. Please use space below to share links that help tell your story, impact, and evaluation. (required)
Share links to:
- Project page on Meta-Wiki or any other Wikimedia project
- Dashboards and tools that you used to track contributions
- Some photos or videos from your event. Remember to share access.
You can also share links to:
- Important social media posts
- Surveys and their results
- Infographics and sound files
- Examples of content edited on Wikimedia projects
- The individual report pages for each event can be found on the Meta-Wiki page (Link here).
- Outreach Dashboard for the whole campaign (Link here)
- Photos from the events (Link here)
Additionally, share the materials and resources that you used in the implementation of your project. (required)
For example:
- Training materials and guides
- Presentations and slides
- Work processes and plans
- Any other materials your team has created or adapted and can be shared with others
- Lokakarya slide decks (Link here)
- A video introduction for WikiTutur (Link here)
3. To what extent do you agree with the following statements regarding the work carried out with this Rapid Fund? You can choose “not applicable” if your work does not relate to these goals. Required. Select one option per question. (required)
| A. Bring in participants from underrepresented groups | Strongly agree |
| B. Create a more inclusive and connected culture in our community | Strongly agree |
| C. Develop content about underrepresented topics/groups | Strongly agree |
| D. Develop content from underrepresented perspectives | Strongly agree |
| E. Encourage the retention of editors | Strongly agree |
| F. Encourage the retention of organizers | Strongly agree |
| G. Increased participants' feelings of belonging and connection to the movement | Strongly agree |
| H. Other (optional) | |
Part 2: Learning
4. In your application, you outlined some learning questions. What did you learn from these learning questions when you implemented your project? How do you hope to use these learnings in the future? (required)
You can recall these learning questions below:
- What is the percentage increase in the number of Lingua Libre audio files for Western Indonesian languages in Wikimedia Commons after the second edition of WikiTutur?
- What is the percentage increase in the number of lexical entries for the languages of Western Indonesia in Indonesian Wiktionary after the second edition of WikiTutur?
- What are the challenges in compiling lexicographic materials from the target languages for Lingua Libre recording and Wiktionary entries?
- How does the quality of Wiktionary entries in the second edition of WikiTutur compare to those from the first edition, particularly in terms of depth and complexity?
The program succeeded in recording 16 language varieties, or 13 languages by ISO 639-3 codes. Five of the languages (by code) had no prior recordings in Wikimedia Commons (Tengger [tes], Lampung Nyo [abl], Keninjal [knl], Kerinci [kvr], Kanayatn [knx]), while the rest saw a significant increase in the number of recordings. Out of the 10,525 recordings available in Commons for these languages, 5,411 were recorded during the program, amounting to a 105.8% increase over the 5,114 recordings that would exist without WikiTutur 2.0. For Indonesian Wiktionary, the increase was smaller, with 1,505 new entries (out of a total of 11,627 entries) added during the program, amounting to a 14.87% increase over the 10,122 entries that would exist without WikiTutur 2.0.
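The percentage figures above can be reproduced with a short calculation. The following is a minimal sketch, assuming the baseline is the current total minus the items added during the program; the function name `percent_increase` is hypothetical and used only for illustration.

```python
def percent_increase(total_after: int, added: int) -> float:
    """Return the percentage increase relative to the pre-program baseline.

    Assumption: baseline = total_after - added, i.e. everything not
    added during the program already existed beforehand.
    """
    baseline = total_after - added
    if baseline <= 0:
        raise ValueError("baseline must be positive to compute an increase")
    return added / baseline * 100


# Figures taken from the report text.
commons = percent_increase(total_after=10_525, added=5_411)
wiktionary = percent_increase(total_after=11_627, added=1_505)

print(f"Commons recordings: {commons:.1f}% increase")
print(f"Wiktionary entries: {wiktionary:.2f}% increase")
```

Measuring against the pre-program baseline, rather than against the current total, is what yields a figure above 100% for Commons: the program added more recordings than existed before it.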
As stated in the original proposal, this phase aimed for a more detailed and in-depth description of the languages in the entries compared to the first phase. To that end, we attempted to compile not only simple lists of basic words but also alternative forms, synonyms, derived terms, and usage examples. This, however, proved to be rather time-consuming, as we needed to confirm everything with the consultants before moving on to the recording sessions. As a result, most of the time we only managed to conduct data collection and recordings, with little to no Wiktionary editing. On the other hand, the data we compiled were detailed and comprehensive enough for the Jakarta community to conduct separate edit-a-thons to input them into Wiktionary without the help of the original consultants.
The entries added to Indonesian Wiktionary vary in quality, but in general each entry consists of pronunciation, definitions, and occasionally synonyms, usage examples, and derivations. Compared to the first phase, where the focus was to get as many lexical items as possible, the entries in the second phase generally have more subsections and are more structured, though they do not necessarily have more content. For example, the new entry for Betawi beludik, besides having definitions and pronunciation, also has sections on etymology and semantic relations, though it has no usage examples. Originally, we also planned to include audio recordings for all usage examples, but this was quickly abandoned as it took far more time to collect usage examples for each lexical item and record them. Some of the earlier entries from this phase do include recordings of usage examples, as seen in the entry for Minangkabau abang.
One thing to note is that, unlike the first phase, where there were several online edit-a-thon sessions with varying degrees of success, events in the second phase were all conducted onsite, and all consistently showed a higher number of results (541.1 recordings per event) compared to the onsite events of the first phase (414.6 recordings per event). Overall, despite the challenges in implementing our experimental approaches, the results were modestly positive and may inform further attempts at language documentation involving Wikimedia projects.
5. Did anything unexpected or surprising happen when implementing your activities? This can include both positive and negative situations. What did you learn from those experiences? (required)
In several events, many of the participants were not able to access the internet with their own devices, whether due to poor connection or other technical issues. In those cases, we mitigated the problem by assigning groups for the data compilation and recording sessions, in which the consultants were recorded using the same devices (and often the same few Wikimedia accounts, though with different Speaker IDs on Lingua Libre).
On a more positive note, we found that dividing the events into separate data collection and recording sessions for each consultant might be more effective (even if time-consuming) than doing them simultaneously (as was the case for the first phase). Compared to recordings from the first phase, the ones from the second phase are generally clearer with no overlapping sounds; cf. Palembang [File:LL-Q12497929_(mis)-dwi_(Dwvtaa)-akar.wav akar] from the first phase vs. Betawi akar and Malay akar (1, 2) from the second phase.
6. What is your plan to share your project learnings and results with other community members? If you have already done it, describe how. (required)
As of May 2025, Indonesian Wikimedians focusing on language description and documentation have initiated an informal grouping under the name "Komunitas Wiktionary Indonesia" to further develop various Wiktionary editions in the languages of Indonesia. Since last year, the WikiTutur team has continuously shared the project's results with the group, which consists of Wiktionary editors from various region-based and language-based Wikimedia communities in Indonesia.
In addition, we have also shared preliminary learning and results from the second phase of the project in an ESEAP Community Call on 4 May 2025. We are open to sharing the results with any other communities interested in conducting a similar language documentation project via Wiktionary and Lingua Libre.
Part 3: Metrics
7. Wikimedia Metrics results. (required)
In your application, you set some Wikimedia targets in numbers (Wikimedia metrics). In this section, you will describe the achieved results and provide links to the tools used.
| | Target | Results | Comments and tools used |
|---|---|---|---|
| Number of participants | 96 | 101 | |
| Number of editors | 80 | 123 | From the Outreach Dashboard for WikiTutur 2.0 |
| Number of organizers | 16 | 22 | Counting unique instances of trainers and local trainer assistants in all events |
| Wikimedia project | Target | Result - Number of created pages | Result - Number of improved pages |
|---|---|---|---|
| Wikipedia | | | |
| Wikimedia Commons | 4000 | 5411 | 0 |
| Wikidata | | | |
| Wiktionary | 1500 | 1505 | 0 |
| Wikisource | | | |
| Wikimedia Incubator | | | |
| Translatewiki | | | |
| MediaWiki | | | |
| Wikiquote | | | |
| Wikivoyage | | | |
| Wikibooks | | | |
| Wikiversity | | | |
| Wikinews | | | |
| Wikispecies | | | |
| Wikifunctions or Abstract Wikipedia | | | |
8. Other Metrics results.
In your proposal, you could also set Other Metrics targets. Please describe the achieved results and provide links to the tools used if you set Other Metrics in your application.
| Other Metrics name | Metrics Description | Target | Result | Tools and comments |
|---|---|---|---|---|
9. Did you have any difficulties collecting data to measure your results? (required)
No
9.1. Please state what difficulties you had. How do you hope to overcome these challenges in the future? Do you have any recommendations for the Foundation to support you in addressing these challenges? (required)
Part 4: Financial reporting
10. Please state the total amount spent in your local currency. (required)
74075528
11. Please state the total amount spent in US dollars. (required)
4796.22
12. Report the funds spent in the currency of your fund. (required)
Provide the link to the financial report https://docs.google.com/spreadsheets/d/1LZn02hlHWtWBbD5EhAHRmAIKige3POMu-c_qtbv5ouo/edit?usp=drivesdk
12.2. If you have not already done so in your financial spending report, please provide information on changes in the budget in relation to your original proposal. (optional)
13. Do you have any unspent funds from the Fund?
Yes
13.1. Please list the amount and currency you did not use and explain why.
IDR 3,392,768.2 (including IDR 252,654 from the previous WikiTutur)
13.2. What are you planning to do with the underspent funds?
B. Propose to use them to partially or fully fund a new/future request with PO approval
13.3. Please provide details of how you hope to spend these funds.
We will use them for the next grant request (WikiTutur 3.0; the complete program name and the link to the application will be provided soon).
14.1. Are you in compliance with the terms outlined in the fund agreement?
Yes
14.2. Are you in compliance with all applicable laws and regulations as outlined in the grant agreement?
Yes
14.3. Are you in compliance with provisions of the United States Internal Revenue Code (“Code”), and with relevant tax laws and regulations restricting the use of the Funds as outlined in the grant agreement? In summary, this is to confirm that the funds were used in alignment with the WMF mission and for charitable/nonprofit/educational purposes.
Yes
15. If you have additional recommendations or reflections that don’t fit into the above sections, please write them here. (optional)
Review notes
Review notes from Program Officer:
N/A
Applicant's response to the review feedback.
N/A