Grants:IEG/Pronunciation Recording (Finish incomplete GSoC project)

From Meta, a Wikimedia project coordination wiki
Jump to: navigation, search

Individual Engagement Grants This project is funded by an Individual Engagement Grant

proposal people timeline & progress finances midpoint report


status: selected

project:

Pronunciation Recording (Finish incomplete GSoC project)


project contact:

E-Mail to Rillke; Project communication page

participants:



grantees:

volunteers:


summary:

Provide PronunciationRecording as a gadget.

engagement target:

Wiktionary

strategic priority:

Improving Quality, Infrastructure

total amount requested:

EUR 1450


2014 round 1

Imagine you could create, upload and embed pronunciation samples within just a few seconds. This is the aim of the Pronunciation Recording Gadget.

Project idea[edit]

Video: English, German

If you have trouble watching this video on Wikimedia sites, there is a copy on Vimeo.

What is the problem you're trying to solve?[edit]

Hey, have you ever had a look into the Wiktionary? Did you ever came across a word like Mòcheno? And you possibly knew how to pronounce that correctly? Or you were just wondering? Let me tell you, it is a laborious if not even tedious process, recording a pronunciation and uploading it: First, one needs recording software, then one has to record, save in correct file format and upload it to Wikimedia Commons, including preparation of the file description page, and finally inclusion into the Wiktionary entry is required. What would you say if you could do all this in your browser without having to care for uploading, file description preparation, recording software and how this plays together?

What is your solution?[edit]

The idea of extension "Pronunciation Recording" was to provide a web-interface written in JavaScript and utilizing the browser's audio API allowing users to record the pronunciation of a word which is quite useful for Wikitionary.

Unfortunately this project was, like a lot of GSoC-Projects, not completed. Since IEG does not support extension-work, I would like to fork the code and remove the Upload Wizard-dependency because only FormData is required for uploading in modern browsers that support the audio API and none of the user-interface-elements of Upload Wizard are required. In the end, a wizard-guided-process for adding good pronunciations, driven by JavaScript will be the solution, that I am going to offer.

Workflow (Draft)[edit]

there is also material on mw:User:Rahul21/Gsoc2013/Proposal

  1. Contributor, who is willing to provide a pronunciation sample visits Wiktionary entry
  2. Pronunciation Recording Gadget (PRG) checks whether visitor is logged-in and the relevant APIs are implemented in the visitor's browser
  3. Evaluation of page content: Pronunciation samples for all IPAs? Is there is a template like {{rfap}} in the Wiktionary entry?
  4. PRG adds a button (or icon) for starting the process (location dependent on Wiktionary's preferences)
  5. When clicked, PRG starts a wizard-guided-process (no specific morphology draft available, yet) is started:
    1. Learn: If user preferences are unknown to the tool, a learn page will be displayed first, telling the contributor a few words about the flow of this tool, best practices for recording and that it is going to request access to the microphone when proceeding (to check everything is installed correctly, we'll request access to your microphone ...)
    2. Access and Preferences: If the contributor decided to proceed, the wizard will be greyed out, request access to the user's microphone, and if successful, continue to the next step, asking for preferences including a section with a license-picker, as well as a section asking how the user would like to be attributed. When the user decides to proceed, these preferences will be saved into the Wiktionary user account. These two steps will be skipped in future when invoked again by the contributor. User accounts must not be shared so it's okay to save the license selection.
    3. Recording: Now, a recording toolbar is shown with the word and IPA to speak. It allows recording of multiple samples (listing them for playback) so, in the end, the contributor can decide which one to submit. If it should turn out that simple audio visualization (graph) and manipulation tools (cutting, volume-level) are required they could be also implemented.
    4. Once the contributor has chosen the best sample, it is uploaded to Wikimedia Commons without having to leave the page. Upload progress will be shown.
    5. As soon as the upload to completed, the Wiktionary entry is edited automatically, adding the audio sample. When successful, a thank you or success-message is shown and the page is either re-loaded or the sound file is dynamically inserted at the correct position in the current page-rendering, or the fresh page text is retrieved via API (not sure whether the media player respects the mw.hook for new content, yet).

Project goals[edit]

To enable Wiktionary users as fast as possible to use the tool, I intend forking the extension, adding the missing features and providing it as a RL-module at Wikimedia Commons together with instructions how to load it from other wikis. Plan B is, if Commons Community believes that there are license-incompatibilities, to host it on Toollabs and if this turns out being too unreliable, plan C is an own labs instance.


Part 2: The Project Plan[edit]

Project plan[edit]

Scope:[edit]

Scope and activities[edit]

We will spend most of the time coding and gathering feedback from the Wiktionary community.

Timeline[edit]
Date Event
03-31 Deadline for committing proposals
04-20 End of community review
05-05 End of committee review
05-30 Grantees announced
06-01 Setting up GitHub or Gitorious repo
06-01 Setting up Project and Feedback page(s)
06-02 Creation of the code necessary to implement Pronunciation Recording
06-15 Integration of the code necessary to upload
07-01 Installation on Commons (I am admin there and this gadget is clearly in scope of a multimedia repo)
07-02 Testing on Commons
07-10 Gathering first feedback by a small number of users
07-10 Using that feedback for troubleshooting
07-25 Next feedback cycle, this time with more testers
08-05 Using feedback to improve code base
08-05 Using feedback to write guide about best practices for best results ("learn step" of the wizard)
08-08 Wikimania presentation
08-15 Starting midterm report
09-01 Midterm report
09-01 Designing Wiktionary integration and workflow
09-30 Coding and providing installation snippet for Wiktionary
10-01 Creation of written documentation
10-05 Creation of a tutorial video
10-25 Notifying Wiktionary administrators about the availability and how they can install this gadget locally without having to send their users to Commons
10-25 Measuring use: If <80% of users willing to record pronunciation are not able due to missing audio-APIs and the number could be raised by >15% by using Flash, consider Flash-Shim plan.
11-15 Gathering feedback
11-15 Using feedback to improve code base
12-15 Writing final report
12-31 Final report
Pronunciation Recording Gadget will build on top of jQuery, which is included in all standard MediaWiki installations.
The HTML5 audio API will be used as well as canvas or SVG drawing, XHR2 and FormData

Tools, technologies, and techniques[edit]

During software development, there are multiple feedback cycles scheduled allowing Wikitionary community members to actively take the product into the right direction ensuring the software we produce fits their needs. Most funded time will be spent by coding JavaScript, bringing the browser's audio API, FormData for file upload and canvas or SVG elements for audio visualization into play. The ultimate goal is the creation of a gadget. Rillke owns a tool labs account and is capable using SSH tunnels, Databases and SFTP, therefore the gadget could be alternatively hosted there. Usage metrics will be sent to, saved and evaluated at tool labs respecting the privacy policy, of course. There are no intentions paying for external usability testers as long as we'll get sufficient useful feedback by the Wiktionary community.

Browser compatibility[edit]

Compatibility table According to caniuse.com

Newer versions of:

Mozilla Firefox 3.5 logo 256.png
Mozilla Firefox
Google Chrome icon (2011).png
Google Chrome and Chromium
Opera O.svg
Opera

Safari can be expected supporting getUserMedia (the required API) in one of the next versions as Apple hasn't announced the contrary. Internet Explorer will most likely not support the API used because Microsoft declared that they would like to implement something differently. If a huge part of users willing to add pronunciation recording cannot due to missing browser support, we consider authoring/adding a Flashplayer-Shim but we'll first try without that as it means a lot of extra-efforts and promoting technologies that are not favourable. Rillke going to ask at Microsoft STC if he's able to, how they suggest to implement audio recording in Internet Explorer.

There is a test page (not created, nor maintained by grantee) for HTML5 audio API.

Budget:[edit]

Total amount requested[edit]

EUR 1450


Budget breakdown[edit]

Grantee Task / Expenses for / Role Requested Total requested
Rillke Project management, Branding and logo design - EUR 850
Food, drink and a nice, calm place for efficient coding for 4 months 5 hrs/week EUR 450
Audio-editing and visualization tools (38 h) EUR 200 (possibly not required; depends)
Setting up the development environment and exactly complying with MediaWiki's coding conventions, Drafting code specifications EUR 100 (optional; the IEG committee may decide not to fund this module)
Align to Extension work: Creating automated tests that could be run by Jenkins and providing code allowing the progress made here to be more easily integrated into the extension EUR 100 (optional; the IEG committee may decide not to fund this module)
Ungoliant MMDCCLXIV Quality assurance: Time providing feedback, running automated and manual testing, gathering community feedback, usability testing EUR 200 (about $ 274.50) EUR 200
Infovarius Old browser testing, Wiktionary and Wikidata community feedback, bot assistance EUR 200 (about $ 274.50) EUR 200
Volunteers, Contest winners Wikimedia merchandise for volunteers (WMF shop): Providing volunteers helping organizing community engagement (making the tool known, organizing a pronunciation rally, including authoring banners, awards, advertising, etc.) with small gifts and providing them with merchandize items for contest winners (will be requested on demand); Please consider the number of wiktionaries when evaluating the amount requested here max. EUR 200 (about $ 274.50) EUR 200
TOTAL: EUR 1450

Intended impact:[edit]

Target audience[edit]

Wiktionary users, Commons maintainers[edit]
Background icon iOS style.svg
Commons-logo.svg
Foreground icon iOS.svg
Background icon iOS style.svg
Wiktionary book logo.png
Foreground icon iOS.svg
Background icon iOS style.svg
Wiktionary-logo-en.svg
Foreground icon iOS.svg
Background icon iOS style.svg
Wiktionary-logo.svg
Foreground icon iOS.svg

By easing the creation, uploading and inclusion process, there are substantial benefits for Wiktionary users creating pronunciation samples. Visitors of Wiktionary, especially non-native speakers will hopefully get more audio samples and Wikimedia Commons community members, getting perfect file description pages profit from the development of this gadget. The open source world and the MediaWiki extension development, depending on the IEG-FDC's decision, may get positive impulses from the gadget-development.

Pronunciation samples, often as audio samples, are frequently requested at Wiktionary: en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary

Some users feel IPA transcriptions are not useful: en.wiktionary, en.wiktionary, en.wiktionary, en.wiktionary

FLAC encoder required, otherwise transfer volume of uncompressed audio is too huge
Wikipedia users[edit]

Without a FLAC or Opus encoder, it's not suitable for recording full Wikipedia articles, if the resulting audio would be longer than 5 minutes. If you desire creating me a JavaScript FLAC/Opus encoder, please comment below. It will be a huge pile of work (approx. 5 months with 10-20 hrs/week of coding). If you found a JavaScript FLAC encoder, just point me to the direction and things will be a lot easier. I know there is speex.js but it appears to cut the input at the beginning and the quality is not that great.

There are interested parties in getting Opus encoder support and the W3C made a draft allowing clients to request audio samples in a format of their desire and Firefox on Windows even records to this format by default when using its MediaRecorder API. Though, this limits the ability to edit the audio sample as it would have to be done prior to encoding. This could be probably achieved creating a MediaStream from the recorded sample data or an AudioBuffer in an OfflineAudioContext from the WAVE in-memory-file. Both have the disadvantage that they are rendered in real time (thus the user has to wait for the complete recorded sample to be played).

Community engagement[edit]

First, I am going to send a message to the Grease pit and the equivalent discussion forums in other language Wiktionaries. I am also going to contact people who regularly upload pronunciations to Commons directly (either via IRC or onWiki). Looking for one or two Wiktionary Community members interested in working as consultants and testers supporting the creation of an awesome tool that really fits Wiktionary's needs. Depending on good-will by volunteers is too risky for me but of course is also welcome. Just add your name at the volunteers: section, if you're willing to give me a hand. A project page and a feedback page will be created at Commons; Pronunciation Recording will offer a link to both of them. Shortly before completing Phase I and Phase II (c.f. #Measures of success), I am going ask for first feedback, either through the consultants or directly.

Fit with strategy[edit]

Offering this easy-to-use tool will enrich the Wiktionary's editing tools enhancing efficiency and therefore possibly increase participation and it will also improve Wiktionary's quality for the visitors if more spoken samples are available.


Code will be open source. Audio editing tools might be useful for other projects, beyond the bounds of the WMF as well.

Sustainability[edit]

The increasing browser support on the audio API in the near future will lead to more users who are able to make use of the tool.

I will optionally follow the MediaWiki JavaScript coding conventions here so some of the code that turns out to be useful can be backported to the extension. The code will be packed into modules, so, no matter what some upload, recording and UI code, as well as icons will be created that might be useful for other projects as well.

Measures of success[edit]

Let's be pragmatic here rather than writing a philosophical essay doing great analysis: If the small number of community members I am going to contact is happy with my implementation and will make use of the tool, I am also, leaving aside the question whether technical tools encourage participation or whether it is the climate in the community.

Some "hard" and measurable factors:

Phase I[edit]
  • Creation of the code necessary to implement Pronunciation Recording
  • Integration of the code necessary to upload
  • Installation on Commons (I am admin there and this is clearly in scope)
  • Testing on Commons (I'll make sure to keep distraction of my colleagues at Commons as low as possible)
  • Gathering feedback
  • Using feedback to improve code base
  • Using feedback to write guide about best practices for best results ("learn step" of the wizard)
  • Writing midterm report - the goal for midterm is not a high number of products (audio files) but suitable feedback and a working prototype running on Commons
Phase II[edit]
  • Wiktionary integration and workflow
  • Providing installation snippet for Wiktionary
  • Creation of written documentation
  • Creation of a tutorial video
  • Notifying Wiktionary administrators about the availability and how they can install this gadget locally without having to send their users to Commons
  • Measuring use
    • If <80% of users willing to record pronunciation are not able due to missing audio-APIs and the number could be raised by >15% by using Flash, consider Flash-Shim plan.
  • Gathering feedback
  • Using feedback to improve code base
  • Writing final report
How many[edit]

… but we're wondering how many users you expect to use the tool to upload sound files at the end of 6 months

First of all, I think purely focusing on user numbers without distinguishing which kind of users the PRG attracted (power users who e.g. recommend PRG to their mates, integrate it into Wiktionary's workflow; or users who only rarely contribute something; users who pay a lot of attention to do it correctly; users who are more sloppy about their actions) is not a suitable marker for success.

But it is a good start to develop a strategy for getting a clue about the impact. First we have to know:

  1. How many active users are there in Wiktionaries? And how does the edit-count-distribution look like. [graphs and numbers to be added]
  2. How many Wiktionary users have a Microphone or other suitable recording device at their service? [needs survey]
  3. How many of them are inclined recording pronunciation? [needs survey]
  4. How many of #3 will use PRG?
In numbers

We agreed in IRC that we are going to plot the number of pronunciation uploads and the number of users producing them versus time. Global usage is another interesting indicator we might want to measure.

Measuring use
  • As outlined by Infovarius, this can be done by adding a template the file description page and then querying templatelinks.
  • Or, and this allows more accurate tracking, a service running on tool-labs to which additional information like account age, user groups and edit count is submitted.
Wiktionary statistics

Data were taken from Labs using the following query:

SELECT CONVERT(rc_user_text USING utf8) AS user_name, COUNT(*) AS edit_count
FROM frwiktionary_p.recentchanges
WHERE (NOT rc_user_text LIKE '%Bot%') AND (NOT rc_user_text LIKE '%bot%') AND (rc_bot = 0)
GROUP BY rc_user ORDER BY edit_count DESC;

English language Wiktionary had 78208, Russian language Wiktionary 16523 and Portuguese language Wiktionary 3885 changes by non-bot users in 30 days.

Recent changes on selected wiktionaries in 30 d.svg


Distribution of active users by edit count in the last 30 days on selected wiktionaries.svg

Further statistics at stats.wikimedia.org.

Participant(s)[edit]

  • Rillke, as a community administrator at Wikimedia Commons, mainly caring about technical aspects, has gathered lot of experience in JavaScript coding over the last 2 years that lead to the development of several tools for both, less experienced ([1]) and more experienced ([2], [3]) users including upload implementations. Rillke furthermore contributed several small fixes to, for example Upload Wizard.
  • Ungoliant MMDCCLXIV. English Wiktionary member since January 2011 and administrator since August 2012. Made over 30000 contributions in that project. Has some experience with programming but little with JavaScript.
  • Infovarius. Russian Wiktionary member and administrator. Made over 40000 personal and 400000 automated edits in that particular project. Runs self-programmed bot for arbitrary tasks. Also planning to provide compatibility with Wikidata.

Good reasons to fund this proposal[edit]

  • The desired result is very specific. The risk that it fails for the reason of a fuzzy or abstract goal is therefore low. It is something that is going to be used directly by end users.
  • The result is desired, otherwise no extension work would have been started by the WMF.
  • The proposer is a trusted community member at Wikimedia Commons, experienced in coding JavaScript with active community involvement.
  • Extension work is stale (c.f. #Withdrawal 2013 and re-application 2014) and the gathered feedback and the solutions to address concerns are could be valuable to the extension development.

Discussion[edit]

Community Notification:[edit]

Please paste a link to where the relevant communities have been notified of this proposal, and to any other relevant community discussions, here.

First notification cycle
Second notification cycle
Messages sent by Staff

Withdrawal 2013 and re-application 2014[edit]

Due to an unforeseeable lack of time, I was unable to talk to the community in time. I therefore withdraw this proposal. Let's see how the work on the WMF-side is progressing in the next half year. And if it isn't I am probably going to re-launch this proposal. In the meanwhie I have more time familarizing with Wiktionary. Thanks for all the comments at the talk page. -- Rillke (talk) 17:58, 22 October 2013 (UTC)

Ok, Rillke - sorry to see you withdrawing this time, hope you'll resubmit in a future round! Siko (WMF) (talk) 16:17, 28 October 2013 (UTC)
There was zero progress after I've withdrawn the request, only some automated substitions and i18n updates: Commits to the extension: Last real codebase update was on 2013-09-23. Thus, there is still need for an actually working solution, Wiktionary users could benefit from. -- Rillke (talk) 08:13, 6 March 2014 (UTC)

Suggestions by commenters and feature requests[edit]

similar to brainwriting

  • secure a minimum standard of quality
  • Audio-processing like automated volume adjustment, de-noise, Adaptive Leveler (?)
  • with the possibility of eg noise reduction, normalization ( so that the audio files maintains a steady volume ) and so you can get visual feedback on how the recording was successful
  • Allow drag & drop a prepared sample
  • remember metadata about the speaker (e.g. their regional accent) to save re-entering it
  • mobile browsers
  • mobile apps
  • before suggesting to record a new file, it should try to find an existing file that would do the job
  • unless we include an encoder (which might or might not be doable with Emscripten)
  • mass recording: Just showing words you record it and proceed to the next word
  • proper categorization (including sort key)
  • sample sentences and how they are named and categorized
  • entire sentences or phrases should be possible
  • format to have proper links to the Commons file page, not just a link to the file itself
  • suggest standard formats for language prefixes, like the "en-" for English or "de-" for German
  • should be working from outside Wiktionaries as well -- Commons, Wikidata, Wikipedia all have demands, too
  • choose the target wiki for uploads as per an installation option of the wiki, Commons being he default
  • quickly redoing and then being able to choose is imho the best approach to a 1st step of quality assurance of recordings
  • oscillogram and a spectogram shown per recording - Praat is open source but written in c++
  • interface for inserting dialect information [where do you come from, where did you grow up, where did you learn the language you are talking in] and storing these meta data somewhere
  • DeEsser
  • Aussprache von Personennamen und Orten in der Wikipedia -> Wikidata eintragen
  • Namen des Mikrofons speichern

Audio Quality Standards and File Upload[edit]

First of all: I think this tool has much potential and could reduce the amount of work necessary to create pronunciation recordings to a minimum. It's a great idea! I created a few recordings for de.wiktionary.org and at first I had a few problems figuring out which steps are necessary to get a proper recording into Wiktionary. I am not familiar with programming and things like that but I have two notes regarding this tool:

1. To ensure a standard in audio quality it would be great to connect this tool to Auphonic.com. This austrian company runs a few algorithms on audio files and is able to improve the quality of a file. It's maybe even more important that they standardize the volume and meta data of a recording. They offer an API but I'm not sure if this tool and Auphonic could be working together properly.
2. In my case the pronunciation recordings are a byproduct of a podcast I produce. When I upload my recordings to commons, they are already done. It would be nice to use those finished recordings in the same way I could create a new one with this tool. My recommendation is to add a context menu for uploading existing files in the recording section. That would still improve my workflow a lot, because I would not have to upload my files to commons and add the recording at de.wiktionary.org separately. Also, I could use my favorite DAW to alter the audio like I want to. I know this tool is meant for more unexperienced users, but it could make the lifes of more sophisticated users easier, too.

Thanks for your work and time! --LarsvonSpeck (talk) 14:31, 15 March 2014 (UTC)

Hi LarsvonSpeck,
First, thanks for the feedback and these great suggestions.
1. If auphonic.com is inclined to provide us with free API access for at least the next 5 years, I will be inclined to consider building on top of their services. I am going to ask them. As this would involve communication to external servers, this would be an extra-button or setting a user has to explicitly opt-in and agree to their terms of service. They would also have to agree to keep the same standards as the WMF has in regard to data protection etc.
In case this fails, I can imagine a custom implementation of the Adaptive Leveler without having to rely on third party services. De-noise seems to be more complecated.
We'll mention some tricks to ensure a minimum of quality in the first step. I imagine that these tricks will be fetched from a page maintained by the community, enabling Wikitionary contributors to adjust their demand for a specific level of quality. If that page does not exist in a wiktionary, a default will be displayed.
2. Yes, this sounds like a great idea. This will be implemented, if the proposal is selected. Maybe not as context menu but there will be a way to insert recordings from the local file system. E.g. with Drag&drop into the recording-window. Dependent on the input format and browser, playback might not be available for these kind of samples before they're uploaded. -- Rillke (talk) 14:52, 15 March 2014 (UTC)
  • Currentlyy, I am using Shtooka Recorder, semi-automated patching, and uploading via PyWikiBot - working but complicated to set up. Good to have a better approach.
  • Quickly redoing and then being able to choose is imho the best approach to a 1st step of quality assurance of recordings.
  • Should be working from outside Wiktionaries as well. Commons, Wikidata, Wikipedia all have demands, too. User should be able to choose the target wiki for their uploads as per an installatino option of the wiki, Commons being he default.
  • As a user selectable option, having both an oscillogram and a spectogram shown per recording would be nice. Could be added later as well, since computing them is not trivial. Praat has free open source doing that.
  • Doing entire sentences or phrases should be possible, they are needed.
  • Thanks for your proposal and thanks for coming back this year!

--Purodha Blissenbach (talk) 08:55, 17 April 2014 (UTC)

Hey Purodha, thank you for commenting and all the suggestions. -- Rillke (talk) 12:55, 17 April 2014 (UTC)

Endorsements:[edit]

Do you think this project should be selected for an Individual Engagement Grant? Please add your name and rationale for endorsing this project in the list below. Other feedback, questions or concerns from community members are also highly valued, but please post them on the talk page of this proposal.

  • Community member: add your name and rationale here.
  • I think this would be a very valuable project. In my experience, Wiktionary contributors tend not to be very vocal when it comes to the technical side of things, so I think the silence here should not be seen as a concern. This, that and the other (talk) 05:18, 8 March 2014 (UTC)
  • The main principle of the wiki is that everybody makes his contribution to it. It worked well for the definitions, the proof being that we now have +2.5M pages. However, a lot of pronunciations are still missing to improve the quality of our projects. This is a great idea we have here, and Rilke seems quite motivated to develop it. Let's give him the means to achieve it. -- Quentinv57 (talk) 14:13, 8 March 2014 (UTC)
  • Nothing special to say but just a message to support strongly your work. I wish not become a consultant (mainly due to a lack of time) but I will be glad to test your extension. Pamputt (talk) 12:38, 9 March 2014 (UTC)
  • Based on the demo video, I think this would be a very useful gadget. Wiktionary would benefit from having more pronunciation info. Many people are unfamiliar with pronunciation transcription schemes, but most people are able to speak their native languages aloud without difficulty. A gadget like this would make it easy for them to record themselves speaking and uploading the files. Wiktionary also covers placenames and surnames, and if users uploaded pronunciations of these, Wikipedia would also benefit (en.WP's article on The Hague already includes the same audio file of the city's Dutch name that Wiktionary has, similar audio, though users might initially add it to WP, could then be copied to WP). PS, I agree with This, that and the other's assessment of Wiktionary as rather taciturn. :b -sche (talk) 00:03, 14 March 2014 (UTC)
    I support this as well. Wiktionary has a number of active editors from various backgrounds. I probably won't record a lot in Russian (my native tongue) but I can do it on request and will probably request others to record words in other languages I'd like to have recording for. Many languages lack audio recordings altogether. This could also be used in various linguistic discussions (which lead to decisions about the provided information), currently people can only use IPA or someone else's recordings. --Anatoli (talk) 01:06, 14 March 2014 (UTC)
  • This tool will encourage me to record Armenian pronunciations for all Armenian words I create on Wiktionary—something I don't do now because of the complexity of uploading. --Vahagn Petrosyan (talk) 06:58, 14 March 2014 (UTC)
  • The project sounds good to me. I hope that many users will participate. The realization seems to be very user-friendly. Keep up the good work! Best regards --Yoursmile (talk) 12:59, 15 March 2014 (UTC)
  • This tool would simplify the process of creating pronunciation recordings to a level that's more reasonable for unexperienced users and could - if it's done properly - secure a minimum standard of quality. The time and effort which are necessary to create a recording are too high at the moment. There are too many steps and work that has to be done manually, which could be done by software. This project could solve those problems. --LarsvonSpeck (talk) 13:47, 15 March 2014 (UTC)
  • I would like to add audio files of English words to Wiktionary, but I gave up on learning the recording and uploading process long ago. I would add IPA transcriptions to more words, but I often don't know how to accurately transcribe words in English, despite being a native speaker. I think this new method would be extremely beneficial to language learners. I would love to be able to add recordings of terms that can't be found in any other dictionary. Ultimateria (talk) 20:41, 22 March 2014 (UTC)
  • I'd certainly welcome this. Hopefully it would be able to remember metadata about the speaker (e.g. their regional accent) to save re-entering it. I am curious if this would also work on mobile browsers or if recording on devices would require a separate app? (iOS doesn't even have ogg playback support yet, so I'm guessing not on iOS) —Pengo (talk) 04:41, 25 March 2014 (UTC)
    • Yeah, mobile would be awesome but currently only BlackBerry Browser 10 implements the Web-APIs necessary (so I would even lack test devices) and gadgets cannot be deployed to mobile targets easily; the only way getting community JavaScript executed seems to be MediaWiki:Mobile.js. This development will focus on desktop devices first and I'll, if time permits, provide tools for collecting metrics about users on mobile willing to record pronunciation. The issue is that coding an app, even when using cross-platform-helpers like PhoneGap, for every single device type is becoming quite time-expensive. Nonetheless, I'll give it a try. -- Rillke (talk) 08:29, 25 March 2014 (UTC)
  • This is a great idea. It's only the hassle of uploading audio files that's put me off hitherto; I regularly add IPA. This gadget will probably lead to adding audio pronunciations becoming part of my routine when creating entries. I'm so meta even this acronym (talk) 18:13, 25 March 2014 (UTC)
  • I am confident that Rillke has the skills to succesfully complete this proposal. I also think it would be quite useful. Bawolff (talk) 17:10, 27 March 2014 (UTC)
  • Simply brilliant! I think I've recorded a few hundred sound samples, especially of single words and the like. Going through the normal uploading process is tedious business, so this gadget would such a boon to the project. I would just like to stress that the audio templates that link the files have to be formatted to have proper links to the Commons file page, not just a link to the file itself. This has been a problem I've noticed on English Wikipedia. Also, it would be great if it suggested standard formats for language prefixes, like the "en-" for English or "de-" for German. Keep up the good work! Peter Isotalo (talk) 17:42, 27 March 2014 (UTC)
  • This tool would be a great help. It would simplify the process of creating Hungarian pronunciation recordings for the English Wiktionary. --Panda10 (talk) 21:49, 27 March 2014 (UTC)
  • Rillke has clearly looked at the existing progress, and prepared for this round. His JavaScript knowledge makes him a good candidate to pick this up. Some of the software he's worked on (e.g. commons:MediaWiki:EnhancedStash.js) is in related areas of the API. Superm401 | Talk 09:27, 29 March 2014 (UTC)
  • I think this project could be valuable. I used to record pronunciation and it brings several problems. However this proposal doesnt solve problems, which people, who do mass pronunciation recordings, it comes with a new solution, how to enrich Wiktionary or Wikiversity with audio recordings. I think to have an easy way to make such recording directly on the "word" page would automatically attract more contributers.--Juandev (talk) 16:42, 30 March 2014 (UTC)
  • I tried the existing extension a few months ago, and really want to see more. With the endorsements from the devs above, I have full confidence in this candidate. Quiddity (talk) 23:00, 31 March 2014 (UTC)
  • I think this is an excellent idea, and I know that if this existed I would spend a least a few days just going through recording pronunciations for obscure English words I know. (Along with making a recording basically everytime I look something up on Wiktionary that doesn't have one yet) Zellfaze (talk) 17:51, 2 April 2014 (UTC)
  • I support endorsement of this project. Pronunciation aids for Wikipedia and Wiktionary would be phenomenally helpful to me on a near-daily basis, and above editors have expressed considerable confidence in the coding skills of the applicants such that they should be able to carry out the project capably. Chubbles (talk) 07:25, 7 April 2014 (UTC)
  • Reposted from nl.wiktionary: Seems like a really, really good idea. The words in Dutch here have recently largely been added by someone with admirable stamina and diligence, after a long drought, because it used to be really tedious to upload to media commons. I do have a question about that. At commons files need to be uploaded as nl-something.ogg or so but then alphabetized under 'something' otherwise everthing ends up under N. Is this automatically done? Another thing is: can we upload sentences. We try and give an example sentence for every word here and I have heard from people who want to learn Dutch that that is really useful. It could be made more so if the audio is there too. Again this would require some adaptation at commons, perhaps a different category than just the word category. It would really be nice if you could automate that. Jcwf (talk) 01:26, 8 April 2014 (UTC)
    • Thank you for re-posting here. I guess you are referring to the sort key? This one looks quite easy to implement. Yes, it will be automatically done.
    • Sentences are useful, sure. However, it would not only require a different category-system at Commons but also a different inserting and detection mechanism for the Wiktionary entry. I have to check how feasible this is first and possibly it does not fit in the time schedule set out above. Uploading and recording longer samples, let's say of about 7 seconds should work the same as recording single words of 2 s length. Just the whole procedure around it will differ. In case I do not manage producing sentence recording in round 1/2014, and given that the tool is going to be popular and used, I'll consider it for a renew request. -- Rillke (talk) 08:30, 8 April 2014 (UTC)
  • Endorse What a great project! Jane023 (talk) 20:41, 14 April 2014 (UTC)
  • Endorse --Purodha Blissenbach (talk) 08:55, 17 April 2014 (UTC)
  • Endorse. It's a pity the extension was never finished, but this seems like a great way to take it forward. I'm confident that Riilke has the required skills, and has obviously thought carefully about how to implement this. the wub "?!" 21:21, 19 April 2014 (UTC)
  • It could be a easyest way to obtain pronunciation files. --Dvdgmz (talk) 11:33, 21 April 2014 (UTC)
  • Support ! This is an awesome project. Yug (talk) 11:46, 1 October 2014 (UTC)