Grants:IEG/Enhance Proofreading for Dutch
What is the problem you're trying to solve?
Grammar as well as spelling errors are quite common in text, in Wikipedia texts too. LanguageTool can help a lot, but current rules are affected by the Wikipedia mark-up and spellchecking is off.
What is your solution?
I would adjust LT rules in a way that mark-up is no longer falsely reported. I would also add a faster spell checker containing millions of correct Dutch words and most proper names that are currently in Wikipedia.
Improve readability and of Wikipedia articles and thus a wider public.
I will use collected words and word frequencies and word correctness info into a spelling dictionary, better and faster than the current one, which is too slow to switch on. I will add and edit LanguageTool rules, specifically to reduce false alarms.
This will be time- and money-boxed at 1 day a week, for half a year, i.e. 2 days * 20 weeks * 8 hours * € 50
Modifications will be implemented gradually, thus resulting in feedback from daily use.
Additions will be part of regular LT distributions and as such part if its life cycle.
Measures of success
- No more false positives by Wikipedia mark-up - Spell checking enabled on Dutch Wikipedia - all correct Wikipedia proper names accepted by the spell checker
TaalTik - actually R.J. Baars, has created the Dutch LanguageTool implementation from 2006 and was biggest contributor to the Dutch spelling checker for Mozilla, Ooo, LibreOffice based on Hunspell.
Please paste links below to where relevant communities have been notified of your proposal, and to any other relevant community discussions. Need notification tips?
Do you think this project should be selected for an Individual Engagement Grant? Please add your name and rationale for endorsing this project below! (Other constructive feedback is welcome on the discussion page).