Talk:Mass content adding
one aspect in this project that is totally missing
There is one aspect in this project that is totally missing so far: it has to be checked, whether these small wikipedias really want this mass inclusion. Please do not add such content without consensus. HeikoEvermann 07:54, 24 February 2006 (UTC)
- Here is my answer to your question from the mailing list: For small wikipedias we thought about something more meaningful: pseudo-translation of countries, rivers, mountains, islands, movies, actors etc. from English Wikipedia. Yes, I know that concesus should be reached for 100.000 new articles, but if people from small Wikipedias want that, they will localize software. If they don't want, we wouldn't have even stubs in their language. --Millosh 10:04, 24 February 2006 (UTC)
Hi, I have CSV files for:
100 cities_japan_2.csv 93 cities_japan_3.csv 550 cities_japan.csv 96 departments_france.csv 32 libya_provinces.csv 34 luna_programme.csv 12 netherlands_prov.csv 48 prefectures_japan.csv 9 states_austria.csv 26 states_mexico.csv 15 syria_provinces.csv 40 universities_england.csv 22 venezuela.csv 1077 total
They include stuff like population, area, date founded, etc. Example data:
Omaezaki:御前崎市:Омаезаки:36298:65.78:551.81:[[1 апрел]], []:Шизуока:Omaezaki green tea fields.JPG
Creates an article like tg:Омаезаки (this is one of the better ones — they don't always include images). Anyone interested in this data and the various scripts should contact me at tg:User:Francis Tyers. - FrancisTyers 12:20, 10 February 2007 (UTC)
I think it would be more meaningful if we could add articles from List of articles. Also, we can form a database of events and add year articles containing a few events, a few births and a few deaths. While trying to create bot generated articles in abandoned wikipedia like Pali, Bhojpuri, Oriya, Sanskrit, Bod Skad etc. one major difficulty that I faced was transliteration. One has to manually transliterate all the data if they are to be of any use in non-Latin, non-Cyrillic, non-Greek scripts. Also, the bot scripts are too slow. In the bot script that I am using, it takes 3-5 minutes to create an article. Given that speed, it will take me a lifetime to create and manage a single small wikipedia. Is there any method by which articles can be created and categorized at a faster speed? If there is, please spare the code and instructions to use the code, a page over here as well. Thank you.--Eukesh 18:12, 11 September 2007 (UTC)
We are considering doing this at Norfuk wikipedia, what software tools are recommended to create the articles?
220.127.116.11 12:46, 13 September 2007 (UTC)
Isn't this a horrible idea? Philosophers themselves haven't been able to clarify human language. Even with a supercomputer, how can you expect to create an algorithm that actually generates meaningful "encyclopedic content" en masse, by simply ripping it from a certain source?
In terms of geography, you can make the argument that every tiny hamlet in every remote village in the world is "notable" (that's a bad argument, IMO, but one that's unfortunately influential). However, when it comes to translating things, you can't do that. Try using Google's translator. Even that doesn't work because languages can't yet be translated by machines.
Hence, I don't really understand what the point of this project is. The idea of "mass content adding" sends a chill up my spine, because it takes the power out of the hands of individual editors to dispute such content, as to whether it's notable or not, and so on. Zenwhat 23:26, 14 February 2008 (UTC)
How did this project go, what scripts did you end up using? What kinds of mass additions to small wikis are happening now? An update, even if it wraps up the project, would be useful. -- sj | translate | + 20:46, 1 August 2009 (UTC)
Why should we allow users to create tiny stubs using a bot on wikis without communities large enough to handle stuff like that? I can think of several "bot" wikis where many pages were created by a single bot (like AWB). Examples: Aromanian wiki had a recent mass-deletion of all mass-added pages. lmo:Special:Registri/Snowolf also a lot on Lombard WP. πr2 (t • c) 01:18, 3 January 2013 (UTC)
- If by "recent deletion" you mean 2008, and are willing to ignore the fact that all the year pages and some of the place pages have been recreated since, then your argument has some validity. 18.104.22.168