File:Abstract Wikipedia Data Science with Outreachy.mp3

From Meta, a Wikimedia project coordination wiki

Abstract_Wikipedia_Data_Science_with_Outreachy.mp3 (MP3 audio file, length 41 min 4 s, 192 kbps overall, file size: 56.4 MB)

This is a file from the Wikimedia Commons. The description on its description page there is copied below.

Summary

Description
English: Aisha Khatun and Liudmila Kalina (Jade) constructed a complex data pipeline using Wikimedia Cloud Services, culminating in a web-based tool that finds user-generated code (MediaWiki Scribunto modules written in Lua) that appears to be heavily used and similar across the different language editions of Wikipedia and its sibling projects. At the time of this recording, such code is not centralized across the projects. Further data analysis, tooling, and the demo web application are discussed at https://meta.wikimedia.org/wiki/Abstract_Wikipedia/Data#Get_important_modules_and_find_modules_similar_to_each_other .

Aisha and Jade did this work as part of the Outreachy internship program ( https://www.outreachy.org/ ), the goal of which is to increase diversity in open source. Mentorship was provided by the Wikimedia Foundation, the nonprofit that supports Wikipedia and its sister projects.

Aisha and Jade's techniques and work fit into Wikifunctions, a collaborative platform for authoring programming functions, and Abstract Wikipedia, a natural language generation initiative. Both are described in this Communications of the ACM paper:

https://cacm.acm.org/magazines/2021/4/251343-building-a-multilingual-wikipedia/fulltext
Date
Source Own work
Author ABaso (WMF)

Licensing

I, the copyright holder of this work, hereby publish it under the following license:
This file is licensed under the Creative Commons Attribution-Share Alike 4.0 International license.
You are free:
  • to share – to copy, distribute and transmit the work
  • to remix – to adapt the work
Under the following conditions:
  • attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
  • share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.

Captions

Outreachy interns Aisha Khatun and Liudmila Kalina (Jade) discuss their analysis of Scribunto module similarity across Wikimedia projects

Items portrayed in this file

2 March 2021

audio/mpeg

a4e13a5292033b81cd4d88351987878dfe454c30

59,142,092 bytes

2,464.164 seconds

File history

Date/Time | Dimensions | User | Comment
current: 11:22, 25 March 2021 | 41 min 4 s (56.4 MB) | ABaso (WMF) | Uploaded own work with UploadWizard
