User talk:Hjfocs
Add topicHi Marco - can you drop me an email or find me on Telegram to let me know your dad’s name as I need to put him on the list for SFMOMA. Thanks!
Feel free to get in touch with me here!
[edit]Hey Marco!
I am the founder of the project "WorldBrain - Verifying the Internet with Science" (worldbrain.io). Our goal is to develop a browser plugin, that shows you how trustworthy news articles are based on how the represent scientifically verifiable facts. The first step towards this is a bookmarking and meta-browsing tool for science communicators. It will allow users to retrieve related content and meta information to the content they currently consume. One of the first features will be the highlighting and linking of terms in web content that can be referenced with Wikipedia/Wikidata entries.
Daniel Mietchen pointed me to your project and it seems that we follow very similar goals.
It would be great to have a chat in the coming days, if that sounds interesting for you My email address is: oli@worldbrain.io
Looking forward to your answer. Oliver
StrepHit
[edit]Ciao, metto giú alcuni punti citati ieri.
- Per wm2016:Submissions, probabilmente il tuo punto principale è sulla fonte dell'autorevolezza/affidabilità: per Wikidata come per Wikipedia, deriva dalla fiducia nel processo. Tu proponi un processo che generi nuova fiducia/affidabilità nei dati in fieri di Wikidata.
- [1] / w:en:User:Charles Matthews. Altre fonti di biografie molto usate: [2] [3] [4] anche in Wikisource: [5] [6] [7].
- Proprietà famose: d:Wikidata:Database reports/List of properties/Top100, [8], [9] ecc.
- d:Q11985372, w:en:Category:Templates using data from Wikidata
- commons:User:Open Access Media Importer Bot
Nemo 09:21, 3 January 2016 (UTC)
- A (s)proposito, se pensassi di espanderti a un'altra lingua me la sento di sconsigliarti il francese! Questo è un esempio di tempesta che vuoi evitare: w:fr:Wikipédia:Bot/Statut#ListeriaBot. Nemo 12:45, 23 January 2016 (UTC)
Facto Post – Issue 1 – 14 June 2017
[edit]Facto Post – Issue 1 – 14 June 2017
This newsletter starts with the motto "common endeavour for 21st century content". To unpack that slogan somewhat, we are particularly interested in the new, post-Wikidata collection of techniques that are flourishing under the Wikimedia collaborative umbrella. To linked data, SPARQL queries and WikiCite, add gamified participation, text mining and new holding areas, with bots, tech and humans working harmoniously. Scientists, librarians and Wikimedians are coming together and providing a more unified view of an emerging area. Further integration of both its community and its technical aspects can be anticipated. While Wikipedia will remain the discursive heart of Wikimedia, data-rich and semantic content will support it. We'll aim to be both broad and selective in our coverage. This publication Facto Post (the very opposite of retroactive) and call to action are brought to you monthly by ContentMine.
If you wish to receive issues of Facto Post on English Wikipedia, please add your name to our mailing list. You can always remove it.
Newsletter delivered there by MediaWiki message delivery |
Charles Matthews (talk) 14:22, 14 June 2017 (UTC)
Wikidata Entity Linking
[edit]Hi! I am trying to make a map of the exinsting Entity Linking services that can output Wikidata identifiers. Of course most systems that output Wikipedia links can be converted a posteriori to return Qids (by using sitelinks) but I am interested in systems that deal with Wikidata specifically (i.e. which would be able to output Wikidata items which do not have any sitelinks). I saw that in StrepHit you use the Dandelion API. Are you happy with it, and are you aware of any alternatives? Thanks! − Pintoch (talk) 13:08, 29 June 2017 (UTC)
- @Pintoch: I've massively used Wikipedia entity linking services in the past, but as of today I've not explored any that directly output Wikidata identifiers. For StrepHit, I'm pretty much satisfied with the Dandelion APIs, although it doesn't output QIDs.
- Pasting below an old list of services that I've tried so far (and are still alive):
- http://www.opencalais.com/opencalais-demo/
- http://www.intelligenceapi.com/demo/
- http://babelfy.org/
- https://www.textrazor.com/demo
- http://www.txt-werk.de/
- http://nerd.eurecom.fr/documentation
- http://demo.dbpedia-spotlight.org/
- https://bitbucket.org/fbk/twm-lib and https://bitbucket.org/cgiuliano/twm-service
Hope you can find your way there! Cheers, --Hjfocs (talk) 10:33, 7 July 2017 (UTC)
Structured Data on Commons Newsletter, July 19, 2017
[edit]Welcome to the newsletter for Structured Data on Wikimedia Commons! You can update your subscription to the newsletter. Do inform others who you think will want to be involved in the project!
Structured Data on Wikimedia Commons?
[edit]The millions of files on Wikimedia Commons are described with a lot of information or (meta)data. With the project Structured Data on Wikimedia Commons, this data is structured more, and is made machine-readable. This will make it easier to view, search (also multilingually), edit, organize and re-use the files on Commons.
In early 2017, the Sloan Foundation funded this project (see documentation). Development takes place in 2017–2020. It involves staff from the Wikimedia Foundation and Wikimedia Deutschland (WMDE) and many volunteers. To achieve this, Wikibase support is added to Wikimedia Commons. Wikibase is the technology that is also used for Wikidata.
Recent developments: groundwork
[edit]- A new and crucial technical step (federation) now makes it possible to reference data from one Wikibase website in another. Because of this, it will be possible to use Wikidata's items and properties to describe media files on Commons.
- Another important piece of groundwork is under development: so-called Multi-Content Revisions. This feature allows structured data to be stored alongside wiki text, so that one wiki page can contain several types of content.
Team updates
[edit]- Amanda Bittaker was hired as Program Manager for Structured Data on Wikimedia Commons. Amanda will take care of the overall management of the project.
- Sandra Fauconnier (known as Spinster in her volunteer capacity) is the new Community Liaison. She will support the collaboration between the communities (Commons, Wikidata, GLAM) and the product development teams at the Wikimedia Foundation and Wikimedia Deutschland.
- We have open positions for a UX designer and a Product Manager!
Talking with communities and allies
[edit]- Long-term feedback from GLAMs. Besides the Wikimedia community, many external cultural and knowledge institutions (GLAMs - Galleries, Libraries, Archives and Museums) are interested in Structured Data on Commons and are willing to provide feedback on the long-term plans for the project. Alex Stinson, GLAM strategist at the Wikimedia Foundation, is currently in contact with Europeana, DPLA, the Smithsonian and the National Archives of the United States. Alex is also looking for other GLAM institutions who might be able to advise on the long term. If you know of an institution or partner that may be appropriate for consultation, do get in touch with Alex.
- Jonathan Morgan, design researcher, is starting to work on two projects:
- Researching batch upload workflows by interviewing GLAM institutions
- Researching the enrichment, organization and improvement tasks on already uploaded media files by engaging with active Commons contributors. This research follows up on existing research by Wikimedia Deutschland on heavy Commons users.
What comes next?
[edit]- The Structured Data on Commons team meets in the week after Wikimania to lay the groundwork for the next steps. This includes new backend development and design work, for better and more clear integration of the structured data in pages on Wikimedia Commons.
- The project's information pages on Wikimedia Commons will receive a long overdue update in the upcoming months. The team will also work on more and better communication channels. Feedback, wishes and tips are welcome at the project's general talk page.
Get involved
[edit]- Join us at Wikimania! We are present at the hackathon, and there will be a session on Saturday, August 12: Structured Commons: what changes are coming?
- Follow the Structured Data on Commons project on Phabricator: https://phabricator.wikimedia.org/project/profile/34/
- Subscribe to this newsletter to receive it on a talk page of your own choice.
- Do you want to help out translating messages about Structured Data on Commons from English to your own language? Sign up on the translators page.
- Stay tuned for requests for input, discussion and participation as soon as the info portal is refreshed (see above). These will also be announced via this newsletter.
Many greetings from SandraF (WMF) (talk), Community Liaison for this project! 13:55, 19 July 2017 (UTC)
Structured Commons newsletter, October 25, 2017
[edit]Welcome to the newsletter for Structured Data on Wikimedia Commons! You can update your subscription to the newsletter. Do inform others who you think will want to be involved in the project!
- Community updates
- Rama published an article about Structured Commons in Arbido, a Swiss online magazine for archivists, librarians and documentalists: original in French, illustrated and the article translated in English.
- We now have a dedicated IRC channel: wikimedia-commons-sd webchat
- Join the community focus group!
- Translation. Do you want to help out translating messages about Structured Data on Commons from English to your own language? Sign up on the translators page.
- The documentation and info pages about Structured Data on Commons have received a thorough update, in order to get them ready for all the upcoming work. Obsolete pages were archived. There are undoubtedly still a lot of omissions and bits that are unclear. You can help by editing boldly, and by leaving feedback and tips on the talk pages.
- We have started to list tools, gadgets and bots that might be affected by Structured Commons in order to prepare for a smooth transition to the new situation. You can help by adding alerts about/to specific tools and developers on the dedicated tools page. You can also create Phabricator tasks to help keep track of this. Volunteers and developers interested in helping out with this process are extremely welcome - please sign up!
- Help write the next Structured Commons newsletter.
- Structured Data on Commons was presented at Wikimania 2017 in Montréal for a packed room. First design sketches for search functionality were discussed during a breakout session. Read the Etherpad reports of the presentation and the breakout session.
- Katherine Maher, Executive Director of the Wikimedia Foundation, answered questions on Quora. One of her answers, mentioning Structured Data on Commons, was republished on Huffington Post.
- Sandra Fauconnier, Amanda Bittaker and Ramsey Isler from the Structured Commons team will be at WikidataCon. Sandra presents Structured Commons there (with a focus on fruitful collaboration between the Wikidata and Commons communities). If you attend the conference, don't hesitate to say hi and have a chat with us! (phabricator task T176858)
- Team updates
Two new people have been hired for the Structured Data on Commons team. We are now complete! :-)
- Ramsey Isler is the new Product Manager of the Multimedia team.
- Pamela Drouin was hired as User Interface Designer. She works at the Multimedia team as well, and her work will focus on the Structured Commons project.
- Partners and allies
- We are still welcoming (more) staff from GLAMs (Galleries, Libraries, Archives and Museums) to become part of our long-term focus group (phabricator task T174134). You will be kept in the loop of the project, and receive regular small surveys and requests for feedback. Get in touch with Sandra if you're interested - your input in helping to shape this project is highly valued!
- Research
Design research is ongoing.
- Jonathan Morgan and Niharika Ved have held interviews with various GLAM staff about their batch upload workflows and will finish and report on these in this quarter. (phabricator task T159495)
- At this moment, there is also an online survey for GLAM staff, Wikimedians in Residence, and GLAM volunteers who upload media collections to Wikimedia Commons. The results will be used to understand how we can improve this experience. (phabricator task T175188)
- Upcoming: interviews with Wikimedia volunteers who curate media on Commons (including tool developers), talking about activities and workflows. (phabricator task T175185)
In Autumn 2017, the Structured Commons development team works on the following major tasks (see also the quarterly goals for the team):
- Getting Multi-Content Revisions sufficiently ready, so that the Multimedia and Search Platform teams can start using it to test and prototype things.
- Determine metrics and metrics baseline for Commons (phabricator task T174519).
- The multimedia team at WMF is gaining expertise in Wikibase, and unblocking further development for Structured Commons, by completing the MediaInfo extension for Wikibase.
- Stay up to date!
- Follow the Structured Data on Commons project on Phabricator: https://phabricator.wikimedia.org/project/profile/34/
- Subscribe to this newsletter to receive it on a talk page of your own choice.
- Join the next IRC office hour and ask questions to the team! It takes place on Tuesday 21 November, 18.00 UTC.
Warmly, your community liaison, SandraF (WMF) (talk)
Message sent by MediaWiki message delivery - 14:26, 25 October 2017 (UTC)
Iscrizione itWikiCon
[edit]Ciao, ti scrivo perché hai manifestato l'intenzione di partecipare a itWikiCon 2017. L'evento si avvicina: il programma è disponibile in questa pagina. Se confermi la tua partecipazione ti chiediamo di compilare questo form entro il 14 novembre. Ci vediamo dal 17 al 19 novembre a Trento! --Jaqen (talk) 09:18, 28 October 2017 (UTC)
Structured Commons newsletter, December 13, 2017
[edit]Welcome to the newsletter for Structured Data on Wikimedia Commons! You can update your subscription to the newsletter. Do inform others who you think will want to be involved in the project!
- Community updates
- There was a IRC Office Hour about Structured Commons on November 21. You can read the log here.
- Our dedicated IRC channel: wikimedia-commons-sd webchat
- NEW: Participate in a survey that helps us prioritize which tools are important for the Wikimedia Commons and Wikidata communities. The survey runs until December 22. Here's some background.
- NEW: Help the team decide on better names for 'captions' and 'descriptions'. You can provide input until January 3, 2018.
- NEW: Help collect interesting Commons files, to prepare for the data modelling challenges ahead! Continuous input is welcome.
- Join the community focus group!
- Do you want to translate messages and information about Structured Data on Commons from English to your own language? Sign up on the translators page.
- Sandra presented the plans for Structured Commons during WikidataCon in Berlin, on October 29. The presentation focused on collaboration between the Wikidata and Commons communities. You can see the full video here.
- Partners and allies
- We are still welcoming (more) staff from GLAMs (Galleries, Libraries, Archives and Museums) to become part of our long-term focus group (phabricator task T174134). You will be kept in the loop of the project, and receive regular small surveys and requests for feedback. Get in touch with Sandra if you're interested - your input in helping to shape this project is highly valued!
- Research
- Research findings from interviews and surveys of GLAM project participants are being published to the research page. Check back over the next few weeks as additional details (notes, quotes, charts, blog posts, and slide decks) will be added to or linked from that page.
- The Structured Commons team has written and submitted a report about the first nine months of work on the project to its funders, the Alfred P. Sloan Foundation. The 53-page report, published on November 1, is available on Wikimedia Commons.
- The team has started working on designs for changes to the upload wizard (T182019).
- We started preliminary work to prototype changes for file info pages.
- Work on the MediaInfo extension is ongoing (T176012).
- The team is continuing its work on baseline metrics on Commons, in order to be able to measure the effectiveness of structured data on Commons. (T174519)
- Upcoming: in the first half of 2018, the first prototypes and design sketches for file pages, the UploadWizard, and for search will be published for discussion and feedback!
- Stay up to date!
- Follow the Structured Data on Commons project on Phabricator: https://phabricator.wikimedia.org/project/profile/34/
- Subscribe to this newsletter to receive it on a talk page of your own choice.
- Join the next IRC office hour and ask questions to the team! It takes place on Tuesday, February 13, 18.00 UTC in wikimedia-office webchat.
Warmly, your community liaison, SandraF (WMF) (talk)
Message sent by MediaWiki message delivery - 16:32, 13 December 2017 (UTC)
Structured Data on Commons Newsletter - Spring 2018
[edit]Welcome to the newsletter for Structured Data on Wikimedia Commons! You can update your subscription to the newsletter and contribute to the next issue. Do inform others who you think will want to be involved in the project!
- Community updates
- Our dedicated IRC channel: wikimedia-commons-sd webchat
- Several Commons community members are working on ways to integrate Wikidata in Wikimedia Commons. While this is not full-fledged structured data yet, this work helps to prepare for future conversion of data, and helps to understand how Wikidata and Commons can work better together.
- Thanks to Jarekt and other contributors, some Commons templates can now be filled via Wikidata: {{Creator}} (Phabricator) and {{Institution}} (Phabricator). Work is ongoing on the {{Artwork}} template (Phabricator).
- Thanks to Mike Peel and others, Wikidata-powered infoboxes can now be added to Commons categories, with the template {{Wikidata Infobox}}. (Example)
- Multichill is working on an experimental workflow to upload images to Commons via Wikidata (and using metadata from Wikidata). See a part of it here.
- Join the community focus group!
- Do you want to help out translating messages about Structured Data on Commons from English to your own language? Sign up on the translators page.
- Contribute to the next newsletter.
- Discussions held
- Conversation about licensing and copyright modeling.
- High-level discussion on ontology for Commons.
- Review first designs for multilingual captions.
- IRC office hour, 13 February
- Events
- Wikimedia Conference, Berlin, 20-22 April (+ Learning Days 18-19 April): several sessions and workshops around Structured Commons
- EuropeanaTech Conference, Rotterdam, 15-16 May: several presentations + a full workshop day on Monday 14 May about Wikidata and Structured Commons
- Wikimedia Hackathon, Barcelona, 18-20 May: Structured Commons as a focus area.
- Partners and allies
- We are still welcoming (more) staff from GLAMs (Galleries, Libraries, Archives and Museums) to become part of our long-term focus group (phabricator task T174134). You will be kept in the loop of the project, and receive regular small surveys and requests for feedback. Get in touch with Sandra if you're interested - your input in helping to shape this project is highly valued!
- Research
- The research about GLAM contributions to Wikimedia Commons is concluded. A blog post on the Wikimedia blog provides a summary, and you can read the full results on meta.wikimedia.org.
- Prototypes will be available for Multilingual Captions soon.
- Stay up to date!
- Follow the Structured Data on Commons project on Phabricator: https://phabricator.wikimedia.org/project/profile/34/
- Subscribe to this newsletter to receive it on a talk page of your own choice.
- Join the next IRC office hour and ask questions to the team! The date for next quarter will be announced soon.
-- Keegan (WMF) (talk)
Message sent by MediaWiki message delivery - 19:48, 3 April 2018 (UTC)
Structured Data on Commons Newsletter - Summer 2018
[edit]Welcome to the newsletter for Structured Data on Wikimedia Commons! You can update your subscription to the newsletter and contribute to the next issue. Do inform others who you think will want to be involved in the project!
- Community updates
- Our dedicated IRC channel: wikimedia-commons-sd webchat
- Since our last newsletter, the Structured Data team has moved into designing and building prototypes for various features. The use of multilingual captions in the UploadWizard and on the file page has been researched, designed, discussed, and built out for use. Behind the scenes, back-end work on search is taking place and designs are being drawn up for the front-end. There will soon be specifications published for the use of the first Wikidata property on Commons, "Depicts," and a prototype is to be released to go along with that.
- A workshop on what Wikidata properties Commons will need. This workshop will be open for the entire month of July 2018 at minimum.
- Join the community focus group!
- Do you want to help out translating messages about Structured Data on Commons from English to your own language? Sign up on the translators page.
- Contribute to the next newsletter.
- Discussions held
- In late February there was a discussion around how Commons generally sees data being modeled.
- The first discussion on copyright and licensing with Commons was held in March. This was a "high level" discussion, there will be a consultation later this summer about the deeper mapping of copyright and licensing in a structured way.
- In April there was an exercise for GLAM partners in metadata and ontology mapping.
- A discussion about the design for Multilingual Captions on the file page took place in May. You can still review the designs and leave feedback.
- There was an IRC office hour in June to discuss progress so far and future plans.
- Wikimania 2018
- Three sessions about Structured Commons are officially scheduled for Wikimania 2018 - Cape Town, South Africa - July 2018.
- Wikimedia Commons and GLAM needs around the world (Friday 20 July, 10:30 local time)
- Structured Data on Wikimedia Commons and knowledge equity (Friday 20 July, 14:00 local time)
- Design challenge workshop: How can multilingual structured metadata bring knowledge equity to Commons? (Friday 20 July, 14:30 local time)
- Structured Data on Commons is also a focus area during the Wikimania 2018 Hackathon. We will, among other things, do 'live' modelling of Wikidata properties for Commons - an offline spin-off of the community consultation taking place on wiki.
- Partners and allies
- We are still welcoming (more) staff from GLAMs (Galleries, Libraries, Archives and Museums) to become part of our long-term focus group (phabricator task T174134). You will be kept in the loop of the project, and receive regular small surveys and requests for feedback. Get in touch with Sandra if you're interested - your input in helping to shape this project is highly valued!
- Structured Data on Commons was presented to GLAM audiences during EuropeanaTech 2018 in Rotterdam (15 May 2018) and at the Deutsche Digitale Bibliothek Forum in Berlin (4 June 2018).
- Research
Two research projects about Wikimedia Commons are currently ongoing, or in the process of being finished:
- Research:Curation workflows on Wikimedia Commons—a project that seeks to understand the current workflows of Commons contributors who curate media (categorize it, delete it, link to it from other projects, etc.).
- Research:Technical needs of external re-users of Commons media—soliciting feedback from individuals and organizations that re-use Commons content outside of Wikimedia projects, in order to understand their current painpoints and unmet needs.
- Prototypes will be available for Depicts soon.
- Stay up to date!
- Follow the Structured Data on Commons project on Phabricator: https://phabricator.wikimedia.org/project/profile/34/
- Subscribe to this newsletter to receive it on a talk page of your own choice.
- Join the next IRC office hour and ask questions to the team! The date for next quarter will be announced soon.
-- Keegan (WMF) (talk)
Message sent by MediaWiki message delivery - 21:07, 6 July 2018 (UTC)
Structured Data Newsletter - Research link fix
[edit]Greetings,
The newsletter omitted two interwiki prefixes, breaking the links on non-meta wikis as you might see above. Here are the correct links:
- m:Research:Curation workflows on Wikimedia Commons—a project that seeks to understand the current workflows of Commons contributors who curate media (categorize it, delete it, link to it from other projects, etc.).
- m:Research:Technical needs of external re-users of Commons media—soliciting feedback from individuals and organizations that re-use Commons content outside of Wikimedia projects, in order to understand their current painpoints and unmet needs.
My apologies, I hope you find the corrected links helpful.
Structured Data on Commons Newsletter - Fall 2018 edition
[edit]Welcome to the newsletter for Structured Data on Wikimedia Commons! You can update your subscription to the newsletter. Do inform others who you think will want to be involved in the project!
- Community updates
- Multilingual Captions, the first feature release for Structured Data, is coming in January of 2019
- Be on the lookout for the beta testing announcement
- Help using captions has been set up, if you'd like to go ahead and see the workflow
- Two IRC office hours were held since the last newsletter
- Our dedicated IRC channel: wikimedia-commons-sd webchat
Current:
- Help determine and propose properties on Wikidata for Commons
- Review designs for structured licensing and copyright
- Join the community focus group!
Since the last newsletter:
- Review a prototype for searching structured Commons (October 2018)
- "Good coverage" for depicts tagging (Sept. 2018)
- Review and discuss mockups for displaying the new metadata section of the file page (18 September - 9 October 2018)
- Depicts statements draft requirements (14 August - 31 August 2018)
- Identify Wikidata properties that Commons will need (26 June - 14 August 2018)
- Presentation by Keegan on the first features to be released for Structured Data, presented at Wikiconference North America, Columbus, Ohio, October 2018.
- Sandra presented a project update at the GLAM-Wiki conference in Tel Aviv, Israel, November 2018, as part of an update and panel discussion.
- Structured Data on Commons was the subject of a keynote presentation by Sandra (see slides) at the Baltic Audiovisual Archives Council conference in Tallinn, Estonia, November 2018.
- Partners and allies
- The info portal on Structured Commons now includes a section on GLAM (Galleries, Libraries, Archives and Museums).
- We are currently planning the first GLAM pilot projects that will use structured data on Wikimedia Commons. One project has already started: the Swedish Heritage Board researches and develops a prototype tool to provide improved metadata (translations, data additions...) from Wikimedia Commons back to the source institution. Read the project brief.
- The documentation for batch uploads of files to Wikimedia Commons will be improved in 2019, as part of preparing for Structured Data on Wikimedia Commons. To prepare, the GLAM team at the Wikimedia Foundation wants to understand better which types of documentation you already use, and how you like to learn new GLAM-Wiki skills and knowledge. Fill in a short survey to provide input!
- Stay up to date!
- Follow the Structured Data on Commons project on Phabricator: https://phabricator.wikimedia.org/project/profile/34/
- Subscribe to this newsletter to receive it on a talk page of your own choice.
-- Keegan (WMF) (talk)
Message sent by MediaWiki message delivery - 17:58, 7 December 2018 (UTC)
Captions in January
[edit]Structured Data - file captions coming this week (January 2019)
[edit]My apologies if this is a duplicate message for you, it is being sent to multiple lists which you may be signed up for.
Hi all, following up on last month's announcement...
Multilingual file captions will be released this week, on either Wednesday, 9 January or Thursday, 10 January 2019. Captions are a feature to add short, translatable descriptions to files. Here's some links you might want to look follow before the release, if you haven't already:
- Read over the help page for using captions - I wrote the page on mediawiki.org because captions are available for any MediaWiki user, feel free to host/modify a copy of the page here on Commons.
- Test out using captions on Beta Commons.
- Leave feedback about the test on the captions test talk page, if you have anything you'd like to say prior to release.
Additionally, there will be an IRC office hour on Thursday, 10 January with the Structured Data team to talk about file captions, as well as anything else the community may be interested in. Date/time conversion, as well as a link to join, are on Meta.
Thanks for your time, I look forward to seeing those who can make it to the IRC office hour on Thursday. -- Keegan (WMF) (talk) 21:09, 7 January 2019 (UTC)Sent email
[edit]Hi Hjfocs,
I got a message that you sent me an email about a week ago. Due to a change in my email address, I did not get that email. I just fixed it, so if the issue's still relevant, please contact me again.
Best, --RKDdata (talk) 10:21, 13 August 2019 (UTC)
- Same here. (About soweego and mix&match). Feel free to send it again.Maria zaos (talk) 21:34, 15 August 2019 (UTC)
Hi. I understand through the ping system that you sent me an email. However, I didn't receive it. See here for my email. Thanks. --Rosiestep (talk) 08:17, 16 August 2019 (UTC)
- Dear @RKDdata, Maria zaos, and Rosiestep: thanks for reaching out.
- Done. Could you kindly confirm you have received the e-mail? Thanks, Hjfocs (talk) 14:12, 17 August 2019 (UTC)
Could it be that you have a restrictive SPF setting on your email address, Hjfocs? I think your email address domain uses ~all but I might be wrong. phabricator:T65927 is marked resolved but I'm unaware of any actual mitigation used by the WMF's mail relay for MediaWiki emails. Nemo 12:05, 21 August 2019 (UTC)
- @Nemo bis: thanks for the heads-up. I checked if something went wrong with the accounts above, but it seems the the e-mails were correctly sent. I didn't receive any automatic mail delivery notifications neither, so I'm not sure what else I can do: if I understand phabricator:T65927, it might be a recipient problem too, right? What do you think I should check from my side? Cheers, Hjfocs (talk) 13:51, 21 August 2019 (UTC)
- Not all recipients enforce SPF, and those who enforce SPF do not necessarily return any error message. Nemo 18:08, 22 August 2019 (UTC)
Structured Data - blogs posted in Wikimedia Space
[edit]There are two separate blog entries for Structured Data on Commons posted to Wikimedia Space that are of interest:
- Working with Structured Data on Commons: A Status Report, by Lucas Werkmeister, discusses some ways that editors can work with structured data. Topics include tools that have been written or modified for structured data, in addition to future plans for tools and querying services.
- Structured Data on Commons - A Blog Series, written by me, is a five-part posting that covers the basics of the software and features that were built to make structured data happen. The series is meant to be friendly to those who may have some knowledge of Commons, but may not know much about the structured data project.
Vous m'avez envoyé... ?
[edit]Bonjour
Une notification, il y a 8 jours, me dit que vous m'avez envoyé un courrier, mais il m'est impossible de le trouver, pouvez-vous m'aider ? Cordialement, Zerline (talk) 13:08, 2 October 2019 (UTC)
- Bonjour Zerline et merci bien pour votre message.
- Il s'agit peut-être du même problème qui est arrivé à d'autres utilisateurs, du coup je vais vous coller le text du courriel (en anglais) sur votre page personnelle.
- À bientôt! --Hjfocs (talk) 15:42, 2 October 2019 (UTC)
Un grazie e un libro sulla conoscenza libera per te
[edit]Gentile Hjfocs,
oggi ti scrivo a nome dell'associazione Wikimedia Italia per ringraziarti del tempo che hai dedicato ai progetti Wikimedia.
Come piccolo omaggio avremmo piacere di spedirti una copia (tutta in carta riciclata) del libro di Carlo Piana, Open source, software libero e altre libertà. Fornisci un recapito per ricevere una copia del libro.
Pochi giorni fa il mondo ha festeggiato la giornata dell'amore per il software libero, ma ogni giorno è buono per ricordare le garanzie delle licenze libere e le centinaia di migliaia di persone che si sono unite per costruire questo bene comune della conoscenza. Speriamo che questo libro ti sia utile per apprezzare quanto hai fatto e per trasmettere la passione della conoscenza libera a una persona a te vicina.
Se desideri una copia ma non puoi fornirci un indirizzo a cui spedirla, contatta la segreteria Wikimedia Italia e troviamo una soluzione insieme.
Grazie ancora e a presto,
Lorenzo Losa (msg) 15:15, 18 February 2020 (UTC)
OpenLibrary
[edit]Hi Hjfocs, The OpenLibrary can make millions of the books cited by WP and other projects findable, and often makes them readable online for free. Obviously this has the potential to improve statements from potentially to actually verified.
I've been trying to make sense of where soweego stands now, but haven't had much success finding out the status of the project. It appears to have mostly stopped in 2017, but I expect that appearance is not reality.
OpenLibrary has three main identifier types of its own for Authors, Works, and Editions. These in turn support (but do not require) linkage to a number of associated external identifiers such as Wikidata items, VIAF, ISNI, OCLC, ISBN, HathiTrust ID, etc. As of today, there is very little consistency in the population of these identifiers, but the potential for a tool such as soweego to fill in some of these blanks would seem to be great. I would suggest that an initial focus on author identifiers has the largest potential to help. OL has a great number of redundant author records which means a large effort is still needed to merge them. The first step is simply identifying that (for instance) the 480 David Smith entries reflect far fewer than 480 authors, only one of whom wrote a book entitled Where the grass is greener. It turns out to have two records for that author David Marshall Smith, but neither record is directly linked to VIAF, ISNI, Wikidata or anything of the sort. Mining through editions to ISBNs on OCLC does, however, reveal that Q58212475, ISNI:0000000084121408 and VIAF:115129454 all pertain to the same author.
Am I right in thinking that soweego could help automate parts of such work? LeadSongDog (talk) 21:13, 11 March 2020 (UTC)
- Hey @LeadSongDog: thanks a lot for your comments and your endorsement to soweego 2, much appreciated. I'm a fan of what you guys are building at the Internet Archive. Maybe we met when we had lunch at the office after WikiCite 2018? See the thumbnail on the right.
- I think you set your expectations right, the soweego project is quite active: 2017 is when the first proposal was submitted, see Grants:Project/Hjfocs/soweego. The development of version 1 started in July 2018 with a 1-year grant: Grants:Project/Hjfocs/soweego/Timeline#Overview. Version 1.1 was supported by a follow-up rapid grant: Grants:Project/Rapid/Hjfocs/soweego 1.1. Right now, we are in the review phase for version 2 proposal.
- And yes, you're right that soweego can give the Open Library a hand. The main soweego duty is to mine links between Wikidata items and e.g., Open Library ones. Once a pair of entries gets connected, we can do something more like comparing the available information.
- While it's currently out of scope, soweego's core task could be applied for data de-duplication as well, so we can also discuss about using record linkage to find duplicate entries in your catalog.
- In any case, I believe it would make a lot of sense if we let soweego support the Open Library as a target catalog: you'll start seeing benefits once entry pairs get linked. Just added this discussion to my radar.
- soweego 2 proposal timeline is bound to Project Grants round 1 2020 schedule: Grants:Project. Let's definitely keep in touch!
- Best,
- Hjfocs (talk) 17:11, 16 March 2020 (UTC)
- Thank you, that is very encouraging. Sadly, I’ve not been able to get to any WikiCite meetings, but I’m sure that DarTar and company made good use of the time together. As I said, targeting the author identifiers first should show the most rapid improvement. Where would I find discussion on the choice of targets? LeadSongDog (talk) 05:05, 17 March 2020 (UTC)
- Yeah, we had a great time there with DarTar.
- soweego has been indeed targeting author identifiers. The current specific use case revolves around the music and movie domains. See these links for the analysis and choice of target catalogs: Grants:Project/Hjfocs/soweego/Timeline#August_2018:_big_fishes_selection and Grants:Project/Hjfocs/soweego/Timeline#July_2018:_target_selection_&_small_fishes.
- Hope this helps! Best,
- Hjfocs (talk) 10:49, 22 March 2020 (UTC)
Mail spam
[edit]Please don't mail spam me. Thanks. — billinghurst sDrewth 09:44, 17 March 2020 (UTC)
- Hi there, thanks for your comment. I sent you an e-mail because it seems you have been using Mix'n'match (Q28054658), so I believe my message may be relevant for you. I'm sorry to hear that you considered it as spam, although I fully disagree. Moreover, the WMF Project Grants program explicitly asks to encourage community members to post feedback, see Grants:Project/Apply, Next Steps section, bullet point 2.
- Best,
- Hjfocs (talk) 11:07, 22 March 2020 (UTC)
- I doubt that they were meaning for you to bulk mail a string of people drumming up votes as part of a "strategy to encourage". That would get horribly messy if you take that to any reasonable extrapolation. I think that you should rethink your tactics. — billinghurst sDrewth 12:16, 23 March 2020 (UTC)
- I see you are particularly interested in this topic, and will of course refrain from reaching out to you next time. Said that, I'm also persuaded I did my best to comply with WMF recommendations and to give you a rational reply. In any case, the more hostile and arrogant someone gets about something, the less likely I am to engage. As a result, I'm afraid I can't contribute further to this discussion.
- Cheers --Hjfocs (talk) 13:46, 23 March 2020 (UTC)
Email?
[edit]WP mentioned that you have send me an email. But I don't see anything in my inbox. Please use my Talk-Page or ping me. --Aeroid (talk) 12:42, 22 March 2020 (UTC)
- Hello and thank you for reaching out. This is perhaps due to phab:T65927. Anyway, I'll leave the message on your talk page.
- Cheers,
- Hjfocs (talk) 13:59, 22 March 2020 (UTC)
Where do I find the active Soweego community?
[edit]Hi Marco,
I'm interested in Soweego from a coding and linked data perspective. Specifically, I'd like to improve Wikidata coverage of Musicbrainz and potentially other third party IDs. I was wondering if there is a mailing list or part of a Wikimedia wiki somewhere where the developers of the software are hanging out, that I could maybe drop in on to learn more about the project. Feel free to contact me directly at: audiodude@gmail.com.
Thanks, -Travis -Audiodude (talk) 02:44, 2 December 2020 (UTC)
- Hey @Audiodude: thanks for reaching out and for your interest in the project! To cut a long story short, here are a few pointers:
- pretty much all the reading material is available here on Meta. Please have a look at Grants:Project/Hjfocs/soweego, and especially the final report
- the software documentation is at https://soweego.readthedocs.io/
- the first thing you can do from a coding perspective is to get your hands dirty, see https://soweego.readthedocs.io/en/latest/#get-ready
- the key contribution you can make is to import a new catalog: https://soweego.readthedocs.io/en/latest/#contribute
- There is no specific mailing list for the project, but please stay tuned for soweego 2: Grants:Project/Hjfocs/soweego_2
- Best, --Hjfocs (talk) 14:03, 12 December 2020 (UTC)
- @Hjfocs: Thanks for the reply! I've already installed the docker image and run through the steps in https://soweego.readthedocs.io/en/latest/#get-ready for the Musicbrainz catalog. It wasn't entirely clear to me what this had accomplished however, besides the output CSVs that I got. I guess what I'm really interested is how soweego actually creates links in Wikidata. Is this all done through Mix n Match? When I look at the Mix n Match category for Musicbrainz band, it looks like all the unmatched entities are listed based solely on their Musicbrainz ID, which isn't very helpful. Why don't they have listings based on the Musicbrainz "name" field, for example. You would think that the name combined with a location or a "years active" would be enough for a human to match in Mix n Match. Thanks! --Audiodude (talk) 23:00, 5 January 2021 (UTC)
- @Audiodude: I'm really sorry for the huge delay, I just struggle to keep track of these messages. Anyway, it's great to hear that you installed the software!
- To answer your question: soweego uploads confident links directly to Wikidata through a bot, i.e., d:User:Soweego_bot. Here's a pointer to the piece of code responsible for that.
- On the other hand, links that are likely to need curation are instead sent to Mix'n'match (Q28054658). Here's another pointer to the code.
- Concerning your feedback on Mix'n'match: the soweego client definitely needs some love, see this issue for instance.
- Thanks again for your invaluable time!
- Best, --Hjfocs (talk) 14:56, 3 May 2021 (UTC)
Structured data across Wikimedia is starting!
[edit]Ciao bello, come stai? :) Spero di trovarti bene.
Ti scrivo perché mi hanno affidato la comunicazione di un nuovo progetto, Structured Data Across Wikimedia (SDAW), che è già partito da qualche giorno. SDAW è un programma finanziato da un grant, con lo scopo di strutturare le pagine in wikitesto con dati strutturati, per rendere lettura, modifica e ricerca più facile e accessibile fra i progetti, nonché su Internet.
Ti coinvolgo nella questione perché conto di avere un tuo feedback sul progetto. Abbiamo una serie di domande già pronte, ma sentiti libero di condividere ciò che pensi. :) Se poi hai idee su chi coinvolgere in materia, dimmelo oppure invitali direttamente tu. Più siamo, meglio stiamo. :)
Un abbraccione e fatti sentire! --Sannita (WMF) (talk) 14:30, 20 March 2021 (UTC)
- Ma ciao @Sannita (WMF): che piacere leggerti.
- Il progetto mi sembra molto interessante: ho lasciato qualche riflessione iniziale sulla sua pagina di discussione.
- Quanto a persone pertinenti, mi viene in mente al volo chi gravita intorno a Schema.org (Q3475322), che propone uno standard per annotare le pagine Web in modo da essere indicizzate meglio dai motori di ricerca.
- Poi forse chi ha lavorato a DBpedia (Q465), che faceva un po' il contrario, ovvero estrarre dati strutturati da Wikipedia.
- Per non parlare di tutta la comunita' di ricerca (sottoscritto compreso) che si concentra sul processamento delle parti meno strutturate di Wikipedia (principalmente testo, ma anche liste e tabelle).
- Una cosa e' certa: dobbiamo approfondire il discorso.
- Un abbraccio, --Hjfocs (talk) 17:48, 4 May 2021 (UTC)
Notification about a grant request
[edit]Hi.
As someone who commented about my grant request (including the talk page) last year requesting funding to create the GlobalWatchlist extension, I thought you might want to know that I have requested another grant to continue development. You can see the new request at Grants:Project/DannyS712/Continued work on GlobalWatchlist extension - if you support this, I hope you'll consider endorsing the request.
My apologies if you were already aware of the new grant request (I sent a similar message to the subscribers of User:DannyS712/Global watchlist/Updates - sorry for any duplicates).
Thanks, --DannyS712 (talk) 00:43, 19 April 2021 (UTC)
- @DannyS712: thanks for the heads up and best of luck for your project!
- Cheers, --Hjfocs (talk) 15:08, 3 May 2021 (UTC)
Toolforge problem
[edit]Toolforge and its Mix and match doesn't work: https://mix-n-match.toolforge.org/
Do you have any ideas? Also pinging user:Edoardolenzi, user:Magnus Manske and user:MaxFrax96 Estopedist1 (talk) 08:12, 22 January 2024 (UTC)