User:Mudeh Dieudonne Mukum/Dicare-tool

From Meta, a Wikimedia project coordination wiki

Dicare Tools

Dicare Tools is a utility suite designed to help you gather statistics and contribute to Wikimedia projects. It provides specialized tools for data analysis, importing external datasets into Wikidata, tracking editing challenges, and exploring lexeme or property issues.

Project Status

Actively maintained – Last GitHub update: March 2025.

Basic Functionality

Dicare Tools is designed for Wikidata contributors, researchers, and Wikimedia challenge participants. It helps users:

  • Track editing challenges such as #100wikidays, providing insights into progress and contributions.
  • Import external data sources (e.g., BnF, IdRef) into Wikidata for enrichment and cross-referencing.
  • Analyze lexemes, properties, and inter-project relationships to better understand data structures within Wikimedia projects.

The suite relies on Wikidata dumps, APIs, and MySQL databases to process and visualize data. For instance, the #100wikidays tool retrieves user contributions across Wikimedia projects based on parameters such as username, project, namespace, and timezone.
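As a rough illustration of how those parameters could map onto a contributions lookup, the sketch below builds a MediaWiki Action API request for a user's page creations. It is not taken from the Dicare Tools source, which may query database replicas instead; the `API_ENDPOINTS` mapping and the `usercontribs_url` helper are hypothetical names introduced here.

```python
from urllib.parse import urlencode

# Hypothetical mapping from a project key to its API endpoint (illustration only).
API_ENDPOINTS = {
    "wikidata": "https://www.wikidata.org/w/api.php",
    "enwiki": "https://en.wikipedia.org/w/api.php",
}


def usercontribs_url(username, project="wikidata", namespace=0, limit=100):
    """Build an Action API URL listing the pages a user created."""
    params = {
        "action": "query",
        "list": "usercontribs",
        "ucuser": username,
        "ucnamespace": namespace,
        "uclimit": limit,
        "ucshow": "new",              # only contributions that created a page
        "ucprop": "title|timestamp",  # timestamps are returned in UTC
        "format": "json",
    }
    return API_ENDPOINTS[project] + "?" + urlencode(params)
```

The API returns timestamps in UTC, so a timezone setting would have to be applied afterwards, when contributions are grouped by day.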

Limitations

Dicare Tools cannot:

  • Automatically resolve data conflicts during imports; manual validation is required.
  • Support non-Wikimedia projects or import data in non-JSON formats.

Prerequisites

Before using this tool, you should be familiar with:

  • Wikidata’s data structure and SPARQL queries.
  • Basic MySQL database management.

You should have:

  • A MySQL server with credentials (DB_HOST, DB_USER, DB_PASSWORD, DB_NAME).
  • Read/write access to Wikidata JSON dumps (stored in /home/wikidata/dumpfiles by default).
  • PHP and Apache/Nginx for hosting the tool.

How to use Dicare Tools

Track #100wikidays Progress

  1. Navigate to the #100wikidays tool at https://dicare.toolforge.org/100wikidays/
  2. Enter your Wikimedia username.
  3. Select a 'Project' (e.g., Wikipedia, Wikidata) and 'Namespace' (e.g., Main, Lexeme).
  4. Adjust the timezone and limit for results.
  5. Click "Submit" to generate your progress report.
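The timezone setting matters because a contribution made late in the evening UTC can fall on the next calendar day locally. The sketch below shows one way a daily streak could be computed from UTC timestamps and a fixed UTC offset. The `longest_daily_streak` function is a hypothetical helper, not the tool's actual logic, and a fixed offset ignores daylight saving time.

```python
from datetime import datetime, timedelta, timezone


def longest_daily_streak(utc_timestamps, utc_offset_hours=0):
    """Return the longest run of consecutive local days with a contribution."""
    local_tz = timezone(timedelta(hours=utc_offset_hours))
    days = set()
    for ts in utc_timestamps:
        # MediaWiki-style UTC timestamps, e.g. 2025-03-01T23:30:00Z
        moment = datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ")
        moment = moment.replace(tzinfo=timezone.utc)
        days.add(moment.astimezone(local_tz).date())

    best = current = 0
    previous = None
    for day in sorted(days):
        if previous is not None and day - previous == timedelta(days=1):
            current += 1   # the streak continues
        else:
            current = 1    # a gap (or the first day) starts a new streak
        best = max(best, current)
        previous = day
    return best
```

For the timestamps `2025-03-01T23:30:00Z`, `2025-03-02T23:30:00Z`, and `2025-03-04T10:00:00Z`, the streak is 2 days at UTC but 3 days at UTC+2, because the two late-evening contributions shift to the following local day.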

Import Data from BnF/IdRef to Wikidata

Given the URL of a notice in the catalog of the BnF (Bibliothèque nationale de France, the French national library), the 'BnF To Wikidata' tool produces QuickStatements commands to create a new Wikidata item about a person, with statements referenced by the BnF.

  1. Access the 'BnF To Wikidata' or 'IdRef To Wikidata' tool.
  2. Configure the source data path (default: /home/wikidata/dumpfiles).
  3. Map BnF/IdRef fields to Wikidata properties.
  4. Validate and execute the import process.
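To give an idea of what the generated output might look like, the sketch below assembles QuickStatements V1 commands (tab-separated) for a new person item. The `person_to_quickstatements` function and its field selection are assumptions for illustration; the real tool decides which statements and references to emit. It uses P31 (instance of) with Q5 (human), P268 (BnF ID), and S854 (reference URL as a source).

```python
def person_to_quickstatements(label, language, bnf_id, notice_url):
    """Return QuickStatements V1 commands that create a person item."""
    # In V1 syntax, source properties use an "S" prefix instead of "P".
    reference = ["S854", f'"{notice_url}"']
    rows = [
        ["CREATE"],                                    # start a new item
        ["LAST", f"L{language}", f'"{label}"'],        # label in one language
        ["LAST", "P31", "Q5"] + reference,             # instance of: human
        ["LAST", "P268", f'"{bnf_id}"'] + reference,   # BnF ID
    ]
    return "\n".join("\t".join(row) for row in rows)
```

Calling it with a label, a language code such as `fr`, a BnF identifier, and the notice URL yields four lines that can be pasted into QuickStatements for review before execution, which matches the manual validation step above.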

Explore Lexemes

  1. Use the 'Lexemes Party' or 'Lexemes Challenge' tools.
  2. Insert a title.
  3. Insert a SPARQL query, for example: SELECT ?concept { wd:Q51401 p:P462 [ rdf:type wikibase:BestRank ; ps:P462 ?concept ; pq:P1545 ?rank ] } ORDER BY xsd:integer(?rank)
  4. Input a Wikidata concept (e.g., P5137 or Q3133).
  5. Filter lexemes by language or lexical category and search.
  6. Export results for further analysis.
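Before pasting a query into the tool, it can help to check what it returns. The sketch below prepares the example query for the public Wikidata Query Service and extracts item identifiers from a result in the standard SPARQL JSON format. Whether the Lexemes tools call this endpoint in the same way is an assumption; `wdqs_url` and `concept_ids` are hypothetical helper names.

```python
from urllib.parse import urlencode

WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# The example query from the steps above, reformatted for readability.
QUERY = """
SELECT ?concept {
  wd:Q51401 p:P462 [
    rdf:type wikibase:BestRank ;
    ps:P462 ?concept ;
    pq:P1545 ?rank
  ]
}
ORDER BY xsd:integer(?rank)
"""


def wdqs_url(query):
    """Build a GET URL that asks the query service for JSON results."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": query.strip(), "format": "json"})


def concept_ids(result):
    """Extract identifiers such as Q3133 from a SPARQL JSON result."""
    ids = []
    for binding in result["results"]["bindings"]:
        uri = binding["concept"]["value"]   # e.g. http://www.wikidata.org/entity/Q3133
        ids.append(uri.rsplit("/", 1)[-1])  # keep only the trailing identifier
    return ids
```

The URL from `wdqs_url(QUERY)` can be opened in a browser or fetched with any HTTP client, and the parsed JSON response can then be passed to `concept_ids`.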

Troubleshooting

'Database Connection Errors'

  • Verify conf/conf.inc.php contains correct MySQL credentials.

'Missing Wikidata Dumps'

  • Ensure the /home/wikidata/dumpfiles directory is accessible and populated.
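A quick way to run that check is a small script executed as the user that runs the tool. The sketch below is a hypothetical helper, not part of Dicare Tools, and it only verifies that the directory exists, is readable, and is not empty; it does not validate the dump files themselves.

```python
import os


def check_dump_directory(path="/home/wikidata/dumpfiles"):
    """Return a list of problems with the dump directory (empty if none)."""
    if not os.path.isdir(path):
        return [f"{path} does not exist or is not a directory"]
    if not os.access(path, os.R_OK | os.X_OK):
        return [f"{path} is not readable by the current user"]
    if not os.listdir(path):
        return [f"{path} is empty"]
    return []


if __name__ == "__main__":
    for problem in check_dump_directory():
        print(problem)
```

An empty list means the directory passed all three checks.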

How to get help and report issues

Developer documentation

Technical implementation

The tool uses PHP for backend logic and MySQL for data storage. Wikidata dumps are processed using wdtk-import.jar, a Java utility for batch imports.

Code structure

  • 'scripts/': Contains wdtk-import.jar for data processing.
  • 'conf/': Stores configuration files like conf.inc.php.

Local development

Set up

To set up a development environment:

  1. Clone the repository and navigate into its directory:
 git clone https://github.com/envlh/dicare-tools
  2. Follow the installation instructions in the README: https://github.com/envlh/dicare-tools/blob/master/README.md#installation

Configure

  1. Follow the configuration instructions in the README: https://github.com/envlh/dicare-tools/blob/master/README.md#configuration

How to contribute

Ways to Participate:

  • Feature Requests & Bug Fixes: Submit via GitHub Issues.
  • Documentation Improvements: Enhance clarity or accuracy.

Credits

Developed by envlh (User:envlh) and hosted on Wikimedia Toolforge.

License

See the LICENSE file for licensing details: https://github.com/envlh/dicare-tools/blob/master/LICENSE