Community Wishlist/Wishes/A way to see why a file is somewhere underneath a specific category (tool to show cat-path)
Description



On Commons, many categories, especially the higher-order large ones, have many off-topic files within their category branch. That usually is because of some miscategorization where somewhere far down in its subcategories, a category was included that isn't really about the subject of the parent category.
With the deepcategory search operator, one can see files of a category, including files in all its subcategories. This often shows many offtopic files due to the issue explained above, as can be seen in the first image on the right (another example are diagrams and normal photographs underneath the c:Category:Microscopic images which is supposed to be for microscopy photos only).
Fixing up categories and removing offtopic files is currently really difficult though. That is because the file could be included in the search results because it's in some subcategory 10 levels away from the category one has used deepcategory with. So for example if one uses the search deepcategory:"Buildings"
in MediaSearch and there is some microscopic image of a virus in the image results, one can't find out why it's there. One has to go to the file page and look at the file's categories and navigate upward on the category one thinks is the likely culprit and this often doesn't work and generally takes way too much time. For example it could be Buildings->Buildings by function->Agricultural buildings->[10 more levels]->Veterinary virology.
What is needed is a tool or functionality that shows the categorization path of the file to the parent category so that one can spot the faulty categorization and fix it. This is in fact already possible with the FastCCI tool but that tool usually doesn't load (phab:T367652) and that functionality is not the main use of the tool, e.g. it would load way too long and one can't use it for files/views found as described above. However, its code could maybe be used for this and I suggested what I'm proposing here also on the tool's talk page here. This feature would also be useful when images that shouldn't be there show up in a petscan where one can intersect Commons categories. Issue 182 about it at petscan.
Not having this feature greatly limits the usefulness of categories on Commons which can hardly be refined so that they really only show files about the category subject and impedes searching them using deepcategory as well as viewing categories with a modern scrollable wall-of-text view instead of having to navigate many subcategories in which files are dispersed.
Examples
- this petscan scan shows many image files as being in the Videos cat
- this petscan scan shows files that are not science fiction films
- this scan shows several files that are not comedy like this or this
- Charts like c:File:Life expectancy in Albania.svg are in c:Category:Maps of the world as this scan shows, which may be because Category:Demographics of the European Union is falsely in the category tree of Category:Maps of the European Union, and second Category:Maps of Afro-Eurasia is falsely in Category:Maps of the world indicating regions
- Why is this video of the ISS somewhere in the "Fantasy art" cat? Scan: Fantasy_art + Videos by file format
- Why is this 3D STL file in the Videos by file format cat (same scan as above)?
- Why are there these many unrelated files in Audio files of music + Hip hop music (glamorous, another tool, showed e.g. this file which doesn't belong there
- Why are these unrelated files in Dark wave (music genre)?
By the way this would also be useful to display the source of categorization for seeing how the file relates to the selected category (which is usually specified in the chain of category names).
Somebody please enable showing a category path beneath each item in the thumbnail view (and a button per item in the normal table view). It would be best as a native functionality but a gadget would help a lot too and maybe could later be turned into a native feature.
It could improve data quality a lot and fix many categorization problems that otherwise would be very difficult/unlikely to find.
Assigned focus area
Unassigned.
Type of wish
Feature request
Related projects
Wikimedia Commons
Affected users
Commons users
Other details
- Created: 18:38, 7 June 2025 (UTC)
- Last updated: 14:27, 19 June 2025 (UTC)
- Author: Prototyperspective (talk)