Community Wishlist Survey 2022/Better diff handling of paragraph splits/Test cases
Some of the diff examples below have had <rudeword> added to them to simulate vandalism.
For ideas on how to generate test cases, see Community_Wishlist_Survey_2022/Better_diff_handling_of_paragraph_splits/Test_strategy#Techniques.
See also w:User:Cacycle/diff_test_cases.
Splitting paragraphs
- https://en.wikipedia.org/w/index.php?title=User:Nux/sandbox&type=revision&diff=749562328&oldid=749562210
- https://test.wikipedia.org/w/index.php?title=Lithium_aluminate&type=revision&diff=552199&oldid=546537
- Does not highlight the word that was added or changed. This is a problem when the paragraph is long and the reader has to read the whole thing just to find a minor change.
- https://test.wikipedia.org/w/index.php?title=Zayed_University&type=revision&diff=552200&oldid=547110
- Does not highlight the words that were removed. Same problem as above when the paragraph is long.
- https://test.wikipedia.org/w/index.php?title=13th_Venice_International_Film_Festival&type=revision&diff=552271&oldid=485968
- Splitting a paragraph in a wikitext table.
- https://test.wikipedia.org/w/index.php?title=SpaceX_Mars_transportation_infrastructure&type=revision&diff=552272&oldid=547056
- Splitting a paragraph in a list.
- https://test.wikipedia.org/w/index.php?title=Y%C3%AAn_B%C3%A1i_Province&type=revision&diff=552419&oldid=552377
- Splitting a paragraph by adding a single newline. Not modifying the paragraph in any other way. In the HTML, this does not actually lead to a new paragraph displaying (this behaviour depends on the exact wikitext you are editing).
- https://test.wikipedia.org/w/index.php?title=Y%C3%AAn_B%C3%A1i_Province&type=revision&diff=552420&oldid=552419
- Splitting a paragraph by adding two newlines. Not modifying the paragraph in any other way.
- https://test.wikipedia.org/w/index.php?title=Strathbungo&type=revision&diff=552424&oldid=540376
- Splitting paragraph and adding word (<addedword>).
- https://test.wikipedia.org/w/index.php?title=Strathbungo&type=revision&diff=552425&oldid=552424
- Splitting paragraph and removing word ("Princes").
- https://test.wikipedia.org/w/index.php?title=Tto_page_9&type=revision&diff=552426&oldid=202872
- This was originally split with only one newline. I modified it to two newlines.
- https://test.wikipedia.org/w/index.php?title=Strathbungo&type=revision&diff=552425&oldid=540376
- Splitting one paragraph into multiple paragraphs, with removals and additions.
- https://test.wikipedia.org/w/index.php?title=Curium&type=revision&diff=552443&oldid=470454
- Splitting a paragraph and adding whitespace. Diff output is confusing.
- https://test.wikipedia.org/w/index.php?title=Earl_of_Tyrone&type=revision&diff=552444&oldid=550515
- Splitting one paragraph into two but adding a new paragraph between them.
- https://test.wikipedia.org/w/index.php?title=MediaWiki%3AFoobar.js&type=revision&diff=552527&oldid=552526
- Splitting paragraphs in JavaScript.
- https://test.wikipedia.org/w/index.php?title=MediaWiki%3AFoobar.js&type=revision&diff=552528&oldid=552526
- Splitting paragraphs in JavaScript, with modification.
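The cases in this section share one failure mode: a line-based diff sees a split paragraph as one removed line and several added lines, so a small change inside it gets no highlight of its own. The Python sketch below illustrates the idea with the standard library's difflib. It is not wikidiff2's algorithm; the function names and the sample text are made up for illustration.

```python
from difflib import SequenceMatcher


def changed_lines(old: str, new: str) -> tuple[list[str], list[str]]:
    """Line-based comparison: return (removed_lines, added_lines)."""
    a, b = old.split("\n"), new.split("\n")
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    removed, added = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            removed.extend(a[i1:i2])
            added.extend(b[j1:j2])
    return removed, added


def added_words(removed: list[str], added: list[str]) -> list[str]:
    """Rejoin both sides, ignoring line breaks, and return only the new words."""
    a = " ".join(removed).split()
    b = " ".join(added).split()
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    words = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        if tag in ("insert", "replace"):
            words.extend(b[j1:j2])
    return words


# Hypothetical sample: one paragraph split by two newlines, with a word added.
old = "Intro.\nFirst sentence. Second sentence.\nOutro."
new = "Intro.\nFirst sentence.\n\nSecond <rudeword> sentence.\nOutro."

removed, added = changed_lines(old, new)
print(removed)
print(added)
print(added_words(removed, added))
```

On this sample, the line-based pass reports the whole original paragraph as removed and both new paragraphs as added, while the word-level pass over the rejoined text isolates <rudeword> as the only new word.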
Combining paragraphs
- https://test.wikipedia.org/w/index.php?title=Exponential_factorial&type=revision&diff=551697&oldid=447426
- Two paragraphs next to one another are combined into one. Notice the entire second paragraph is highlighted, which might hide the fact that there is a <rudeword>.
- https://test.wikipedia.org/w/index.php?title=Kucing&type=revision&diff=551694&oldid=485108
- https://test.wikipedia.org/w/index.php?title=Zimbabwe_flyafrica.com&type=revision&diff=551695&oldid=547133
- https://test.wikipedia.org/w/index.php?title=Kogia_breviceps&type=revision&diff=551696&oldid=551688
- https://test.wikipedia.org/w/index.php?title=Puerto_Rico&type=revision&diff=552179&oldid=552176
- Moving multiple paragraphs into one
Moving paragraphs
- https://en.wikipedia.beta.wmflabs.org/w/index.php?title=BetterDiff:MovedAChangedWord&type=revision&diff=567469&oldid=567468 (also includes replacing a word)
- https://test.wikipedia.org/w/index.php?title=Swakopmund&type=revision&diff=552104&oldid=547079
- Moves one paragraph and makes a number of changes to it. Still recognized as a moved paragraph. This demonstrates that, even after changing a paragraph a lot, we can still recognize it as a modified version of a previous paragraph, rather than a new paragraph.
- https://test.wikipedia.org/w/index.php?title=Reparatus&type=revision&diff=552183&oldid=513078
- One paragraph is split into multiple paragraphs, which are then reordered.
Adding sentences to paragraphs
- https://test.wikipedia.org/w/index.php?title=BetterDiffs1&action=historysubmit&type=revision&diff=552101&oldid=552099
- The original paragraph had a lot of extra text added to it. The original sentences of the paragraph are not recognized as having existed before, so <rudeword> is not highlighted.
- https://test.wikipedia.org/w/index.php?title=Earl_of_Tyrone&type=revision&diff=552457&oldid=552456
- Lots of extra sentences added to an existing paragraph, which causes the change to be seen as the removal of one paragraph and the addition of a new one. The two paragraphs are not vertically aligned in the diff output.
Moving sentences within paragraphs
- https://test.wikipedia.org/w/index.php?title=Hazaragi_cuisine&type=revision&diff=552098&oldid=552097
- Move a single sentence within a paragraph.
- https://test.wikipedia.org/w/index.php?title=Hazaragi_cuisine&type=revision&diff=552097&oldid=447936
- The first and second "halves" of the paragraph are swapped around, with one word replaced by <rudeword>.
- https://test.wikipedia.org/w/index.php?title=BetterDiffs1&type=revision&diff=552103&oldid=552102
- Two very similar sentences are moved. Notice that it looks as if only "The cat sat on the mat" was moved but it was in fact both sentences.
Moving sentences between paragraphs
- https://test.wikipedia.org/w/index.php?title=Lucre%C8%9Biu_P%C4%83tr%C4%83%C8%99canu&type=revision&diff=551700&oldid=532796
- The last sentence in a paragraph is moved to the first sentence of the next paragraph.
Splitting words
- https://test.wikipedia.org/w/index.php?title=Kirchliche_Arbeit_Alpirsbach&type=revision&diff=552507&oldid=513390
- Split a URL at "/". Will the new wikidiff2 algorithm correctly put the URL back together when it tries to detect the paragraph split?
- https://test.wikipedia.org/w/index.php?title=Kirchliche_Arbeit_Alpirsbach&type=revision&diff=552510&oldid=513390
- Same as previous item, but with <rudeword> added.
- https://test.wikipedia.org/w/index.php?title=Israeli_Nano_Satellite_Association&diff=556412&oldid=513797
- Compare the source and visual diff.
- https://test.wikipedia.org/w/index.php?title=Israeli_Nano_Satellite_Association&diff=556413&oldid=556412&diffmode=source
- https://test.wikipedia.org/w/index.php?title=Israeli_Nano_Satellite_Association&diff=556414&oldid=556413
- The strikethrough used by the visual diff is easy to confuse with a dash character. The inline diff squashes spaces (phab:T327642).
- https://test.wikipedia.org/w/index.php?title=Israeli_Nano_Satellite_Association&diff=556415&oldid=556414
Modifying words
- https://test.wikipedia.org/w/index.php?title=Yên_Bái_Province&type=revision&diff=552374&oldid=548024
- Doing a small modification to multiple words next to one another.
- https://test.wikipedia.org/w/index.php?title=Yên_Bái_Province&type=revision&diff=552375&oldid=552374
- Replacing some words, modifying others. Compare how they look in the visual diff vs. the wikitext diff. See how in the visual diff the unchanged "," splits the diff into two.
- https://test.wikipedia.org/w/index.php?title=Yên_Bái_Province&type=revision&diff=552377&oldid=552376
- Modifying some words next to one another. Notice in the visual diff that the unchanged "means" splits it into two diffs.
- https://test.wikipedia.org/w/index.php?title=Yên_Bái_Province&type=revision&diff=552376&oldid=552375
- Replacing some words, modifying others. The diff appears as one block, rather than being split.
Replacing newlines
- https://en.wikipedia.org/w/index.php?title=Aristide_Briand&diff=next&oldid=862233861
- Newline replaced by list item.
- https://test.wikipedia.org/w/index.php?title=Earl_of_Tyrone&type=revision&diff=552456&oldid=552444
- Newline replaced by word. The visual diff is confusing.
Characters
- https://test.wikipedia.org/w/index.php?title=Test&diff=prev&oldid=476730
- Two words combined into one, but with "{{". The visual diff is confusing. I think the diff treats punctuation differently.
- https://test.wikipedia.org/w/index.php?title=Multipage_poem_test/a&type=revision&diff=552501&oldid=183798
- More examples of adding punctuation characters and how diff treats them. The visual diff is confusing.
- https://test.wikipedia.org/w/index.php?title=Wikip%C3%A9dia:Administradores/Pedidos_de_aprova%C3%A7%C3%A3o/!SilentTest/17&type=revision&diff=552502&oldid=326572
- The individually added ";" is highlighted on its own.
- https://test.wikipedia.org/w/index.php?title=Wikipédia:Administradores/Pedidos_de_aprovação/!SilentTest/17&type=revision&diff=552503&oldid=552502
- Removing characters surrounded by parentheses.
- https://test.wikipedia.org/w/index.php?title=Reparatus&type=revision&diff=552183&oldid=513078
- It treats spaces as separate tokens (e.g. phab:T326773) leading to a confusing inline diff (compared to visual diff).
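To make the last point concrete, here is a minimal Python sketch of what treating spaces as separate tokens means for a token-level diff. The tokenizer regex and function names are assumptions chosen for illustration and do not reproduce how wikidiff2 or the inline diff actually tokenizes text.

```python
import re
from difflib import SequenceMatcher


def tokenize(text: str, keep_spaces: bool) -> list[str]:
    """Split into whitespace runs, word runs, and single punctuation characters."""
    tokens = re.findall(r"\s+|\w+|[^\w\s]", text)
    if keep_spaces:
        return tokens
    return [t for t in tokens if not t.isspace()]


def changed_tokens(old: str, new: str, keep_spaces: bool):
    """Return (tag, old_tokens, new_tokens) for every non-equal span."""
    a = tokenize(old, keep_spaces)
    b = tokenize(new, keep_spaces)
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return [
        (tag, a[i1:i2], b[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


# The only edit here is an extra space between two words.
print(changed_tokens("a b", "a  b", keep_spaces=True))
print(changed_tokens("a b", "a  b", keep_spaces=False))
```

When whitespace runs are kept as tokens, the extra space shows up as a change of its own; when they are dropped, the two versions compare as identical.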
Style/Text Formatting
- https://test.wikipedia.org/w/index.php?title=Im_in_ur_page&type=revision&diff=552552&oldid=75550
- Bold: Does not highlight the whole word that is bold
- https://test.wikipedia.org/w/index.php?title=Nicaea_of_Macedon&type=revision&diff=552553&oldid=459838
- Bold: When you bold specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=Arboretum_de_la_Pipe_Qui_Fume&type=revision&diff=552554&oldid=485827
- Italic: Does not highlight the whole word that is italic
- https://test.wikipedia.org/w/index.php?title=Hakon_Sunnivasson&type=revision&diff=552555&oldid=515170
- Italic: When you italicize specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=Task_Force_76&type=revision&diff=552556&oldid=541391
- Bold & Italic: Does not highlight the whole word that is bold and italic
- https://test.wikipedia.org/w/index.php?title=Test_3&type=revision&diff=552558&oldid=473609
- Bold & Italic: When you bold and italicize specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=OshwahSpeedyTest&type=revision&diff=552562&oldid=552561
- Strikethrough: Does not highlight the whole word that is strikethrough
- https://test.wikipedia.org/w/index.php?title=Test_cookie_problem11&type=revision&diff=552563&oldid=395728
- Strikethrough: When you strike through specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=St_Martin%27s_House&type=revision&diff=552550&oldid=511874
- Small Font: Does not highlight the whole word that is small font
- https://test.wikipedia.org/w/index.php?title=Favilla_Guitars&type=revision&diff=552551&oldid=540971
- Small Font: When you apply small font to specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=Comparison_of_the_healthcare_systems_in_Canada_and_the_United_States&type=revision&diff=552773&oldid=546463
- Big Font: Does not highlight the whole word that is big font
- https://test.wikipedia.org/w/index.php?title=Herkogamy&type=revision&diff=552547&oldid=485736
- Big Font: When you apply big font to specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=Navigable_TOC&diff=552910&oldid=397298
- Superscript: Does not highlight the whole word that is superscript
- https://test.wikipedia.org/w/index.php?title=LegIt_test_page&diff=552911&oldid=384426
- Superscript: When you superscript specific letters from a word, it highlights the whole word besides the change
- https://test.wikipedia.org/w/index.php?title=Page383&diff=552912&oldid=372798
- Subscript: Does not highlight the whole word that is subscript
- https://test.wikipedia.org/w/index.php?title=Krishnalal_Basak&diff=552913&oldid=489245
- Subscript: When you subscript specific letters from a word, it highlights the whole word besides the change
Replacing words
- https://test.wikipedia.org/w/index.php?title=Reinier_de_Graaf_%28architect%29&diff=556417&oldid=552005
- The inline diff shows two individual words being replaced followed by a whole phrase. Compare this to visual diff where the whole phrase is shown as replaced.
- https://en.wikipedia.org/w/index.php?title=Christmas_Present_(Andy_Williams_album)&diff=next&oldid=403692546
More complicated/uncategorised examples
- https://en.wikipedia.org/w/index.php?title=Calais_Conference_(December_1915)&diff=prev&oldid=1118143220
- https://en.wikipedia.org/w/index.php?title=Calais_Conference_(December_1915)&diff=prev&oldid=1118137858
- https://en.wikipedia.org/w/index.php?title=Calais_Conference_(December_1915)&diff=prev&oldid=1118136368
- https://test.wikipedia.org/w/index.php?title=Template:No_article_text&diff=prev&oldid=455804
- This is a really confusing change, especially the move on line 7.