Abstract Wikipedia/Ideas

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search
Other languages:
English

Structured Comments, Attributes and Decorators[edit]

Structured comments, attributes and decorators could be utilized by Wikilambda to provide features such as: function metadata, facilitating searching for functions, and the automatic generation of documentation.

Structured comments are programming language comments which utilize syntactic patterns or XML so that the contents of the comments can be mechanically processed.

Attributes can be attached to functions and their parameters in programming languages such as C# and Java.

Decorators are a proposed JavaScript language feature.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Versioning[edit]

Utilizing structured comments, attributes, or decorators, versioning-related metadata can simplify versioning scenarios on evolving crowdsourced resources.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Namespaces and Modules[edit]

Namespaces and modules can be useful when organizing large collections of functions. With namespaces or modules, multiple paradigms or ecosystems of functions could more readily coexist in a crowdsourced resource.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Scripting Environments for Natural Language Generation[edit]

With modern scripting engines such as V8, it is relatively easy to create and provide scripting environments.

Resembling how Web browsers provide scripting environments and API for Web scenarios, we can envision providing scripting environments and API for natural language generation scenarios.

Discussion topics pertaining to scripting environments for renderers include: (1) API and object models for accessing and working with input Wikidata data, (2) API and object models for accessing and working with the rendering context, (3) API and object models for accessing and working with intermediate knowledge representations, (4) API and object models for generating output natural language content.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Provenance[edit]

As Wikidata is a sourced knowledgebase, API and object models should include means for annotating any intermediate representations and portions of natural language with sources. In automatically-generated articles, statements’ sources could appear as referenced materials in articles’ “References” sections with numbered citations appearing inline, near relevant content.

Should Wikidata come to include support for automated reasoning, any reasoning, argumentation, derivations and/or proofs supporting statements could similarly appear in articles’ “References” sections with numbered citations appearing inline, near relevant content. Readers could click on hyperlinks to navigate to automatically-generated documents which indicate supporting reasoning, argumentation, derivations and/or proofs for one or more statements.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Output Streams, Logging and Diagnostic Events[edit]

When editing/developing Wikilambda content for use on Abstract Wikipedia, it would be convenient to be able to output to multiple streams, to log, and/or to raise typed events. Such features are part of the scripting environment provided to functions.

It would also be useful to be able to aggregate, organize and view diagnostic outputs with a configurable granularity or verbosity.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Editor/Developer Experiences[edit]

Editors/developers could have a means of toggling a “developer mode” or “debugging mode” on Abstract Wikipedia so that they could, while viewing articles, either: (1) hover over portions of natural language to view relevant traces of computation and diagnostic messages in hoverboxes, (2) view visual indicators for traces of computation and diagnostic messages in a margin so that they could then interact with the visual indicators to view expanded data, or (3) otherwise select or indicate portions of natural language content to view relevant traces of computation and diagnostic messages.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Reader Experiences[edit]

We can consider adding feedback mechanisms for Abstract Wikipedia readers such as commenting upon, liking, upvoting, or otherwise providing feedback with respect to specific portions of automatically-generated natural language content.

Also possible is that readers could “post-edit” automatically-generated content. For automatically-generated articles, there could be wiki versions of the articles for purposes of crowdsourcing the fine-tuning of the articles. These “wiki post-edited” versions of automatically-generated articles could be navigated to via tab user-interface elements. Data from this variety of crowdsourced feedback on automatically-generated articles, “wiki post-editing”, could be collected and aggregated for use by Wikilambda editors/developers.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

The Automatic Evaluation of Natural Language[edit]

Software tools in the categories of automatic essay scoring, grammar checking, readability measurement, and/or natural language evaluation could be of use for automatically measuring articles in a number of ways. Coh-Metrix 3.0, for instance, measures natural language on 108 indices.

Perhaps bots could measure articles as they are updated and report their data to editors/developers using a platform API.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Generating Articles in Response to Users’ Questions[edit]

Resembling question-answering systems, articles could be generated for Wikidata queries or after users navigate to articles from Web searches. Beyond highlighting relevant content, articles could be generated while utilizing this context data.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Generating Follow-up Questions for Use in Articles[edit]

Resembling hypertext-based dialogue systems, one could place follow-up questions which might interest a reader in a section near the bottom of articles, each being a hyperlink to another article. One could hyperlink to articles which could be dynamically generated, if they are not already created and cached.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)

Speech Synthesis and Hypertext[edit]

There exists a CSS Speech Module W3C Candidate Recommendation.

With respect to pronunciation, one can utilize pronunciation lexicons with hypertext documents. Also, resembling EPUB3, one can utilize SSML-based attributes on generated hypertext outputs to provide pronunciation data.

AdamSobieski (talk) 22:15, 15 July 2020 (UTC)