This is an archive of past discussions with User:GeneralNotability. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.
Wikidata Weekly Summary #647
<languages/>
<translate> Here's your quick overview of what has been happening around Wikidata in the week leading up to 2024-09-30. Please help Translate. Missed the previous one? See issue #646</translate>
<translate>Discussions</translate>
<translate>* Closed request for adminship: Andrei Stroe - Success! Welcome User:Andrei Stroe as Wikidata's latest Admin.
New requests for permissions/Bot: QichwaBot - Task(s): Creating Wikidata lexemes for the Quechua languages.
<translate>Events</translate>
<translate>* Wikidata's 12th birthday is coming up on October 29th. Have a look at the birthday parties and more planned around the world.
Next Linked Data for Libraries LD4 Wikidata Affinity Group session 1 October, 2024: We have our next LD4 Wikidata Affinity Group Session on Tuesday, 1 October, 2024 at 9am PT / 12pm ET / 16:00 UTC / 6pm CEST (Time zone converter). Christa Strickler will be our first Project Series lead with her joint project with the Wikidata Religion & Theology Community of Practice to contribute biographical data to Wikidata from the IRFA database using the Mix’n’Match tool. We are excited to learn more about this project, provide a forum for discussion and shared learning, and lend a hand while building new skills. Event page.</translate>
<translate>Press, articles, blog posts, videos</translate>
<translate>* Papers
A Systematic Review of Wikidata in GLAM Institutions: a Labs Approach - Presents a systematic review of Wikidata use in GLAM institutions within the context of the work of the International GLAM Labs Community (glamlabs.io). The results summarise academic literature on Wikidata projects. By G. Candela et al.
Using Wikidata for Managing Cultural Heritage Information - The present study uses model Wikidata elements as a basis and explores their dynamic formation into a cultural heritage information management tool within a museum. By D. Kyriaki-Manessi and S. Vazaiou.
<translate>Tool of the week</translate>
<translate>* Three new Userscripts for Wikidata - User:Lagewi has written 3 scripts to simplify reading references, explore property-value pairs in use for a statement, or attach a full bibliography to the end of the item page.</translate>
<translate>Other Noteworthy Stuff</translate>
<translate>* OpenSanctions: Wikidata Persons in Relevant Categories - Using PetScan, this generates a list of profiles of politically exposed persons by querying specific categories on Wikidata and extracting the entities.</translate>
objects of occurrence have role (role that objects of this occurrence take on in the context of this occurrence. (For selectional restrictions, use "object class of occurrence" (P12913) instead.))
agents of action have role (role that agents of this action take on in the context of this action. (For selectional restrictions, use "agent class of action" (P12994) instead.))
agent class of action (class of items that may initiate this action or class of actions (For roles filled by agents of an action, use "agents of action have role" (P12993) instead))
agent of action (particular item that initiates this action or class of actions)
<translate>* New property proposals to review:</translate>
<translate>** General datatypes: </translate>
Larval host plant (Larval host plant - used only for insects - subclass of P1034)
has reading (phonetic reading or pronunciation of the kanji)
chemical formula (Description of chemical compound giving element symbols and counts)
mode of reproduction (ways for living organisms to propagate or produce their offspring)
health points (health or armor points of this video game, board game or role-playing game character)
damage (damage value of this video game weapon, ability or character)
magazine capacity (In (real or fictional) devices like a firearm, weapon, or engineered thing, this is the default capacity or size of a device's magazine, clip, or other container typically used to hold ammunition, bolts, cartridges, tools, etc., which pushes those items as needed, usually through a spring-based mechanism, into a receiver for further use by the device)
male mean age (male mean age in a given place; qualifier of {{P|4442}})
female mean age (female mean age in a given place; qualifier of {{P|4442}})
Western Australian Biographical Index (Card ID from the Western Australian Biographical Index, a set of handwritten index cards compiled in the 1970s.)
leased to (person or organisation that holds or was granted a lease on the subject)
WPBSA com player ID (Identifier for an athlete on the main website of WPBSA)
JLPT level (difficulty of word by the level of JLPT)
Search: The haswbstatement search magic word has been improved by the Search Platform Team. Previously it was limited in which Properties were indexed for it. Going forward haswbstatement:P123 will work for all Properties, regardless of their datatype. This will allow you to filter search results for Items that have a statement with a specific Property. (Searching for a specific complete statement with haswbstatement:P123=xxx will still only work for specific datatypes.) For this to work all Items have to be reindexed and this will take up to 1 month.
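As a rough illustration of the two forms of the keyword described above, the sketch below builds search URLs for the standard MediaWiki search API (`action=query&list=search`). The helper name `build_search_url`, the endpoint constant, and the sample values `P123`, `P31` and `Q5` are assumptions chosen for illustration; whether a given `Property=value` search returns results still depends on the Property's datatype and on reindexing having finished.

```python
from urllib.parse import urlencode

# Assumed endpoint: the public Wikidata Action API.
API_ENDPOINT = "https://www.wikidata.org/w/api.php"


def build_search_url(property_id, value=None, extra_terms="", limit=10):
    """Build a search URL that filters results with haswbstatement.

    property_id -- a Property ID such as "P123" (placeholder from the text)
    value       -- optional value for the complete-statement form (P123=xxx),
                   which only works for specific datatypes
    extra_terms -- optional free-text search terms to combine with the filter
    """
    keyword = f"haswbstatement:{property_id}"
    if value is not None:
        keyword += f"={value}"
    query = f"{extra_terms} {keyword}".strip()
    params = {
        "action": "query",
        "list": "search",
        "srsearch": query,
        "srlimit": limit,
        "format": "json",
    }
    return f"{API_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    # Property-only form: Items that have any statement with P123.
    print(build_search_url("P123"))
    # Complete-statement form, illustrative values only.
    print(build_search_url("P31", value="Q5", extra_terms="Douglas"))
```

The same `haswbstatement:P123` string can also be typed directly into the search box on Wikidata; the URL form is only convenient for scripts.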
Design system migration: We have migrated the Special:NewLexeme page from Wikit to Codex and are working on finishing the migration for the Query Builder.
EntitySchemas: We finished the investigation about how to support search for EntitySchemas by label or alias when linking to an EntitySchema in a statement. (phab:T362005)
Wikibase REST API: We worked on integrating language fallbacks into the API (phab:T371605)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Updates for editors
Readers of 42 more wikis can now use Dark Mode. If the option is not yet available for logged-out users of your wiki, this is likely because many templates do not yet display well in Dark Mode. Please use the night-mode-checker tool if you are interested in helping to reduce the number of issues. The recommendations page provides guidance on this. Dark Mode is enabled on additional wikis once per month.
Editors using the 2010 wikitext editor as their default can access features from the 2017 wikitext editor by adding ?veaction=editsource to the URL. If you would like to enable the 2017 wikitext editor as your default, it can be set in your preferences. [1]
For logged-out readers using the Vector 2022 skin, the "donate" link has been moved from a collapsible menu next to the content area into a more prominent top menu, next to "Create an account". This restores the link to the level of prominence it had in the Vector 2010 skin. Learn more about the changes related to donor experiences. [2]
The CampaignEvents extension provides tools for organizers to more easily manage events, communicate with participants, and promote their events on the wikis. The extension has been enabled on Arabic Wikipedia, Igbo Wikipedia, Swahili Wikipedia, and Meta-Wiki. Chinese Wikipedia has decided to enable the extension, and discussions on the extension are in progress on Spanish Wikipedia and on Wikidata. To learn how to enable the extension on your wiki, you can visit the CampaignEvents page on Meta-Wiki.
Developers with an account on Wikitech-wiki should check if any action is required for their accounts. The wiki is being changed to use the single-user-login (SUL) system, and other configuration changes. This change will help reduce the overall complexity for the weekly software updates across all our wikis.
In depth
The server switch was completed successfully last week with a read-only time of only 2 minutes 46 seconds. This periodic process makes sure that engineers can switch data centers and keep all of the wikis available for readers, even if there are major technical issues. It also gives engineers a chance to do maintenance and upgrades on systems that normally run 24 hours a day, and often helps to reveal weaknesses in the infrastructure. The process involves dozens of software services and hundreds of hardware servers, and requires multiple teams working together. Work over the past few years has reduced the time from 17 minutes down to 2–3 minutes. [3]
Following a discussion, the speedy deletion reason "File pages without a corresponding file" has been moved from criterion G8 to F2. This does not change what can be speedily deleted.
<translate> Here's your quick overview of what has been happening around Wikidata in the week leading up to 2024-10-07. Please help Translate. Missed the previous one? See issue #647</translate>
Wikidata Day 2024 at the Pratt Institute Manhattan Campus, New York - To celebrate Wikidata's 12th Birthday, a mini-conference with beginner workshops, lightning talks and keynote speeches will be held. October 26, 11am - 5pm EDT (UTC-4). More info, registration and full address on this Wikipedia event page.
The Wikidata Days 2024 in Bologna, Italy will take place on November 8th and 9th. Its program revolves around Wikidata for libraries and academia, and features a wide range of Wikidata-enthusiastic librarians and researchers from Italy. Registration is open until October 31st.
The next Wikidata+Wikibase office hours will take place on Wednesday, 16th October 2024 at 18:00 CEST in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Scholia hackathon on Oct 18-20, aimed at addressing changes related to the Wikidata graph split
Intangible Cultural Heritage on Wikidata - Hosted by Wikimedia Community Malta (WCM), November 8, 2024 18:00 - 19:00 CEST, Malta Fairs and Conference Centre (MFCC) in Ta’ Qali, Malta
<translate>Press, articles, blog posts, videos</translate>
<translate>* Blogs
Wikidata is a giant crosswalk file - dbreunig.com describes how, with a little DuckDB, Ruby, and data from Wikidata, you can produce a crosswalk file of geographic entities.
(fr) wikidata MultiSearch - search for a list of elements in Wikidata. A GPLv3 licenced tool built by Philippe Gambette allows you to search for a list of words in Wikidata and retrieve some associated Wikidata properties.
<translate>Other Noteworthy Stuff</translate>
Are you building applications or services with Wikidata's data? We'd love to hear from you to help us figure out the future of accessing Wikidata's data.
Wikidata: Event Organizers - If you are organizing or thinking about planning a Wikidata event, this new page will be a valuable resource: it lists the additional user rights that the 'event organizer' role has and explains the process for applying for them.
objects of occurrence have role (role that objects of this occurrence take on in the context of this occurrence. (For selectional restrictions, use "object class of occurrence" (P12913) instead.))
agents of action have role (role that agents of this action take on in the context of this action. (For selectional restrictions, use "agent class of action" (P12994) instead.))
agent class of action (class of items that may initiate this action or class of actions (For roles filled by agents of an action, use "agents of action have role" (P12993) instead))
agent of action (particular item that initiates this action or class of actions)
characteristic of ((qualifier only) statement value is a characteristic, quality, property, or state of this qualifier value)
Lingnan University Library: Wikidata Pilot Project - Creating and improving entries for Lingnan University academic staff, as well as generating entries for the Library's digital collections and Lingnan theses and dissertations.
French Literary Prizes - Aims to coordinate the development of a database on French literary prizes (list of prizes, jury members, list of winners)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Weekly highlight
Communities can now request installation of Automoderator on their wiki. Automoderator is an automated anti-vandalism tool that reverts bad edits based on scores from the new "Revert Risk" machine learning model. You can read details about the necessary steps for installation and configuration. [4]
Updates for editors
Translators on wikis where the mobile experience of Content Translation is available can now customize their article suggestion list using 41 filtering options. This topic-based article suggestion feature makes it easy for translators to discover relevant articles based on their areas of interest and translate them. You can try it with your mobile device. [5]
It is now possible for <syntaxhighlight> code blocks to offer readers a "Copy" button if the copy=1 attribute is set on the tag. Thanks to SD0001 for these improvements. [6]
Customized copyright footer messages on all wikis will be updated. The new versions will use wikitext markup instead of requiring editing raw HTML. [7]
Later this month, temporary accounts will be rolled out on several pilot wikis. The final list of the wikis will be published in the second half of the month. If you maintain any tools, bots, or gadgets on these 11 wikis, and your software is using data about IP addresses or is available for logged-out users, please check if it needs to be updated to work with temporary accounts. Guidance on how to update the code is available.
Rate limiting has been enabled for the code review tools Gerrit and GitLab to address ongoing issues caused by malicious traffic and scraping. Clients that open too many concurrent connections will be restricted for a few minutes. This rate limiting is managed through nftables firewall rules. For more details, see Wikitech's pages on Firewall, GitLab limits and Gerrit operations.
While these candidates have been ranked through the vote, they still need to be appointed to the Board of Trustees. They need to pass a successful background check and meet the qualifications outlined in the Bylaws. New trustees will be appointed at the next Board meeting in December 2024.
Next Linked Data for Libraries LD4 Wikidata Affinity Group session 15 October, 2024: We have our next LD4 Wikidata Affinity Group Session on Tuesday, 15 October, 2024 at 9am PT / 12pm ET / 16:00 UTC / 6pm CEST (Time zone converter). https://zonestamp.toolforge.org/1729008000 Christa Strickler will be our first Project Series lead with her joint project with the Wikidata Religion & Theology Community of Practice to contribute biographical data to Wikidata from the IRFA database https://irfa.paris/en/en-learn-about-a-missionary/ using the Mix’n’Match tool. We are excited to learn more about this project, provide a forum for discussion and shared learning, and lend a hand while building new skills. Event page: [13]
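The session time can be double-checked from the Unix timestamp in the zonestamp link above. This is a minimal sketch: the fixed UTC offsets are an assumption that daylight saving time is in effect in all three regions on 15 October 2024, and the names `SESSION_TIMESTAMP`, `ZONES` and `local_times` are illustrative only.

```python
from datetime import datetime, timedelta, timezone

# Timestamp taken from the zonestamp URL in the announcement.
SESSION_TIMESTAMP = 1729008000

# Assumed fixed offsets (daylight saving time in effect in mid-October 2024).
ZONES = {
    "PT": timezone(timedelta(hours=-7)),   # Pacific Daylight Time
    "ET": timezone(timedelta(hours=-4)),   # Eastern Daylight Time
    "UTC": timezone.utc,
    "CEST": timezone(timedelta(hours=2)),  # Central European Summer Time
}


def local_times(timestamp):
    """Return the wall-clock time of a Unix timestamp in each listed zone."""
    moment = datetime.fromtimestamp(timestamp, tz=timezone.utc)
    return {
        label: moment.astimezone(tz).strftime("%Y-%m-%d %H:%M")
        for label, tz in ZONES.items()
    }


if __name__ == "__main__":
    for label, text in local_times(SESSION_TIMESTAMP).items():
        print(f"{label}: {text}")
```

Under those assumptions the output agrees with the times given in the announcement: 09:00 PT, 12:00 ET, 16:00 UTC and 18:00 CEST.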
The next Wikidata+Wikibase office hours will take place on Wednesday, 16th October 2024 at 18:00 CEST in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Wikidata:Twelfth Birthday: We already have 30 events scheduled on the list 😍. As a reminder, when your event is ready, don't forget to:
create a wikipage with more information about the event, participants list, etc.
Small data, slow data − a SNAIL approach to Wikidata: discusses the value of small, carefully curated datasets in the era of big data. It emphasizes the importance of taking a methodical, "snail-paced" approach to data collection and analysis, which can lead to more meaningful and accurate insights. The blogpost also highlights how this approach can complement the broader trends of big data, ensuring that detailed, high-quality data is not overlooked.
Papers
"WoolNet: Finding and Visualising Paths in Knowledge Graphs": given two or more entities requested by a user, the system finds and visualises paths that connect these entities, forming a topical subgraph of Wikidata (Torres Gutiérrez and Hogan)
Dynamic Mapping using Collaborative Knowledge Graphs: Real-Time SKOS Mapping from Wikidata: This presentation introduces a workflow using SPARQL queries to dynamically map live Wikidata data to SKOS concepts, featuring a Python tool that converts CSV outputs into RDF triples for integration into linked data environments and knowledge graphs, emphasizing real-time data retrieval and interoperability.
Could making Wikidata 'human' readable lead to better AI?: Lydia Pintscher (WMDE), Portfolio Lead Product Manager for Wikidata at Wikimedia Deutschland, discussed a new project aimed at making Wikidata more 'human' readable for Large Language Models (LLMs), which could improve AI reliability by giving these models access to high-quality, human-curated data from Wikidata.
Elemwala (এলেমওয়ালা) (https://elemwala.toolforge.org) is a proof-of-concept interface that allows you to input abstract content and get natural language text in a given output language. There may well be errors with particular inputs, and the text may not be quite as natural as you might expect, but that's where your improvements to your language's lexemes, other Wikidata items, and the tool's source code come in!
mlscores: Tool for calculating multilinguality score of Wikidata items (including properties). E.g. for Wikidata (Q2013), the scores are - en: 99.66%, fr: 89.49%, es: 84.07%, pt: 68.47%. For instance of (P31), the scores are - en: 99.86%, fr: 87.12%, es: 80.83%, pt: 61.37%.
Other Noteworthy Stuff
Launch of WikiProject Deprecate P642: The goal of this project is to prepare for deprecation, and eventual removal, of the property of (P642). Currently, of (P642) is labeled as "being deprecated", meaning its use is still allowed, but discouraged. From a peak of around 900,000 uses, the property now has around 700,000 uses (see status here). Our goal is to reduce that as much as possible in a systematic way, while ensuring that appropriate properties exist to replace all valid uses of of (P642). The latter is key to officially deprecating the property. Before removing the property, we want to get as close to zero uses as possible.
bias (Indicates a specific form of bias present in a media source, organization, or document, such as false balance, slant, or omission, affecting the representation of information.)
TDK lexeme ID (Dictionary created by the [[Q1569712|Turkish Language Association]])
Atatürk Ansiklopedisi ID (Online Turkish encyclopedia created by [[Q6062914]] and [[Q19610584]])
Eurotopics ID (A database containing data on European media.)
Stated in unreliable source (used in the references field to refer to the database, considered an unreliable source, in which the claim is made)
PatternsKilkenny - Patterns were devotional days on the day of the patron saint of a parish or area or at least an annually occurring day when the people of the locality held their personal devotions in a certain pattern (hence the name), i.e. "doing the rounds" around trees or other landmarks at the sacred site. This project tries to collate the records and memories of these patterns for County Kilkenny.
Deprecate P642 - The goal of this project is to prepare for deprecation, and eventual removal, of the property of (P642).
AIDS Walks - This project aims to collaborate with Wiki editors across the globe to highlight AIDS Walks anywhere in the world.
Temples in Roman Britain - The aim of the WikiProject Temples in Roman Britain is to record and catalog sacred spaces in the Roman province of Britannia between 43 and 409 CE. By sacred spaces, we include (for the moment) only built structures such as temples, sanctuaries and shrines.
Nihongo - The goal of this project is to capture the Japanese language (Q5287) in its entirety on Wikidata. We aim to give advice and establish standards for representing Japanese words as lexemes.
EntitySchemas: We are continuing the work on making it possible to find an EntitySchema by its label or aliases when linking to an EntitySchema in a statement (phab:T375641)
Design system: We are continuing the work on migrating the Query Builder from Wikit to Codex
REST API: We finished the work on language fallback support in the REST API (phab:T371605)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Updates for editors
The Structured Discussion extension (also known as Flow) is starting to be removed. This extension is unmaintained and causes issues. It will be replaced by DiscussionTools, which is used on any regular talk page. A first set of wikis is being contacted. These wikis are invited to stop using Flow, and to move all Flow boards to sub-pages, as archives. At these wikis, a script will automatically move all Flow pages that aren't a sub-page to a sub-page, starting on 22 October 2024. On 28 October 2024, all Flow boards at these wikis will be set to read-only mode. [14][15]
WMF's Search Platform team is working on making it easier for readers to perform text searches in their language. A change last week on over 30 languages makes it easier to find words with accents and other diacritics. This applies to both full-text search and to types of advanced search such as the hastemplate and incategory keywords. More technical details (including a few other minor search upgrades) are available. [16]
View all 20 community-submitted tasks that were resolved last week. For example, EditCheck was installed at Russian Wikipedia, and fixes were made for some missing user interface styles.
Updates for technical contributors
Editors who use the Toolforge tool Earwig's Copyright Violation Detector will now be required to log in with their Wikimedia account before running checks using the "search engine" option. This change is needed to help prevent external bots from misusing the system. Thanks to Chlod for these improvements. [17]
Some HTML elements in the interface are now wrapped with a <bdi> element, to make our HTML output more aligned with Web standards. More changes like this will be coming in future weeks. This change might break some tools that rely on the previous HTML structure of the interface. Note that relying on the HTML structure of the interface is not recommended and might break at any time. [19]
In depth
The latest monthly MediaWiki Product Insights newsletter is available. This edition includes: updates on Wikimedia's authentication system, research to simplify feature development in the MediaWiki platform, updates on Parser Unification and MathML rollout, and more.
The latest quarterly Technical Community Newsletter is now available. This edition includes: research about improving topic suggestions related to countries, improvements to PHPUnit tests, and more.