User talk:PKM

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Archive


Contents

Facto Post – Issue 2 – 13 July 2017[edit]

Facto Post – Issue 2 – 13 July 2017
Content mine logo.png

Editorial: Core models and topics[edit]

Wikimedians interest themselves in everything under the sun — and then some. Discussion on "core topics" may, oddly, be a fringe activity, and was popular here a decade ago.

The situation on Wikidata today does resemble the halcyon days of 2006 of the English Wikipedia. The growth is there, and the reliability and stylistic issues are not yet pressing in on the project. Its Berlin conference at the end of October will have five years of achievement to celebrate. Think Wikimania Frankfurt 2005.

Progress must be made, however, on referencing "core facts". This has two parts: replacing "imported from Wikipedia" in referencing by external authorities; and picking out statements, such as dates and family relationships, that must not only be reliable but be seen to be reliable.

In addition, there are many properties on Wikidata lacking a clear data model. An emerging consensus may push to the front key sourcing and biomedical properties as requiring urgent attention. Wikidata's "manual of style" is currently distributed over thousands of discussions. To make it coalesce, work on such a core is needed.

Links[edit]


Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

Thank you and post-event survey for Met Open Access Artworks Challenge![edit]

I award you with this barnstar of William the Faience Hippopotamus in recognition of your good work at the Met Open Access Artworks Challenge!

We'd like to invite your participation in the post-event survey for the Met Open Access Artworks Challenge.--Pharos (talk) 20:24, 17 July 2017 (UTC)

@Pharos: Thank you! - PKM (talk) 20:33, 17 July 2017 (UTC)

Facto Post – Issue 3 – 11 August 2017[edit]

Facto Post – Issue 3 – 11 August 2017
Content mine logo.png

Wikimania report[edit]

Interviewed by Facto Post at the hackathon, Lydia Pintscher of Wikidata said that the most significant recent development is that Wikidata now accounts for one third of Wikimedia edits. And the essential growth of human editing.

Internet-In-A-Box

Impressive development work on Internet-in-a-Box featured in the WikiMedFoundation annual conference on Thursday. Hardware is Raspberry Pi, running Linux and the Kiwix browser. It can operate as a wifi hotspot and support a local intranet in parts of the world lacking phone signal. The medical use case is for those delivering care, who have smartphones but have to function in clinics in just such areas with few reference resources. Wikipedia medical content can be served to their phones, and power supplied by standard lithium battery packages.

Yesterday Katherine Maher unveiled the draft Wikimedia 2030 strategy, featuring a picturesque metaphor, "roads, bridges and villages". Here "bridges" could do with illustration. Perhaps it stands for engineering round or over the obstacles to progress down the obvious highways. Internet-in-a-Box would then do fine as an example.

"Bridging the gap" explains a take on that same metaphor, with its human component. If you are at Wikimania, come talk to WikiFactMine at its stall in the Community Village, just by the 3D-printed display for Bassel Khartabil; come hear T Arrow talk at 3 pm today in Drummond West, Level 3.

Link[edit]

  • Plaudit for the Medical Wikipedia app, content that is loaded into Internet-In-A-Box with other material, such as per-country documentation.
Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 10:55, 12 August 2017 (UTC)

Books and Bytes - Issue 23[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 23, June-July 2017

  • Library card
  • User Group update
  • Global branches update
  • Spotlight: Combating misinformation, fake news, and censorship
  • Bytes in brief

Chinese, Arabic and Yoruba versions of Books & Bytes are now available in meta!

Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 02:04, 23 August 2017 (UTC)

LA event this Thursday[edit]

LA Meetup: September 7 edit-a-thon near DTLA
Eugene W. Britt House, West Adams, Los Angeles.JPG

Dear fellow Wikipedian,

You have been invited to a meetup and edit-a-thon at the LA84 Foundation in Jefferson Park (near DTLA) on Thursday, September 7, 2017 from 5:45 p.m. to 9:00 p.m.! This event aims to improve coverage of female Olympians and Paralympians (some of whom will be attending!). There will be a deejay and food/drinks, and kids are welcome.

I hope to see you there! Calliopejen1 (talk) - via MediaWiki message delivery (talk) 14:10, 2 September 2017 (UTC)

Join our Facebook group, follow our Twitter account, and like our Facebook page!! To opt out of future mailings about LA meetups, please remove your name from this list.

Facto Post – Issue 4 – 18 September 2017[edit]

Facto Post – Issue 4 – 18 September 2017
Content mine logo.png

Editorial: Conservation data[edit]

The IUCN Red List update of 14 September led with a threat to North American ash trees. The International Union for Conservation of Nature produces authoritative species listings that are peer-reviewed. Examples used as metonyms for loss of species and biodiversity, and discussion of extinction rates, are the usual topics covered in the media to inform us about this area. But actual data matters.

Dorstenia elata, a critically endangered South American herb, contained in Moraceae, the family of figs and mulberries

Clearly, conservation work depends on decisions about what should be done, and where. While animals, particularly mammals, are photogenic, species numbers run into millions. Plant species lie at the base of typical land-based food chains, and vegetation is key to the habitats of most animals.

ContentMine dictionaries, for example as tabulated at d:Wikidata:WikiFactMine/Dictionary list, enable detailed control of queries about endangered species, in their taxonomic context. To target conservation measures properly, species listings running into the thousands are not what is needed: range maps showing current distribution are. Between the will to act, and effective steps taken, the services of data handling are required. There is now no reason at all why Wikidata should not take up the burden.

Links[edit]

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 14:46, 18 September 2017 (UTC)

Starting the Met Weekly Challenge: The Horse Fair + help pick future collaborations[edit]

Rosa Bonheur's The Horse Fair has been chosen as the first-ever Met Weekly Challenge, starting today, for the coming week. There are a lot of resources at the artwork's Met Collection record (see 'Catalogue Entry'), and also lists of other references. We could also benefit from the French Wikipedia version and Commons:Category:The Horse Fair — see the section for Week 1: The Horse Fair (Sept 25 - Oct 1) for more on how to participate, or just get started editing!

I also very much encourage folks to share and vote for your ideas on future collaborations!--Pharos (talk) 18:21, 25 September 2017 (UTC)

Met Weekly Challenge, Week 2: The Princesse de Broglie (Oct 1 - Oct 8): + help pick future collaborations[edit]

Ingres' The Princesse de Broglie has been chosen as the Week 2 focus of the Met Weekly Challenge. There are a lot of resources at the artwork's Met Collection record (see 'Catalogue Entry'), and also lists of other references. See the section for Week 2: The Princesse de Broglie (Oct 1 - Oct 8) for more on how to participate, or just get started editing!

I also very much encourage folks to share and vote for your ideas on future collaborations!--Pharos (talk) 20:08, 3 October 2017 (UTC)

Met Weekly Challenge, Week 3: Portrait of Juan de Pareja (Oct 9 - Oct 15): + help pick future collaborations[edit]

Velázquez's Portrait of Juan de Pareja has been chosen as the Metropolitan Museum of Art Weekly Challenge for the coming week. There are a lot of resources at the artwork's Met Collection record (see 'Catalogue Entry'), and also lists of other references. See the section for Week 3: Portrait of Juan de Pareja (Oct 9 - Oct 15) for more on how to participate, or just get started editing!

Simultaneously, we are also encouraging collaboration on the biographical article for Juan de Pareja, the artist depicted.

I also very much encourage folks to share and vote for your ideas on future collaborations!--Pharos (talk) 20:37, 10 October 2017 (UTC)

Facto Post – Issue 5 – 17 October 2017[edit]

Facto Post – Issue 5 – 17 October 2017
Content mine logo.png

Editorial: Annotations[edit]

Annotation is nothing new. The glossators of medieval Europe annotated between the lines, or in the margins of legal manuscripts of texts going back to Roman times, and created a new discipline. In the form of web annotation, the idea is back, with texts being marked up inline, or with a stand-off system. Where could it lead?

1495 print version of the Digesta of Justinian, with the annotations of the glossator Accursius from the 13th century

ContentMine operates in the field of text and data mining (TDM), where annotation, simply put, can add value to mined text. It now sees annotation as a possible advance in semi-automation, the use of human judgement assisted by bot editing, which now plays a large part in Wikidata tools. While a human judgement call of yes/no, on the addition of a statement to Wikidata, is usually taken as decisive, it need not be. The human assent may be passed into an annotation system, and stored: this idea is standard on Wikisource, for example, where text is considered "validated" only when two different accounts have stated that the proof-reading is correct. A typical application would be to require more than one person to agree that what is said in the reference translates correctly into the formal Wikidata statement. Rejections are also potentially useful to record, for machine learning.

As a contribution to data integrity on Wikidata, annotation has much to offer. Some "hard cases" on importing data are much more difficult than average. There are for example biographical puzzles: whether person A in one context is really identical with person B, of the same name, in another context. In science, clinical medicine require special attention to sourcing (WP:MEDRS), and is challenging in terms of connecting findings with the methodology employed. Currently decisions in areas such as these, on Wikipedia and Wikidata, are often made ad hoc. In particular there may be no audit trail for those who want to check what is decided.

Annotations are subject to a World Wide Web Consortium standard, and behind the terminology constitute a simple JSON data structure. What WikiFactMine proposes to do with them is to implement the MEDRS guideline, as a formal algorithm, on bibliographical and methodological data. The structure will integrate with those inputs the human decisions on the interpretation of scientific papers that underlie claims on Wikidata. What is added to Wikidata will therefore be supported by a transparent and rigorous system that documents decisions.

An example of the possible future scope of annotation, for medical content, is in the first link below. That sort of detailed abstract of a publication can be a target for TDM, adds great value, and could be presented in machine-readable form. You are invited to discuss the detailed proposal on Wikidata, via its talk page.

Links[edit]

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 08:46, 17 October 2017 (UTC)

Books and Bytes - Issue 24[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 24, August-September 2017

  • User Group update
  • Global branches update
    • Star Coordinator Award - last quarter's star coordinator: User:Csisc
  • Wikimania Birds of a Feather session roundup
  • Spotlight: Wiki Loves Archives
  • Bytes in brief

Arabic, Kiswahili and Yoruba versions of Books & Bytes are now available in meta!

Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 04:53, 21 October 2017 (UTC)

Global online Wikipedia Asian Art Month (Nov 1-30) with WikiProject Metropolitan Museum of Art[edit]

Damascus Room MET DP240367.jpg

You are invited to join the global online Wikipedia Asian Art Month (sign up here!), running November 1-30. As part of Wikipedia:Wikipedia Asian Month, there will be Met postcards for all who participate, and art books as grand prizes. Let me know if you have questions, or need help finding a good topic and matching images from the Met CC0 collections release.--Pharos (talk) 16:17, 2 November 2017 (UTC)

Facto Post – Issue 6 – 15 November 2017[edit]

Facto Post – Issue 6 – 15 November 2017
Content mine logo.png

WikidataCon Berlin 28–9 October 2017[edit]

WikidataCon 2017 group photo

Under the heading rerum causas cognescere, the first ever Wikidata conference got under way in the Tagesspiegel building with two keynotes, One was on YAGO, about how a knowledge base conceived ten years ago if you assume automatic compilation from Wikipedia. The other was from manager Lydia Pintscher, on the "state of the data". Interesting rumours flourished: the mix'n'match tool and its 600+ datasets, mostly in digital humanities, to be taken off the hands of its author Magnus Manske by the WMF; a Wikibase incubator site is on its way. Announcements came in talks: structured data on Wikimedia Commons is scheduled to make substantive progress by 2019. The lexeme development on Wikidata is now not expected to make the Wiktionary sites redundant, but may facilitate automated compilation of dictionaries.

WD-FIST explained

And so it went, with five strands of talks and workshops, through to 11 pm on Saturday. Wikidata applies to GLAM work via metadata. It may be used in education, raises issues such as author disambiguation, and lends itself to different types of graphical display and reuse. Many millions of SPARQL queries are run on the site every day. Over the summer a large open science bibliography has come into existence there.

Wikidata's fifth birthday party on the Sunday brought matters to a close. See a dozen and more reports by other hands.

Links[edit]

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 10:02, 15 November 2017 (UTC)

ArbCom 2017 election voter message[edit]

Scale of justice 2.svgHello, PKM. Voting in the 2017 Arbitration Committee elections is now open until 23.59 on Sunday, 10 December. All users who registered an account before Saturday, 28 October 2017, made at least 150 mainspace edits before Wednesday, 1 November 2017 and are not currently blocked are eligible to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2017 election, please review the candidates and submit your choices on the voting page. MediaWiki message delivery (talk) 18:42, 3 December 2017 (UTC)

Facto Post – Issue 7 – 15 December 2017[edit]

Facto Post – Issue 7 – 15 December 2017
Content mine logo.png

A new bibliographical landscape[edit]

At the beginning of December, Wikidata items on individual scientific articles passed the 10 million mark. This figure contrasts with the state of play in early summer, when there were around half a million. In the big picture, Wikidata is now documenting the scientific literature at a rate that is about eight times as fast as papers are published. As 2017 ends, progress is quite evident.

Behind this achievement are a technical advance (fatameh), and bots that do the lifting. Much more than dry migration of metadata is potentially involved, however. If paper A cites paper B, both papers having an item, a link can be created on Wikidata, and the information presented to both human readers, and machines. This cross-linking is one of the most significant aspects of the scientific literature, and now a long-sought open version is rapidly being built up.

WikiCite wordmark.svg

The effort for the lifting of copyright restrictions on citation data of this kind has had real momentum behind it during 2017. WikiCite and the I4OC have been pushing hard, with the result that on CrossRef over 50% of the citation data is open. Now the holdout publishers are being lobbied to release rights on citations.

But all that is just the beginning. Topics of papers are identified, authors disambiguated, with significant progress on the use of the four million ORCID IDs for researchers, and proposals formulated to identify methodology in a machine-readable way. P4510 on Wikidata has been introduced so that methodology can sit comfortably on items about papers.

More is on the way. OABot applies the unpaywall principle to Wikipedia referencing. It has been proposed that Wikidata could assist WorldCat in compiling the global history of book translation. Watch this space.

And make promoting #1lib1ref one of your New Year's resolutions. Happy holidays, all!

November 2017 map of geolocated Wikidata items, made by Addshore

Links[edit]


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 14:54, 15 December 2017 (UTC)

Books and Bytes - Issue 25[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 25, October – November 2017

  • OAWiki & #1Lib1Ref
  • User Group update
  • Global branches update
  • Spotlight: Research libraries and Wikimedia
  • Bytes in brief

Arabic, Korean and French versions of Books & Bytes are now available in meta!

Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 18:57, 15 December 2017 (UTC)

Facto Post – Issue 8 – 15 January 2018[edit]

Facto Post – Issue 8 – 15 January 2018
Content mine logo.png

Metadata on the March[edit]

From the days of hard-copy liner notes on music albums, metadata have stood outside a piece or file, while adding to understanding of where it comes from, and some of what needs to be appreciated about its content. In the GLAM sector, the accumulation of accurate metadata for objects is key to the mission of an institution, and its presentation in cataloguing.

Today Wikipedia turns 17, with worlds still to conquer. Zooming out from the individual GLAM object to the ontology in which it is set, one such world becomes apparent: GLAMs use custom ontologies, and those introduce massive incompatibilities. From a recent article by sadads, we quote the observation that "vocabularies needed for many collections, topics and intellectual spaces defy the expectations of the larger professional communities." A job for the encyclopedist, certainly. But the data-minded Wikimedian has the advantages of Wikidata, starting with its multilingual data, and facility with aliases. The controlled vocabulary — sometimes referred to as a "thesaurus" as term of art — simplifies search: if a "spade" must be called that, rather than "shovel", it is easier to find all spade references. That control comes at a cost.

SVG pedestrian crosses road
Zebra crossing/crosswalk, Singapore

Case studies in that article show what can lie ahead. The schema crosswalk, in jargon, is a potential answer to the GLAM Babel of proliferating and expanding vocabularies. Even if you have no interest in Wikidata as such, simply vocabularies V and W, if both V and W are matched to Wikidata, then a "crosswalk" arises from term v in V to w in W, whenever v and w both match to the same item d in Wikidata.

For metadata mobility, match to Wikidata. It's apparently that simple: infrastructure requirements have turned out, so far, to be challenges that can be met.

Links[edit]


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 12:38, 15 January 2018 (UTC)

Books and Bytes - Issue 26[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 26, December – January 2018

  • #1Lib1Ref
  • User Group update
  • Global branches update
  • Spotlight: What can we glean from OCLC’s experience with library staff learning Wikipedia?
  • Bytes in brief

Arabic and French versions of Books & Bytes are now available in meta!
Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 17:36, 31 January 2018 (UTC)

Facto Post – Issue 9 – 5 February 2018[edit]

Facto Post – Issue 9 – 5 February 2018
Content mine logo.png

m:Grants:Project/ScienceSource is the new ContentMine proposal: please take a look.

Wikidata as Hub[edit]

One way of looking at Wikidata relates it to the semantic web concept, around for about as long as Wikipedia, and realised in dozens of distributed Web institutions. It sees Wikidata as supplying central, encyclopedic coverage of linked structured data, and looks ahead to greater support for "federated queries" that draw together information from all parts of the emerging network of websites.

Mandel zoom 07 satellite.jpg

Another perspective might be likened to a photographic negative of that one: Wikidata as an already-functioning Web hub. Over half of its properties are identifiers on other websites. These are Wikidata's "external links", to use Wikipedia terminology: one type for the DOI of a publication, another for the VIAF page of an author, with thousands more such. Wikidata links out to sites that are not nominally part of the semantic web, effectively drawing them into a larger system. The crosswalk possibilities of the systematic construction of these links was covered in Issue 8.

Wikipedia:External links speaks of them as kept "minimal, meritable, and directly relevant to the article." Here Wikidata finds more of a function. On viaf.org one can type a VIAF author identifier into the search box, and find the author page. The Wikidata Resolver tool, these days including Open Street Map, Scholia etc., allows this kind of lookup. The hub tool by maxlath takes a major step further, allowing both lookup and crosswalk to be encoded in a single URL.

Links[edit]


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 11:50, 5 February 2018 (UTC)

WikiProject X Newsletter • Issue 11[edit]

WikiProject X icon.svg
Newsletter • February 2018

Check out this month's issue of the WikiProject X newsletter, with plans to renew work with a followup grant proposal to support finalising the deployment of CollaborationKit!

-— Isarra 21:26, 14 February 2018 (UTC)

Wikipedia Day LA, March 31[edit]

Wikipedia Day LA 2018
Ace Hotel.jpg

Please join us from 10:00 am - 5:00 pm on Saturday, March 31st for Wikipedia Day LA 2018 at the Ace Hotel in downtown Los Angeles. There will be speakers, panel discussions, a presentation on Wikidata, flash sessions, and a discussion about the formation of an LA User Group. There could be dramatic readings of LA-related talk pages, and there will be truly excellent cake. Please RSVP on the event page if you're thinking of joining us.

We hope to see you there! JSFarman (talk) via MediaWiki message delivery (talk) 03:19, 9 March 2018 (UTC)

Join our Facebook group here, and follow us on Twitter .

To opt out of future mailings about LA meetups, please remove your name from this list.

Facto Post – Issue 10 – 12 March 2018[edit]

Facto Post – Issue 10 – 12 March 2018
Content mine logo.png

Milestone for mix'n'match[edit]

Around the time in February when Wikidata clicked past item Q50000000, another milestone was reached: the mix'n'match tool uploaded its 1000th dataset. Concisely defined by its author, Magnus Manske, it works "to match entries in external catalogs to Wikidata". The total number of entries is now well into eight figures, and more are constantly being added: a couple of new catalogs each day is normal.

Since the end of 2013, mix'n'match has gradually come to play a significant part in adding statements to Wikidata. Particularly in areas with the flavour of digital humanities, but datasets can of course be about practically anything. There is a catalog on skyscrapers, and two on spiders.

These days mix'n'match can be used in numerous modes, from the relaxed gamified click through a catalog looking for matches, with prompts, to the fantastically useful and often demanding search across all catalogs. I'll type that again: you can search 1000+ datasets from the simple box at the top right. The drop-down menu top left offers "creation candidates", Magnus's personal favourite. m:Mix'n'match/Manual for more.

For the Wikidatan, a key point is that these matches, however carried out, add statements to Wikidata if, and naturally only if, there is a Wikidata property associated with the catalog. For everyone, however, the hands-on experience of deciding of what is a good match is an education, in a scholarly area, biographical catalogs being particularly fraught. Underpinning recent rapid progress is an open infrastructure for scraping and uploading.

Congratulations to Magnus, our data Stakhanovite!

Links[edit]

3D printing

To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 12:26, 12 March 2018 (UTC)

Facto Post – Issue 11 – 9 April 2018[edit]

Facto Post – Issue 11 – 9 April 2018
Content mine logo.png

The 100 Skins of the Onion[edit]

Open Citations Month, with its eminently guessable hashtag, is upon us. We should be utterly grateful that in the past 12 months, so much data on which papers cite which other papers has been made open, and that Wikidata is playing its part in hosting it as "cites" statements. At the time of writing, there are 15.3M Wikidata items that can do that.

Pulling back to look at open access papers in the large, though, there is is less reason for celebration. Access in theory does not yet equate to practical access. A recent LSE IMPACT blogpost puts that issue down to "heterogeneity". A useful euphemism to save us from thinking that the whole concept doesn't fall into the realm of the oxymoron.

Some home truths: aggregation is not content management, if it falls short on reusability. The PDF file format is wedded to how humans read documents, not how machines ingest them. The salami-slicer is our friend in the current downloading of open access papers, but for a better metaphor, think about skinning an onion, laboriously, 100 times with diminishing returns. There are of the order of 100 major publisher sites hosting open access papers, and the predominant offer there is still a PDF.

Red onion cross section

From the discoverability angle, Wikidata's bibliographic resources combined with the SPARQL query are superior in principle, by far, to existing keyword searches run over papers. Open access content should be managed into consistent HTML, something that is currently strenuous. The good news, such as it is, would be that much of it is already in XML. The organisational problem of removing further skins from the onion, with sensible prioritisation, is certainly not insuperable. The CORE group (the bloggers in the LSE posting) has some answers, but actually not all that is needed for the text and data mining purposes they highlight. The long tail, or in other words the onion heart when it has become fiddly beyond patience to skin, does call for a pis aller. But the real knack is to do more between the XML and the heart.

Links[edit]


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 16:25, 9 April 2018 (UTC)

Books & Bytes - Issue 27[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 27, February – March 2018

  • #1Lib1Ref
  • New collections
    • Alexander Street (expansion)
    • Cambridge University Press (expansion)
  • User Group
  • Global branches update
    • Wiki Indaba Wikipedia + Library Discussions
  • Spotlight: Using librarianship to create a more equitable internet: LGBTQ+ advocacy as a wiki-librarian
  • Bytes in brief

Arabic, Chinese and French versions of Books & Bytes are now available in meta!
Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 14:50, 18 April 2018 (UTC)

Arts Datathon![edit]

LA County Civic Arts Datathon!
Please join us for the LA County Arts Commission Civic Art Wikipedia Edit-a-thon. Beginners are welcome! We'll provide training for new editors.
(See the meetup page for more details.)
Friday, April 27, 2018, 9:00-5:00
Bob Hope Patriotic Hall, 1816 S. Figueroa St., Los Angeles, CA 90015.
We hope to see you there! MediaWiki message delivery (talk) 04:09, 25 April 2018 (UTC)
Join our Facebook group here, and follow us on Twitter .
To opt out of future mailings about LA meetups, please remove your name from this list.

Facto Post – Issue 12 – 28 May 2018[edit]

Facto Post – Issue 12 – 28 May 2018
Content mine logo.png

ScienceSource funded[edit]

The Wikimedia Foundation announced full funding of the ScienceSource grant proposal from ContentMine on May 18. See the ScienceSource Twitter announcement and 60 second video.

A medical canon?

The proposal includes downloading 30,000 open access papers, aiming (roughly speaking) to create a baseline for medical referencing on Wikipedia. It leaves open the question of how these are to be chosen.

The basic criteria of WP:MEDRS include a concentration on secondary literature. Attention has to be given to the long tail of diseases that receive less current research. The MEDRS guideline supposes that edge cases will have to be handled, and the premature exclusion of publications that would be in those marginal positions would reduce the value of the collection. Prophylaxis misses the point that gate-keeping will be done by an algorithm.

Two well-known but rather different areas where such considerations apply are tropical diseases and alternative medicine. There are also a number of potential downloading troubles, and these were mentioned in Issue 11. There is likely to be a gap, even with the guideline, between conditions taken to be necessary but not sufficient, and conditions sufficient but not necessary, for candidate papers to be included. With around 10,000 recognised medical conditions in standard lists, being comprehensive is demanding. With all of these aspects of the task, ScienceSource will seek community help.

Links[edit]

OpenRefine logo, courtesy of Google

To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM. ScienceSource pages will be announced there, and in this mass message.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 10:16, 28 May 2018 (UTC)

Books & Bytes – Issue 28[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 28, April – May 2018

  • #1Bib1Ref
  • New partners
  • User Group update
  • Global branches update
    • Wikipedia Library global coordinators' meeting
  • Spotlight: What are the ten most cited sources on Wikipedia? Let's ask the data
  • Bytes in brief

Arabic, Chinese, Hindi, Italian and French versions of Books & Bytes are now available in meta!
Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 19:33, 20 June 2018 (UTC)

Facto Post – Issue 13 – 29 May 2018[edit]

Facto Post – Issue 13 – 29 May 2018
Logo for ContentMine ScienceSource.svg

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.
Back numbers are here.

Respecting MEDRS

Facto Post enters its second year, with a Cambridge Blue (OK, Aquamarine) background, a new logo, but no Cambridge blues. On-topic for the ScienceSource project is a project page here. It contains some case studies on how the WP:MEDRS guideline, for the referencing of articles at all related to human health, is applied in typical discussions.

Close to home also, a template, called {{medrs}} for short, is used to express dissatisfaction with particular references. Technology can help with patrolling, and this Petscan query finds over 450 articles where there is at least one use of the template. Of course the template is merely suggesting there is a possible issue with the reliability of a reference. Deciding the truth of the allegation is another matter.

This maintenance issue is one example of where ScienceSource aims to help. Where the reference is to a scientific paper, its type of algorithm could give a pass/fail opinion on such references. It could assist patrollers of medical articles, therefore, with the templated references and more generally. There may be more to proper referencing than that, indeed: context, quite what the statement supported by the reference expresses, prominence and weight. For that kind of consideration, case studies can help. But an algorithm might help to clear the backlog.

Evidence pyramid leading up to clinical guidelines, from WP:MEDRS
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 18:19, 29 June 2018 (UTC)

Facto Post – Issue 14 – 21 July 2018[edit]

Facto Post – Issue 14 – 21 July 2018
Logo for ContentMine ScienceSource.svg

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.
Back numbers are here.

Plugging the gaps – Wikimania report

Officially it is "bridging the gaps in knowledge", with Wikimania 2018 in Cape Town paying tribute to the southern African concept of ubuntu to implement it. Besides face-to-face interactions, Wikimedians do need their power sources.

Hackathon mentoring table wiring

Facto Post interviewed Jdforrester, who has attended every Wikimania, and now works as Senior Product Manager for the Wikimedia Foundation. His take on tackling the gaps in the Wikimedia movement is that "if we were an army, we could march in a column and close up all the gaps". In his view though, that is a faulty metaphor, and it leads to a completely false misunderstanding of the movement, its diversity and different aspirations, and the nature of the work as "fighting" to be done in the open sector. There are many fronts, and as an eventualist he feels the gaps experienced both by editors and by users of Wikimedia content are inevitable. He would like to see a greater emphasis on reuse of content, not simply its volume.

If that may not sound like radicalism, the Decolonizing the Internet conference here organized jointly with Whose Knowledge? can redress the picture. It comes with the claim to be "the first ever conference about centering marginalized knowledge online".

Plugbar buildup at the Hackathon
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery (talk) 06:10, 21 July 2018 (UTC)

Books & Bytes – Issue 29[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 29, June – July 2018

Hindi, Italian and French versions of Books & Bytes are now available in meta!
Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 16:02, 25 August 2018 (UTC)

WikiProject X Newsletter • Issue 12[edit]

WikiProject X icon.svg
Newsletter • August 2018

This month: WikiProject X: The resumption

Work has resumed on WikiProject X and CollaborationKit, backed by a successfully funded Project Grant. For more information on the current status and planned work, please see this month's issue of the newsletter!

-— Isarra 22:24, 30 August 2018 (UTC)

7th Annual Los Angeles Wiknic[edit]

It's the 7th Annual Los Angeles Wiknic!

Sunday, September 30, 11:00-4:00 PM
Pan Pacific Park, 7600 Beverly Blvd, Los Angeles, CA 90036
Hang out. Consume crowd-sourced BBQ! Bask in the glory of late September in Los Angeles (and the glory
of our new user group, Wikimedians of Los Angeles).
RSVP (and volunteer) here.
We hope to see you there! JSFarman (talk) 02:50, 9 September 2018 (UTC)
Join our Facebook group, or follow us on Twitter!

To opt out of future mailings about LA meetups, please remove your name from this list.

Results from global Wikimedia survey 2018 are published[edit]

19:25, 1 October 2018 (UTC)

Many thanks for your work and comment ...[edit]

Risk Engineer (talk) 13:24, 7 October 2018 (UTC)

Books & Bytes, Issue 30[edit]

Wikipedia Library owl.svg The Wikipedia Library

Bookshelf.jpg

Books & Bytes
Issue 30, August – Septmeber 2018

  • Library Card translation
  • Spotlight: 1Lib1Ref spreads to the Southern Hemisphere and beyond
  • Wikimedia and Libraries User Group update
  • Global branches update
  • Bytes in brief

French version of Books & Bytes is now available in meta!
Read the full newsletter

Sent by MediaWiki message delivery on behalf of The Wikipedia Library team --MediaWiki message delivery (talk) 03:43, 25 October 2018 (UTC)

ArbCom 2018 election voter message[edit]

Scale of justice 2.svgHello, PKM. Voting in the 2018 Arbitration Committee elections is now open until 23.59 on Sunday, 3 December. All users who registered an account before Sunday, 28 October 2018, made at least 150 mainspace edits before Thursday, 1 November 2018 and are not currently blocked are eligible to vote. Users with alternate accounts may only vote once.

The Arbitration Committee is the panel of editors responsible for conducting the Wikipedia arbitration process. It has the authority to impose binding solutions to disputes between editors, primarily for serious conduct disputes the community has been unable to resolve. This includes the authority to impose site bans, topic bans, editing restrictions, and other measures needed to maintain our editing environment. The arbitration policy describes the Committee's roles and responsibilities in greater detail.

If you wish to participate in the 2018 election, please review the candidates and submit your choices on the voting page. MediaWiki message delivery (talk) 18:42, 19 November 2018 (UTC)