2020 article

Analysis of Wikipedia pageviews to identify popular chemicals

REPORTERS, MARKERS, DYES, NANOPARTICLES, AND MOLECULAR PROBES FOR BIOMEDICAL APPLICATIONS XII, Vol. 11256.

By: Y. Cao*, H. Mehta*, A. Norcross n , M. Taniguchi n  & J. Lindsey n 

co-author countries: United States of America πŸ‡ΊπŸ‡Έ
author keywords: Wikipedia; Database; Pageview; Search; SMILES; Medicine; Pharmaceutical; World Health Organization
Source: Web Of Science
Added: September 28, 2020

A new approach to assess popularity relies on analysis of the number of times a web article is viewed. Here, a strategy is described to identify chemicals of widespread interest. The strategy makes use of Wikipedia, a rapidly growing publicly editable web encyclopedia that has become an influential knowledge base. While the total number of chemicals mentioned in Wikipedia is unknown, use of the Wikipedia Chemical Structure Explorer (WCSE) developed by Novartis enables identification of those that are described in an Infobox or Chembox along with a Simplified Molecular-Input Line-Entry system (SMILES) code. Using a Python script, all so-listed chemicals (16,243) in Wikipedia were identified and then sorted on the basis of their pageview rankings. Of the 16,243 chemicals, 846 (5.2%) belonged to controlled substances (United States Drug Enforcement Administration), WHO essential medicines, or the top 300 US drugs. These 846 chemicals received 220 million pageviews, which is 41.4% of the pageviews for all members of the Wikipedia chemical list. The number of chemicals described in the entire corpus of Wikipedia remains a tiny fraction of the &lt;10<sup>7</sup> known chemicals. Much remains to be done to make the venerable literature and data of chemistry readily accessible. Regardless, identification of popular chemicals in this manner can be used to create selected databases, to tailor educational curricula, or to create targeted informational materials (such as safety brochures); such considerations of public demand are likely to engender corresponding widespread interest.