The API Briefing: NIH’s Pillbox – Unlocking Valuable Government Health Information through APIs

May 28, 2014

National Library of Medicine (NLM) Pillbox app icon
The Food and Drug Administration collects drug labeling information for human prescription, over-the-counter, homeopathic, and veterinary products through a special markup language called “Structured Product Labeling” (SPL). The database created from the SPL submissions is a treasure trove of health information that is valuable to pharmacists, doctors, and the ordinary health consumer. The problem is that data is hard for developers to access and process. Until recently, when the National Library of Medicine released open source code for “Pillbox.”

Available on Github, Pillbox is a collection of Python scripts that download the SPL data files, process the data files along with other data sources to create a “pill-focused” view of the information, and provide the data into formats that can be easily used by APIs. Pillbox is available as an API or developers can download and incorporate the scripts into their own applications. Developers can use Pillbox to power applications that can check for drug interactions, help consumers identify a pill by taking a picture of it with their smartphone, or maintain a list of personal medical prescriptions.

Pillbox is a great example of how to use the open source community to help develop government APIs. By releasing Pillbox on Github, the National Library of Medicine invites anyone to copy (“fork” in Github terms) the scripts and make their own improvements which could be incorporated back into the original code (“commit”). The National Library of Medicine also encourages users to find and describe errors in the Pillbox drug information data. The Federal government benefits by having an army of volunteers to improve the pill data and to find new uses for it.

However, the greater benefits are to the American public. The National Library of Medicine’s approach behind Pillbox is smart in that it creates an easier way to access vital government information and encourages developers to find innovative uses for vitally-needed health information. Pillbox’s open source code was made possible through Health and Human Services’ Idealab program (watch the Pillbox pitch video). I wonder what other government data sources could benefit from a similar method of development.Each week in “The API Briefing,” I will showcase government APIs and the latest API news and trends. Visit this blog every week to learn how government APIs are transforming government and improving government services for the American people. If you have ideas for a topic or have questions about APIs, please contact me via email. Dr. William A. Brantley, PMP, is a Program Analyst, in Forecasting and Methods at the U.S. Office of Personnel Management (OPM). You can find out more about his work in this space at BillBrantley.com.