Web scraping is the practice of harvesting large amounts of data from the internet and storing it locally or in a database for later processing. For example, one might pull articles from many scientific journals and analyze the combined text.
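As a rough illustration of that harvest-then-store loop, here is a minimal sketch using only Python's standard library. It is not the code behind the projects listed here: the helper names (`TextExtractor`, `extract_text`, `fetch_html`, `store_page`), the `pages` table, and the example URL are all hypothetical, and SQLite stands in for whatever database a real project would use.

```python
import sqlite3
from html.parser import HTMLParser
from urllib.request import urlopen


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def extract_text(html):
    """Return the visible text of an HTML document as one string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def fetch_html(url):
    """Download a page (hypothetical helper; not called in the demo below)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def store_page(conn, url, text):
    """Save the extracted text, keyed by URL (assumed schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, body TEXT)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO pages (url, body) VALUES (?, ?)", (url, text)
    )
    conn.commit()


# Offline demo: parse an inline HTML sample instead of hitting the network.
sample_html = (
    "<html><head><style>p {color: red}</style></head>"
    "<body><h1>Abstract</h1><p>TB samples vary.</p></body></html>"
)
conn = sqlite3.connect(":memory:")
store_page(conn, "https://example.org/article", extract_text(sample_html))
stored_body = conn.execute("SELECT body FROM pages").fetchone()[0]
```

In a real scraper, `fetch_html` would feed `extract_text` for each URL in a list, ideally with rate limiting and respect for each site's terms of use and robots.txt.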

Theory Jar, September 01, 2017

The aim of Theory Jar was to translate scientific jargon into lay language. I was responsible for scraping text to build a corpus, which I processed with Python's Natural Language Toolkit (NLTK). With this accomplished, a MySQL database was connected to a web application that could accept any…

Tuberculosis Sample Study, November 23, 2016

A total of 28,000 tuberculosis (TB) sample entries were extracted from the NCBI Database to better understand the wide array of descriptors entered when these samples were uploaded. A custom frequency-based word cloud was created from all 322 descriptors to highlight the difficulty in…