csv,conf 2014 - July 15, 2014
14:00 The Content Mine by Peter Murray-Rust @petermurrayrust
- "liberating scientific facts for humanity"
- crawl, then data-mine the literature, and publish the resulting facts under CC0
- uses technology to extract information from PDFs, "can now turn PDFs into science"
- information will go to Wikipedia and as linked open data to DBpedia
- site is contentmine.org
Peter is relying on the UK Exceptions to Copyright for Research (PDF), specifically the text and data mining exception.
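To make the "mine, then publish facts as CC0" step concrete, here is a minimal sketch. It is not ContentMine's actual tooling. It assumes crawling and PDF-to-text extraction have already happened, and the `mine_facts` and `publish` names, the temperature pattern, and the record layout are all hypothetical choices for illustration.

```python
import json
import re

# Hypothetical pattern: a number followed by a Celsius unit, e.g. "37.5 °C".
TEMPERATURE = re.compile(r"(-?\d+(?:\.\d+)?)\s*°\s*C")


def mine_facts(doc_id, text):
    """Return one record per temperature mention found in `text`.

    `text` is assumed to be plain text already extracted from a paper.
    """
    facts = []
    for match in TEMPERATURE.finditer(text):
        facts.append({
            "source": doc_id,               # which document the fact came from
            "type": "temperature_celsius",  # hypothetical fact type label
            "value": float(match.group(1)),
            "license": "CC0-1.0",           # facts are released as CC0
        })
    return facts


def publish(facts):
    """Serialize the facts as JSON Lines, one record per line."""
    return "\n".join(json.dumps(fact, sort_keys=True) for fact in facts)


if __name__ == "__main__":
    sample = "Samples were held at 37.5 °C and then cooled to -4 °C."
    print(publish(mine_facts("paper-1", sample)))
```

A real pipeline would need far richer extraction than one regular expression, plus a mapping to linked-data vocabularies before anything could go to Wikipedia or DBpedia. The sketch only shows the shape of a mined, openly licensed fact record.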