Daily Archives: October 20, 2009

PDF to text in python

PDFMiner is a suite of programs in python that help extracting and analyzing text data of PDF documents. Unlike other PDF-related tools, it allows to obtain the exact location of texts in a page, as well as other extra information … Continue reading

Leave a comment