Andrew Friedman

afriedman412 [at] gmail [dot] com

HomeWorkProjectsOpen SourceWritingContent


OPEN SOURCE CONTRIBUTIONS


PDFPlumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables..

FPDF2

Simple PDF generation for Python

Textacy

Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. With the fundamentals --- tokenization, part-of-speech tagging, dependency parsing, etc. --- delegated to another library, textacy focuses primarily on the tasks that come before and follow after.