A Python wrapper for Apache Tika, a Java toolkit that detects and extracts metadata and text from over a thousand different file types.