You can likewise investigate PDFMiner (or for more older versions of Python see PDFMiner and PDFMiner).
A specific component of interest in PDFMiner is that you can handle how it pulls together text parts while separating them. You do this by determining the space between lines, words, characters, and so forth. Along these lines, perhaps by tweaking this you can accomplish what you need (that depends of the inconstancy of your documents). PDFMiner can likewise give you the area of the text in the page, it can separate information by Object ID and other stuff. So dive in PDFMiner and be imaginative!
Are you pretty much interested to learn python in detail? Come and join the python training course to gain more knowledge.