This week's book giveaway is in the OCMJEA forum. We're giving away four copies of OCM Java EE 6 Enterprise Architect Exam Guide and have Paul Allen & Joseph Bambara on-line! See this thread for details.
PDF is -for the most part- a read-only format. While some limited alterations are possible, you can't extract the layout information. It's possible to extract the text they contain using libraries such as PDFBox and JPedal.