The Apache project (http://www.apache.org/) has something that lets you read Excel files: HSSF (Horrible SpreadSheet Format), which is part of Apache POI. Maybe that will point you in the right direction. I'm not sure about PDF.