A good walkthrough of how a security professional tried to scan a client for confidential information, essentially by copying every accessible file and then parsing the copies.
The second half of this document is a fantastic resource on converting PDF files to plain ASCII text (.txt) files.