However, I don't think all users would be comfortable with this data being made public.
Any suggestions on how to get good targeted keywords despite that?
The only idea I've got at the moment is to jumble all the data for a given page, pick out the relatively uncommon words, repeat them a bit, and serve that up as a stew to anyone who isn't the owner of the data.
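
For what it's worth, here is a rough sketch of that "stew" idea in Python. It is only an illustration under some assumptions: `keyword_stew`, `COMMON_WORDS`, `max_words`, and `repeats` are made-up names, and the tiny common-word set stands in for a real word-frequency list. "Uncommon" is approximated here as "not in the common-word set and longer than three letters".

```python
import random
import re
from collections import Counter

# Hypothetical stand-in for a real word-frequency list; a fuller version
# would probably rank words against corpus frequencies instead.
COMMON_WORDS = {
    "the", "a", "an", "and", "or", "of", "to", "in", "on", "for",
    "is", "it", "my", "i", "we", "with", "at", "this", "that", "was",
}


def keyword_stew(page_text, max_words=10, repeats=2, seed=None):
    """Return a shuffled, repeated bag of the page's less common words."""
    # Break the page into lowercase words, discarding punctuation and digits.
    words = re.findall(r"[a-z]+", page_text.lower())

    # Drop common and very short words (assumed filter, see note above).
    candidates = [w for w in words if w not in COMMON_WORDS and len(w) > 3]

    # Keep the candidates that appear most often on this page.
    counts = Counter(candidates)
    keywords = [word for word, _ in counts.most_common(max_words)]

    # Repeat each keyword, then shuffle so the original sentences
    # cannot be read back from the output.
    stew = keywords * repeats
    random.Random(seed).shuffle(stew)
    return " ".join(stew)


page = "I bought a mountain bike and the mountain trails were great for my bike"
print(keyword_stew(page, max_words=2, repeats=2, seed=0))
```

The shuffling only hides word order. The selected words themselves are still shown to everyone, so anything that is sensitive as a single word would need to be filtered out separately.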