A model for file structure determination for large on-line data files.