I have a strange problem that I am trying to fix without success. I had a large batch of text files containing translations of early Christian writings that I'm using in a corpus linguistic study. These started out as UTF-8 files on the Macintosh but were sent to a friend for some processing. when he returned them, it was apparent that his processes had assumed the files to be ISO Latin-1 encoding as an input and then saved them as UTF-8, leading to some serious garbling, things like
This problem is manually fixable through the following process:
1. open the file 2. change the encoding from the menu at the bottom of the text window to ISO Latin-1 3. save the file 4. use the Reopen Using Encoding > UTF-8 command 5. resave the file
Not a big deal, but I have literally thousands of these files (some 74 MB worth) and don't want to manually process them. I don't know AppleScript (which I am sure could be used to automate that process) and Text Factories don't allow access to all of those steps, so I can't use them.
Does anyone have a suggestion for an automatable process or a script that I could use to process a large batch of files as listed above?
--~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the "BBEdit Talk" discussion group on Google Groups. To post to this group, send email to firstname.lastname@example.org To unsubscribe from this group, send email to email@example.com For more options, visit this group at http://groups.google.com/group/bbedit?hl=en If you have a feature request or would like to report a problem, please email "firstname.lastname@example.org" rather than posting to the group. -~----------~----~----~----~------~----~------~--~-
This site manages and broadcasts several email lists pertaining to Lasso Programming and technologies related and used by Lasso developers. Sign up today!