For correcting invalid (x)HTML, even Microsoft Word crap, I know of no better solution than HTML Purifier.
Unfortunately, I can't use HTML Purifier with my current client because it's too restrictive for his needs. So what I'm looking for is something that does nothing other than strip or (preferably) convert non-ASCII characters to their equivalent HTML entities, e.g. “My problem,” he said, “is simple—WYSIWYGs.” would become &ldquo;My problem,&rdquo; he said, &ldquo;is simple&mdash;WYSIWYGs.&rdquo; I have a sense that a good WYSIWYG should do this, but I haven't had any success with FCKEditor's "paste from Word" feature. Has anyone else? Does TinyMCE do any better?
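To make the kind of conversion I mean concrete, here is a rough sketch in Python (not Drupal or PHP code, just an illustration). It leaves ASCII untouched and swaps everything else for a named entity where the standard library's `html.entities.codepoint2name` table has one, falling back to a numeric reference otherwise. The function name `encode_non_ascii` is made up for this example.

```python
from html.entities import codepoint2name


def encode_non_ascii(text: str) -> str:
    """Replace every non-ASCII character with an HTML entity.

    Hypothetical helper for illustration only. ASCII characters,
    including existing markup, pass through unchanged.
    """
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 128:
            # Plain ASCII: keep as-is.
            out.append(ch)
        elif cp in codepoint2name:
            # Named entity exists, e.g. U+201C -> &ldquo;
            out.append(f"&{codepoint2name[cp]};")
        else:
            # No named entity: fall back to a numeric reference.
            out.append(f"&#{cp};")
    return "".join(out)


sample = "\u201cMy problem,\u201d he said, \u201cis simple\u2014WYSIWYGs.\u201d"
print(encode_non_ascii(sample))
```

If numeric references alone are acceptable, `text.encode("ascii", "xmlcharrefreplace").decode("ascii")` gets you most of the way there without a loop.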
Thanks
Travis Carden
1 year 52 weeks ago
Does it do titles too? My problem is that even the entities for things like left double quotes create headaches for titles, in some cases causing Drupal to store the title as a blank. This, in turn, causes Pathauto to screw up. CKEditor does not fix this because it only deals with the body.
-Nancy E. Wichmann, PMP
That's a good question, Nancy. I've encountered the same thing where a client copies curly quotes into a page title and then the Pathauto-generated path doesn't work. Unfortunately, HTML Purifier can't do anything about that because it's implemented as an input filter, and you can't apply input formats to titles.
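For titles, one possible workaround is to fold typographic punctuation down to plain ASCII before the title is saved, so that neither raw curly quotes nor their entities ever reach Pathauto. The following is only a sketch of the idea in Python, not something Drupal or HTML Purifier provides; the name `ascii_fold_title` and the character mapping are assumptions made for this example, and where you would hook it in is left open.

```python
# Hypothetical mapping of common Word-style punctuation to ASCII.
TYPOGRAPHIC_TO_ASCII = str.maketrans({
    "\u2018": "'",    # left single quote
    "\u2019": "'",    # right single quote / apostrophe
    "\u201c": '"',    # left double quote
    "\u201d": '"',    # right double quote
    "\u2013": "-",    # en dash
    "\u2014": "-",    # em dash
    "\u2026": "...",  # horizontal ellipsis
    "\u00a0": " ",    # non-breaking space
})


def ascii_fold_title(title: str) -> str:
    """Return the title with typographic punctuation replaced by ASCII.

    Characters not listed in the mapping are left unchanged.
    """
    return title.translate(TYPOGRAPHIC_TO_ASCII)


print(ascii_fold_title("\u201cMy problem\u201d \u2014 it\u2019s simple\u2026"))
```

Anything outside the mapping (accented letters, for instance) passes through untouched, so this only covers the pasted-from-Word punctuation case.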
Cheers,
Travis