From: Stephen Leake
Newsgroups: comp.lang.ada
Subject: Re: Those annoying HMTL entities from Google Groups
Date: Tue, 17 Jul 2012 06:41:34 -0400
Message-ID: <85liiiy8ip.fsf@stephe-leake.org>

Simon Wright writes:

> You know how, of late, there have been a lot of HTML entities (for
> example, &quot;, &#39;, &gt; for ", ', and > respectively) in postings
> from people who're using Google Groups? Well, I haven't worked out how
> to translate them while reading,

There's already a package for this: html2text.
I've enhanced it for use at work, where I read Outlook-generated email
with Emacs:

(require 'html2text)

;; Numeric character references
(add-to-list 'html2text-replace-list (cons "&#8217;" "'"))
(add-to-list 'html2text-replace-list (cons "&#39;" "'"))
(add-to-list 'html2text-replace-list (cons "&#8211;" "-"))

;; Named entities
(add-to-list 'html2text-replace-list (cons "&lsquo;" "'"))
(add-to-list 'html2text-replace-list (cons "&rsquo;" "'"))
(add-to-list 'html2text-replace-list (cons "&ldquo;" "'"))
(add-to-list 'html2text-replace-list (cons "&rdquo;" "'"))
(add-to-list 'html2text-replace-list (cons "&apos;" "'"))
(add-to-list 'html2text-replace-list (cons "&hellip;" "..."))
(add-to-list 'html2text-replace-list (cons "&ndash;" "-"))

;; Tags to strip outright; "br" is handled by a formatter below instead
(add-to-list 'html2text-remove-tag-list "sup")
(setq html2text-remove-tag-list (delete "br" html2text-remove-tag-list))
(add-to-list 'html2text-remove-tag-list "style")
(add-to-list 'html2text-remove-tag-list "span")

;; Delete the tag and put a newline in its place
(defun html2text-clean-newline (p1 p2 p3 p4)
  (html2text-delete-tags p1 p2 p3 p4)
  (newline))

(add-to-list 'html2text-format-tag-list (cons "o:p" 'html2text-clean-newline))
(add-to-list 'html2text-format-tag-list (cons "br" 'html2text-clean-newline))

;; Delete HTML comments, delimiters included
(defun html2text-delete-comment ()
  (interactive)
  (let ((buffer-read-only))
    (goto-char (point-min))
    (while (re-search-forward "<!--" (point-max) t)
      (delete-region (match-beginning 0)
                     (re-search-forward "-->" (point-max) t)))))

;; Delete the <xml>...</xml> blocks that Outlook embeds
(defun html2text-delete-xml ()
  (interactive)
  (let ((buffer-read-only))
    (goto-char (point-min))
    (while (re-search-forward "<xml>" (point-max) t)
      (delete-region (match-beginning 0)
                     (re-search-forward "</xml>" (point-max) t)))))

(defun html-clean ()
  (interactive)
  (html2text)
  (html2text-delete-comment)
  (html2text-delete-xml))

In a buffer with html:

M-x html-clean

Do that before replying.

-- 
-- Stephe