[Ifile-discuss] Re: html tag stripping
From: David Bushong
Subject: [Ifile-discuss] Re: html tag stripping
Date: Wed, 25 Jun 2003 14:08:56 -0700
User-agent: Mutt/1.4i
On Wed, Jun 25, 2003 at 10:48:17PM +0200, clemens fischer wrote:
> * David Bushong:
>
> > Yo<kc34sma21py2>uve rea<khuyowp1wuizl>d about them in the
> > P<ks4nj3w258mkq1>apers....
> >
> > (If you're reading this list in HTML, try turning it off). Basically, this
> > completely ruins ifile's effectiveness. However a simple addition to the
> > word tokenizer to skip anything between matched <>'s would completely avoid
> > this problem (as well as stop making "font", "color", etc. my most popular
> > words, spam or otherwise).
>
> you've got my vote, because it's simple. then again, people who use
> ifile for something other than spam-filtering may not like it. i think
> nothing i've seen in ifile development has ever diminished applicability
> to text messages, be they email or usenet, but there have been many attempts
> to let it un-base64 MIME parts or whatnot. up to now, this hasn't
> happened.
>
Well, even if people are filtering non-email through it, it doesn't handle
tagged input gracefully. An option to do a simple, naïve tag-strip seems
like a win to me.
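
To make the idea concrete, here is a rough sketch of what I mean, in Python
rather than ifile's own code. The names (strip_tags, tokenize, the strip flag)
are made up for illustration and are not anything ifile provides today. It is
deliberately naive: it drops any run between a '<' and the next '>', so plain
text like "1 < 2 and 3 > 2" would lose its middle too.

```python
import re

# Hypothetical names for illustration; none of this is ifile's actual API.
TAG_RE = re.compile(r"<[^<>]*>")    # a '<', anything but angle brackets, a '>'
WORD_RE = re.compile(r"[A-Za-z']+")  # very simple notion of a "word"

def strip_tags(text):
    """Remove everything between matched '<' and '>' (naive, not an HTML parser)."""
    return TAG_RE.sub("", text)

def tokenize(text, strip=True):
    """Split text into lowercase words, optionally stripping tags first."""
    if strip:
        text = strip_tags(text)
    return [w.lower() for w in WORD_RE.findall(text)]

if __name__ == "__main__":
    spam = "Yo<kc34sma21py2>uve rea<khuyowp1wuizl>d about them"
    print(tokenize(spam, strip=False))  # words are broken up by the fake tags
    print(tokenize(spam))               # words are rejoined once tags are gone
```

Keeping it behind a flag means anyone feeding ifile text where angle brackets
matter can leave it switched off.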
--David Bushong