Yahoo Groups archive

Milter-greylist

Index last updated: 2026-04-28 23:32 UTC

Message

Re: [milter-greylist] [off-topic] OCR milter?

2006-11-03 by Jack L. Stone

At 09:38 PM 11.2.2006 +0100, you wrote:
>Oliver Fromme <olli@...> wrote:
>
>> In order to change even a single pixel, the spammer would
>> have to decompress the image, and then compress it again.
>> That costs quite a bit of CPU resources, so I don't think
>> they're doing that when sending millions of spam mails.
>
>I suspect they just build the image on the fly when sending spam. That
>costs nearly nothing.
> 
>> It might work on personal machines that receive only mail
>> for one person, or maybe a few users.  (Obviously that's
>> already the case, see SpamAssassin + gocr.)  But it won't
>> work on large servers that receive mail (and spam) for
>> hundreds or thousands of people.
>
>It's easy to scatter the job on several machines...
>
>-- 
>Emmanuel Dreyfus
>http://hcpnet.free.fr/pubz
>manu@...
>

I don't know how this fits into the picture with MGL, but have you looked
at the SA's alternative to the OCR?

It seemed to me that the OCR used for SA had way too many working parts,
and Dallas Engelken provided a image filter ruleset alternative that had
done an excellent job of catching the image spams. None are getting through
here.

# ImageInfo Plugin for SpamAssassin
# Version: 0.6
# Current Home: http://www.rulesemporium.com/plugins.htm#imageinfo
# Created: 2006-08-02
# Modified: 2006-10-04
# By: Dallas Engelken <dallase@...>
#
# Changes: 
#   0.6 - fixed dems_ bug in image_size_range_
#   0.5 - added image_named and image_to_text_ratio
#   0.4 - added image_size_exact and image_size_range
#   0.3 - added jpeg support
#   0.2 - optimized by theo
#   0.1 - added gif/png support
#
# Files:
#   ImageInfo.pm (plugin)  - http://www.rulesemporium.com/plugins/ImageInfo.pm
#   imageinfo.cf (ruleset) - http://www.rulesemporium.com/plugins/imageinfo.cf
#   

(^_^)
Happy trails,
Jack L. Stone

System Admin
Sage-american

Attachments

Move to quarantaine

This moves the raw source file on disk only. The archive index is not changed automatically, so you still need to run a manual refresh afterward.