SpamBayes data file "Out of Balance".../

  • Thread starter Thread starter Kenneth
  • Start date Start date
K

Kenneth

Howdy,

I am running the most recent version of SpamBayes Outlook PlugIn...

I trained it on approximately 70 spams and 70 hams, and all was well.

SB occasionally misses a spam, and I have been training on those. SB
virtually never gives me a false positive (that is, characterizing a
ham as spam.)

As a result, the ratio of spams to hams in my database quickly goes up
over 2:1 which I understand from the FAQs is not the best way to have
things set up.

When this happens, I have thought to train on more hams in the hope of
getting the DB into better "balance" but I can't figure out how to
train on hams only. I can't move hams from the spam folder because
none are in there.

What is the best way for me to handle the situation I describe? (And I
will add, I have looked at the online support materials without
finding an answer to this one.)

Very sincere thanks for any help,
 
Kenneth said:
Howdy,

I am running the most recent version of SpamBayes Outlook PlugIn...

I trained it on approximately 70 spams and 70 hams, and all was well.

SB occasionally misses a spam, and I have been training on those. SB
virtually never gives me a false positive (that is, characterizing a
ham as spam.)

As a result, the ratio of spams to hams in my database quickly goes up
over 2:1 which I understand from the FAQs is not the best way to have
things set up.

When this happens, I have thought to train on more hams in the hope of
getting the DB into better "balance" but I can't figure out how to
train on hams only. I can't move hams from the spam folder because
none are in there.

What is the best way for me to handle the situation I describe? (And I
will add, I have looked at the online support materials without
finding an answer to this one.)

Very sincere thanks for any help,

Honestly, I don't think this is a problem. I've got SpamBayes running
on 30 computers at work, with ratios anywhere from 1:1 to 12:1.
SpamBayes has always continued functioning. We set "definate spam"
detection at 90% and possible spam is set to "do nothing". with this
setup we achieve about a 95% success rate with no false positives. My
suggestion - it's fine, just leave it.

John H. E-mail: (e-mail address removed)
Website: John's Best of FreeWare: WWW:Jhoodsoft.org
 
Honestly, I don't think this is a problem. I've got SpamBayes running
on 30 computers at work, with ratios anywhere from 1:1 to 12:1.
SpamBayes has always continued functioning. We set "definate spam"
detection at 90% and possible spam is set to "do nothing". with this
setup we achieve about a 95% success rate with no false positives. My
suggestion - it's fine, just leave it.

John H. E-mail: (e-mail address removed)
Website: John's Best of FreeWare: WWW:Jhoodsoft.org

Howdy,

Thanks for the useful info...

All the best,
 
Back
Top