I would recommend using hash tables for simplicity and performance. The main problem with your idea is the encouragement for abuse. People spamming random jibberish at 500 characters per line.
Well. At least I won lunch. Good philosophy, see good in bad, I like!
|