The results won't be enough to test that unless you had good knowledge of the entire ecosystem, i.e. what did it leave out? Sounds expensive to verify blindly.
Hive account@timcliff seems to fit the bill, and I believe he's taken an interest in your project.
RE: Visualising Spam Scores