Search Mailing List Archives

Limit search to: Subject & Body Subject Author
Sort by: Reverse Sort
Limit to: All This Week Last Week This Month Last Month
Select Date Range     through    

[liberationtech] On the Feasibility of Internet-Scale Author Identification

Steve Weis steveweis at
Mon Feb 20 19:47:14 PST 2012

There is some interesting research coming out from a team of Stanford &
Berkeley researchers about large-scale de-anonymization of blog posts based
on writing style, i.e. stylometry.

For a given blog post, the researchers were able to positively identify an
individual author from among 100,000 possibilities 20% of the
time. However, their method does not work if authors deliberately obfuscate
their writing style.

Here's the paper draft:

And a blog post about it:
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the liberationtech mailing list