Search Mailing List Archives


Limit search to: Subject & Body Subject Author
Sort by: Reverse Sort
Limit to: All This Week Last Week This Month Last Month
Select Date Range     through    

[liberationtech] On the Feasibility of Internet-Scale Author Identification

Steve Weis steveweis at gmail.com
Mon Feb 20 19:47:14 PST 2012


There is some interesting research coming out from a team of Stanford &
Berkeley researchers about large-scale de-anonymization of blog posts based
on writing style, i.e. stylometry.

For a given blog post, the researchers were able to positively identify an
individual author from among 100,000 possibilities 20% of the
time. However, their method does not work if authors deliberately obfuscate
their writing style.

Here's the paper draft:
http://randomwalker.info/publications/author-identification-draft.pdf

And a blog post about it:
http://33bits.org/2012/02/20/is-writing-style-sufficient-to-deanonymize-material-posted-online/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.stanford.edu/pipermail/liberationtech/attachments/20120220/a2efad99/attachment.html>


More information about the liberationtech mailing list