Something about "55000 hacked Twitter accounts"
This week, several internet news portals carried the announcement of the "hacking" of 55000 Twitter accounts. This kind of news tends to attract the attention of the net and to show how fragile it is. I like to spend some time looking at the published data and speculating a bit.
Yes, after concatenating the five parts from pastebin, we have 58970 records. But when we check how many of them are unique, only 36998 are. So, up to this point, the announced figure of 55000 does not hold up.
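A minimal sketch of that check, assuming each record sits on its own line and that duplicates are exact repeats of a line. The function name `count_records` and the sample records are hypothetical, not taken from the pastebin dump.

```python
from typing import Iterable, Tuple


def count_records(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (total, unique) counts of non-empty records."""
    # Strip whitespace and skip blank lines left over from concatenation.
    records = [line.strip() for line in lines if line.strip()]
    return len(records), len(set(records))


# Hypothetical sample standing in for the five concatenated parts.
sample = [
    "a@example.com:123456",
    "b@example.com:sexo",
    "a@example.com:123456",  # exact duplicate of the first record
    "",
]
total, unique = count_records(sample)
print(total, unique)
```

With the real data, the five files would be read and chained into one iterable of lines before calling the function.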
We can treat the password list as a corpus, which makes it useful for linguistic analysis and text mining.
First, the most popular passwords:
Very interesting, then:
- The first 16 passwords are numbers.
- The most popular password is 315475 (a mystery to evaluate).
- The first word is "sexo".
- Portuguese passwords seem to predominate (by popularity).
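The ranking and the first observations can be reproduced with a sketch along these lines. It assumes the passwords have already been extracted into a list of strings; the helper names and the toy data are hypothetical and only illustrate the technique.

```python
from collections import Counter
from itertools import takewhile


def rank_passwords(passwords):
    """Return (password, count) pairs, most frequent first."""
    return Counter(passwords).most_common()


def leading_numeric(ranking):
    """Count how many entries at the top of the ranking are all digits."""
    return sum(1 for _ in takewhile(lambda pc: pc[0].isdigit(), ranking))


def first_word(ranking):
    """Return the first ranked password that is purely alphabetic, or None."""
    return next((p for p, _ in ranking if p.isalpha()), None)


# Toy data, not the leaked list.
toy = ["123456"] * 3 + ["315475"] * 4 + ["sexo"] * 2 + ["abc123"]
ranking = rank_passwords(toy)
print(ranking[0], leading_numeric(ranking), first_word(ranking))
```

Deciding which language predominates would need a word list or a language detector on top of this, which the sketch does not attempt.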
What is 315475?
- The phone prefix of Syracuse, NY (USA).
- A hex color? Do 580 people love blue?
- A common password from a spambot owner?
My vote goes to the spambot.
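For the hex color reading, the six digits can be split into red, green and blue channels. A small sketch, where `hex_to_rgb` is a hypothetical helper name:

```python
def hex_to_rgb(code: str):
    """Split a six-digit hex color into its (red, green, blue) channels."""
    value = code.lstrip("#")
    if len(value) != 6:
        raise ValueError("expected six hex digits")
    # Each pair of hex digits is one channel in the range 0-255.
    return tuple(int(value[i:i + 2], 16) for i in (0, 2, 4))


print(hex_to_rgb("315475"))  # (49, 84, 117)
```

Blue is the strongest of the three channels, so read as a color the value is a dark, muted blue.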

