in Home

something about “55000 hacked twitter accounts”

This week, seen in several internet news portals the announcement of “hacking” 55000 twitter accounts. This kind of news tend to attract attention of the net and demonstrate the fragility of this. I like to spend some type observing the data published and speculate a bit.
Yes, after concatenate the five parts of pastebin, we have 58970 records. But after this and check how many of them are unique, only 36998 are. So, until here, the announcement was not successful.
We can consider the password list a Corpus and with it, useful to linguistic analysis and text mining.
First, the most popular passwords:
Screen_shot_2012-05-11_at_10
Order Unfiltered	wordcount	 Occurencies
1.		315475		580				1.5573
2.		123456		489				1.3130
3.		123456789	176				0.4726
4.		102030		68				0.1826
5.		123	        61				0.1638
6.		12345		53				0.1423
7.		1234		45				0.1208
8.		101010		29				0.0779
9.		242424		28				0.0752
10.		12345678	27				0.0725
11.		010203		24				0.0644
12.		1234567		24				0.0644
13.		654321		23				0.0618
14.		123123		21				0.0564
15.		121212		19				0.0510
16.		123321		16				0.0430
17.		sexo		16				0.0430
18.		com	        16				0.0430
19.		1234567890	15				0.0403
20.		junior		14				0.0376
21.		gay	        14				0.0376
22.		gatinho		13				0.0349
23.		124578		13				0.0349
24.		0123456789	13				0.0349
25.		12345678910	12				0.0322
26.		brasil		12				0.0322
27.		123654		12				0.0322
28.		202020		12				0.0322
29.		159753		12				0.0322
30.		adm15575	11				0.0295
31.		212121		11				0.0295
32.		151515		11				0.0295
33.		2008		10				0.0269
34.		1010		10				0.0269
35.		252525		10				0.0269
36.		fernando	10				0.0269
37.		redeglobo	10				0.0269
38.		112233		10				0.0269
39.		eumesmo		10				0.0269
40.		000000		10				0.0269
41.		987654321	9				0.0242
42.		141414		9				0.0242
43.		123mudar	9				0.0242
44.		789456		9				0.0242
45.		696969		9				0.0242
46.		ricardo		9				0.0242
47.		thiago		8				0.0215
48.		alexandre	8				0.0215
49.		eduardo		8				0.0215
50.		456789		8				0.0215
Very interesting, then:
  • The first 16 password are numbers.
  • The first most popular password is 315475 (a mystery to evaluate)
  • The first word is “sexo”.
  • Passwords seem to predominate in portuguese (in popularity)
What is 315475?
  1. The phone prefix of Syracuse, NY (USA).
  2. One hexcolor? 580 persons love the blue?
  3. A common password from a spambot owner?
Screen_shot_2012-05-12_at_4
My vote for the spambot.

Write a Comment

Comment