TweepMe Data Extraction
So i was intrigued after writing my last post about TweepMe just exactly what someone would be letting themselves in for if they did sign up to the service, what accounts they could expect to see updates from etc.
Well, curiosity got the better of me so I threw some code together to scrape their pages and cross reference that with the relevant twitter page for the user and wandered away while it did it’s job. When I returned it had spat out a file with just shy of 2,000 people (probably more signed up since then, that was just how many was showing at time of scrape). For each person listed on TweepMe I retrieved:
- Twitter account name
- Name
- Location
- Website
- Following Count
- Followers Count
- Update Count
- Bio
I’ve had a quick spy through this file, and as i suspected it is very heavy on the following:
Product / services accounts – radio stations etc – aka you’re going to be spammed.
Massive following – just wanted more followers, that simple.
SEO and “Social Media guru” etc.
~250 accounts with < 10 followers looking to beef their numbers.
And worryingly, the top 5 accounts alone are responsible for 128,000 tweets since their inception, thats some serious time line flooding!
Having a look for one of my interests, a quick scan for ‘photo’ turned up only 87 tweeps mentioning that in their data, not a high ratio at all for my liking.
If you want a spy through the data yourself, you can grab a copy here:
I’m keen to hear anything interesting that anyone might turn up, so please do comment if you find anything interesting about the data.
~Shepy
UPDATE: After a tweet from @AlohaArleen, who presumably has something to do with the site, I’d like to share a couple of tweets about this post, just to make sure there is no misunderstanding about this data:
AlohaArleen : @Shepy You can’t use data extraction on the TweepMe site. The pages do not show all the users! Not even an acculmanation! FAIL! #tweepme
Shepy: @AlohaArleen How is it a fail, i scraped what was available, i never said it’s exhaustive, infact i said it wasnt. Defensive much? #tweepme
Shepy: @AlohaArleen Those accounts are reg’d, and so the data does give an accurate display of some of the accounts expected to follow. #tweepme
| This entry was posted by Shepy on March 17, 2009 at 6:39 pm, and is filed under Computers, internet, news, Twitter. Follow any responses to this post through RSS 2.0. You can leave a response or trackback from your own site. |
about 2 years ago
“Kick things off by filling out the form below.”
Okay, here we go!
Unfortunately AlohaArleen is a Social Media Influencer. She is Following over 55k and has over 55k Following her. That means that one Tweet from her about #TweepMe, which there are many at the moment, is going to cause that Twojan to viral out of control. I’d say it is getting closer as the clock ticks. And then the creator says it hasn’t started yet and will begin on the evening of St. Patricks Day 2009-03-17?
I’d like to ask AlohaArleen exactly what her relationship is with the #TweepMe service. Oh, and she may also want to correct that TweetMe typo in the article, I don’t think they would take kindly to the reference in this instance.
The #TweepMe service is a plague for Twitter. It is really no different than one of the old school chain mail routines. It functions pretty much the same way with some added twists.
Yes, I have signed up and tested the interface. Did you know that the developer designed it in a way that most Twitter users will click the register button not knowing what the hell they are about to do? Thousands have clicked that button and have sent one of two promo Tweets into the Twitter Timeline (43 since I started writing this). There are no privacy policies, nothing. FAIL!
This is what happens when you have children playing with things they shouldn’t be. Maybe that boobr should have done just a little more research into what most Twits prefer and not assume that everyone wants to be a part of some mass Twitter Chain Following. What a cluster this guy started.
Cancel your account and get out of the chain before it is too late. Maybe Twitter will bring the hammer down on all those promoting that #CrapApp.
about 2 years ago
Wow! All the references to this Blog post and not another comment besides mine? What gives with all you Social Media folks? Are you afraid to type more than 140 characters on someone’s Blog? Get with the program! ;)
We can add this to the mix…
TweepMe – Twitter Self Replicating Human Virus
http://www.SEOConsultants.com/twitter/tweepme/