Archive for the ‘Computer Generated Content’ Category

Aaron: It’s not Spam, It’s a “Newsmaster Site”

Aaron, Aaron. You still don’t know the difference between a spam site and a newsmaster site?

While they may appear similar, they are actually quite different.

Since you’re having trouble with it (and you ARE an SEO expert), no doubt some of my other readers are struggling with it as well. So for all of you, I’ll give you this infographic cheat sheet:

spam-vs-newsmaster

It’s pretty clear to me.

Also note: If you spout off bullshit about purple cows or you have already made millions selling off your steaming pile of shit web “business” during a bubble, then from that point onward, you are incapable of making a spam site. The sociopaths over at Google are too starstruck to ever consider anything you do “spam.”

Hope that helps!

-q

Q&A: How Does Google Treat Duplicate Content?

Reteep Asked:

“How much of a problem is the duplicate content stuff for the bottom layer [of] autogenerated sites? Does it matter?”

Duplicate Content is one of the great boogie men of SEO. So many people are scared of it; so many people are worried that if they have duplicate content their site will face a ranking penalty. Is it true? Does Google Penalize you for having Duplicate Content? If so how much? How can duplicate content hurt you? How can it help you? Well sit back, because although I’m sure it’s been answered before, I’ve gonna give you the straight dope on duplicate content and Google.

First Off; what do we mean by “Duplicate Content”? Duplicate content means that the text of one web page matches another page’s. The text matching does not need to be 100% to be considered duplicate content. Matching can be less than 50% and still be considered duplicate especially if various chunks of content can be found on other pages.

For Example: Any site that runs AP stories will have heaps of duplicate content. Google doesn’t penalize the news site for running the stories, but unlike last year, now all (or almost all) of the AP stories are hosted on Google. Examples:

http://www.google.com/hostednews/ap/article/ALeqM5g8-DEMtAE9q4i4ySQ0eV_qZefmRQD99D0RV80

http://www.google.com/hostednews/ap/article/ALeqM5iwJhPuY4ndVAdfJgwbiS3uh7uIGgD99CJOBO0

Interestingly, When I google “AP Interview: Hayden denies Congress not informed” The top 10 results:

SERPs for AP Stories

Include Yahoo at number 1 and Google’s own story in the Top 10. Long term, expect Google to put itself at number 1 for all these types of queries.

The benefit to running an AP story is that if you can rank for the query (like Yahoo did), you can get search traffic. Plus there’s a chance when those stories run on XYZ newspaper site (or your site) that the story will get picked up by a slashdot, or a Digg, or Fark or whatever and receive a few hundred links. The only negative the newspaper sites could get from running these stories is by diluting their internal link juice by linking internally to these stories.

Clearly, Google does not penalize trusted sites for having duplicate content. In almost all cases, your Penalty in Google is NOT because you have duplicate content. 99.9% of the time, the problem is somewhere else.

The way that duplicate content can hurt you would be if you have multiple copies of the same story on your website. You will not get a “penalty” from Google but you will dilute both your internal link juice and could potentially split any natural inbound links (and therefore ranking power) among all the copies of the page. This type of dilution could take an item that would have ranked and banish it to obscurity.

The same applies for spam sites. Unless your site is screaming “I AM A SPAM SITE”, the duplicate content penalty is not gonna hit. And since you would have gotten hit anyway by that human reviewer that just marked your site (and Network, if you weren’t careful) as spam, we can safely say that there is no official Duplicate Content Penalty at Google.

Fire away with more questions; don’t worry, I intend to tackle some of the other ones that were asked this weekend later this week.

Auto Complaint Generator: Comment Spammers

The Complaint letter Generator can be used to create ramblings complaints about anything. These complaints, and ones like them, are making the rounds and will soon be in a comment spam near you. Because, well . . .

Unless you want to accumulate a long list of examples of Comment Spammers’s acts of corruption and depredation, this letter may become a bit monotonous. However, I unmistakably do hope you read it all the way through because Comment Spammers, who has posed as Savior of the World, is nothing else but the world’s seducer, its destroyer, its incendiary, and its executioner. What follows is a call to action for those of us who care — a large enough number to make a genuine contribution to human society. I frequently talk about how Comment Spammers’s pranks are precisely the kind of thing that will give lunatics control of the asylum by next weekend. I would drop the subject except that it says that a knowledge of correct diction, even if unused, evinces a superiority that covers cowardice or stupidity. That’s a stupid thing to say. It’s like saying that diseases can be defeated not through standard medical research but through the creation of a new language, one that does not stigmatize certain groups and behaviors.

I am deliberately using colorful language in this letter. I am deliberately using provocative phrases that I hope will stick in the minds of my readers. I do ensure, however, that my words are always appropriate and accurate and clearly explain how the point at which you discover that whenever a will-o’-the-wisp of Chekism, however unreal, turns up anywhere, Comment Spammers is off at a trot is not only a moment of disenchantment. It is a moment of resolve, a determination that no one has a higher opinion of it than I, and I think it’s a bleeding-heart hotheaded-type. If Comment Spammers thinks that divine ichor flows through its veins, then it’s sadly mistaken. When was the last time you heard Comment Spammers mention that its animadversions are merely a sideshow exhibit in the circus of narcissism? Probably never. That’s why if you read its writings while mentally out of focus, you may get the sense that Bulverism forms the core of any utopian society. But if you read Comment Spammers’s writings while mentally in focus and weigh each point carefully, it’s clear that as long as the beer keeps flowing and the paychecks keep coming, its shills don’t really care that it is entirely gung-ho about obstructionism because it lacks more pressing soapbox issues.

By comparing today to even ten years ago and projecting the course we’re on, I’d say we’re in for an even more unsympathetic, insensitive, and feebleminded society, all thanks to Comment Spammers’s half-measures. Well, sure; Comment Spammers’s crotchets are a masterpiece of uneducated classism, but that doesn’t change reality. It’s easy for us to shake our heads at Comment Spammers’s foolishness and cowardice. It’s easy for us to exclaim that we should straighten out Comment Spammers’s thinking. It’s easy for us to say, “Comment Spammers’s subliminal psywar campaigns are a vehicle for the expression of prejudice, ignorance, and enmity about people who are different from Comment Spammers.” The point is that it’s easy for us to say these things because Comment Spammers’s reason is not true reason. It does not seek the truth but only neo-damnable answers, prolix resolutions to conflicts. Lastly, I can’t end this letter without mentioning that we ought to teach Comment Spammers a lesson.

Works pretty well: but the word letter may have to be changed to “comment”.

The Content Solution – Free For the Next 3 Weeks

From the creator:

Sure thing, you can download a fresh copy It’s completely free for
the next 3 weeks or so, to EVERYONE. We’ve had an absolute horde of downloads in the past few days (who can
say no to free? 🙂 and the response to the new interface and features has been really good.

So give it a look and if you want to do a post on it, the sooner the better as we’re still in the free period where anyone can try it for nothing.

Scott

The price is right – might as well grab a copy while it’s still free.

RSS Magician

RSS Magician just launched with the ability to combine and rewrite RSS feeds on the fly to produce new feeds.

The result will be a feed that will contain a maximum of 10 FULL articles for one keyword at a time, from one of the articles directories. This is NOT like using the regular RSS feed that the article directories give you. By and large, their feeds will only give you a tidbit of the article to update your site with. RSS Magician allows you to grab the full article. You can use them on your sites, blogs or any other way you want (while of course complying with the free article’s terms of service).

RSS Magician also includes on the fly language translation, word substitution based on a synonym library, adding random text and sentence shuffling. While most people serious about BlackHat SEO will code their own content generation programs, at only $49 for the package, RSS Magician is a great way to get your feet wet without putting up much cash.

You can check out the details of RSS Magician here.

And as a bonus for SEO Black Hat Forum members, we’ve worked out a $10 discount on RSS magician so you can pick it up for only $39. The discount code is in this thread.

Happy Spamming!

Google Torpedoes Navy: You Sunk my Battleship!

You sunk my battleship

This is the story of Bill the Navy Guy. Bill learned a battle plan that worked like a charm:

(computer spun content) + (aged domains) + (Adsense) = $ Money in the Bank $

Bill worked his way up where he was purchasing hundreds of domains per day and was hosting thousands of domains across more than 65 dedicated servers. He was pulling in more than $60,000 per month from Google Adsense.

$60,000+ per Month . . . on 1 Adsense account.

Here’s the kicker: All his content across his entire Network of thousands of sites was spun from a total of about 45 sentences. So you could Google one of his sentence fragments and find his entire Network.

What could possibly go wrong here?

The curator of a Western art museum commissioned a local artist to paint a mural-sized painting of Custer’s Last Thought. The artist was told to make it highly symbolic of Custer’s mindset during the debacle at the Little Bighorn. Deep in thought, the artist went to his studio. After many false starts, he proceeded to paint an enormous oil painting.

Finally, after months of work, the opening of the exhibit drew a crowd of people for the unveiling of this, the centerpiece of the exhibit. The canopy came off revealing the artist’s large painting:

In the foreground was a beautiful crystalline blue lake with a single fish leaping. Around the fish’s head is a halo. In the background, the hills and meadows are covered with naked Native American couples copulating.

The curator was both disgusted and baffled by what he saw. In a rage he turned to the artist and asked, “What the hell has this got to do with Custer’s Last Thought?”

The artist replied, “It’s simple. Custer’s last thought must have been, ‘Holy Mackerel! … Where did all these fucking Indians come from?'”

Unlike General Custer to the Indians, Google had an answer to Bill the Navy Guy and Torpedoed the Admiral’s lone Adsense account: $60,000+ a month revenue to $0. Overnight.

Google Battle Lessons

War Buffs and Search Engine Spammers alike will want to learn from Bill the Navy Guy’s critical mistakes. Here are some of the lessons we can glean from Bill’s defeat:

1. Diversify and protect your supply lines. Redundancy is the law of the military; you never want a single point of failure. In fact, that’s why the DoD invented the Internet. Even if you love Adsense, you should never have all your sites under the same Adsense code. Multiple Adsense Accounts + other PPC Engines + Affiliate Revenue makes it so that a hit to any one of your supply lines does not cause a total system failure.

2. Troop Diversification. Did Alexander the Great have just light cavalry? Did William Wallace have only pikemen? Does the US Navy have only submarines? No! By spinning all his content from the same 45 sentences, Bill the Navy Guy went into battle with all the same type of troops. Don’t make that same mistake; build your sites from multiple content sources.

3. Never give the enemy your battle plans or the locations of your troops. You think the landing at Normandy would have been as successful had we sent Hitler the battle plan and troop locations 2 weeks before the invasion? By using the same Adsense account and leaving such huge footprints, Bill the Navy guy might as well have sent Google this email every day:

To: Google Spam Team
From: Bill The Navy Guy
Subject: Please Ban Me

Dear Google,

I am spamming your index and using Adsense to monitize. Please ban my sites and Adsense account at your earliest convenience. Here is the complete list:

[list of every site]

-Bill the Navy Guy

PS: Don’t Give up the Ship

Even if you’re not an SEO Black Hat, there’s a good reason not to use the same Adsense account across your network of sites. Let’s say you want to interlink sites in your 300 site network. With all your sites on the same Adsense account, Google can and will discount the value of intra network links. Instead of a potential 299 quality backlinks, you’re probably looking at a value closer to several on site links (closer to Navigation links).

Don’t lose the War do to tactical mistakes: Get multiple Adsense accounts and diversify your online income to other sources.

The Most Cutting Edge SEO Exploits No One is Publishing

You know that the best SEO Black Hats are doing something more than scraping, using a site generator, comment spamming, and pinging to be raking in more than $100k per month.

But what is it?

Right now, there is way too much good stuff that I simply can’t publish on the SEO Black Hat blog. If I posted these tactics and exploits they would immediately get all the wrong kind of attention. The detailed conversations about how exactly to abuse search engine algorithms, generate massive traffic, and what other Black Hats are doing must remain underground to retain their effectiveness.

But what if I told you that you could discuss these exploits with me without paying my $500 an hour consulting fee? What if I told you there was a way to join in on the private, cutting edge discussions with some of the best Black Hats and web entrepreneurs in the world?

Would you be interested?

Because now you can . . .

Today is the official launch of the resource you’ve looked everywhere for but never found:

The Private SEO Black Hat Forum

Normally what you get on forums are people who don’t know anything talking with people who don’t want to say anything. You can occasionally find amazing tips on some forums: but you have to dig through 400 crappy posts just to find one post that is useful. That becomes a huge time sink.

How are the SEO Black Hat forums different?

Quality: We’re not going to have any contests to see who can make the most posts. That just creates tons of crap that no one wants to read. Our focus is on quality over quantity. Our primary concern is with succinctly answering one question: “What works?”

Sophisticated: Many of the topics we discuss are very advanced and require a high level of technical or business acumen to appreciate.

Expert Discussions: The SEO Black Hat forums are not for everyone and they may not be right for you. If you are relatively new to SEO or building websites, then do not join the SEO Black Hat Forums: you will be in way over your head. There are plenty of newbie forums out there for you – this is not one of them. Our forums are for successful web entrepreneurs to develop strategies that drive more traffic and generate more revenues.

Forum Membership Benefits

Access to Expert Advice and Discussions
We have both White Hat and Black Hat Experts that are already benefiting from new tool development, techniques, scripts and the sharing of ideas.
Some members you may already be familiar with include:

* CountZero from blackhat-seo.com (Black Hat)

* RSnake from ha.ckers.org (Web Security Expert)

* Dan Kramer from Kloakit (Cloaking Expert)

* Jaimie Sirovich from seoegghead.com (Token White Hat / SEO Geek)

There are several other members that you are certainly familiar with who are using handles for anonymity. We have others who are more focused on security, vulnerabilities, and coding. There are still more that you are likely unfamiliar with but are nevertheless web millionaires.

Databases – Large Datasets
If you want your sites to have massive amounts of unique content you need large data sets. The trading, discussion and posting of large data sets is going on right now on our forums.

Expired / Deleted Domain Tools
Want to use to use the same domain Tool that I used to get a Page Rank 6 site in the Gambling Space for just $8? This domain tool is available for members to use for free.

50% off on Kloakit – The Professional Cloaking Software

Scripts – Several useful scripts have already been posted – interesting thing you may not have thought of before are being discussed and developed.

Exploits and Case Studies: The really good stuff I can’t talk about on the SEO Blackhat Blog is being discussed on the SEO Black Hat Forums. Right now, some of the conversations include beating captchas, domain kiting, data mining, hoax marketing, XSS vulnerabilities as they relate to SEO, and much more.

Pricing: $100 per month.

The price will soon be rising significantly as more databases, hosted tools, scripts and exploits are added. However, once you lock in a membership rate it will never go up and you will continue to have access to everything.

So, if you think you’re ready for the most intense Black Hat SEO discussions anywhere, then here’s what you need to do:

1. Register at the SEO Black Hat Forums.

2. Go to the User CP and select Paid Subscription.

I’ll see you on the inside!

Every Search Engine Spammer Needs to Know…

Last Thursday, the boys at the ‘plex announced that they would be releasing 10 gazillion keywords for statistical analysis and other research. That perked my ears up right away. We love large data sets because they are the cornerstone of building massive spam sites targeted niche aggregators.

The fine print is that you have to jump through some hoops to get the data – details are to be released, but you will likely have to be a member of the L.B.C.


“So tell me wuts up wit dis LBC thang?”

Wait . . . make that the LDC, the Linguistic Data Consortium. Their annual membership is $20k and they sometimes make you pay more for certain data sets.

The almost invisible print is pointed out by greywolf and confirmed by Matt Cutts in this threadwatch discussion.

When people sell a mailing list it’s extremly common for sellers to seed the list with some names that only exist for the purpose of catching people who are misusing it. I would have to assume the boys and girls at the plex would do the same. – Greywolf

graywolf, you have a devious, devious mind. How many other people would consider seeding the terms with some nonsense phrases? I ask you–how many other people would come up with an idea like that?

Well, I guess I can think of a couple people.. – Matt Cutts

graywolf, yes you should take it as a compliment. Not to worry, I’m familiar with the practice. My favorite is Lye Close, the fake street in London: http://wiki.openstreetmap.org/index.php/Copyright_Easter_Eggs

billhartzer, sshhh. I was just watching boogybonbon find out about “google monitor query or googletestad” today. Don’t ruin the fun. – Matt Cutts

referring to boogybonbon’s post on keyword research.

Trap admiral akabar from star wars

That’s right, it’s a trap.

We know about poisoning seasoning keyword lists – in fact sometimes we’ll do it ourselves. However, this exchange confirms what a few of us have been thinking all along – that the search engines are on to this tactic and use it as well.

Are you using wordcatcher, overture, the google keyword suggestor or any data directly from the search engines? It seems there’s a good chance that it could be a trap. If you’re using poisoned data, that could certainly explain why your sites are only lasting 6-9 weeks in the SERPs.

Understanding this kinda puts a damper on the 400+ meg file (update:mirror with data)that contains all the AOL searches of 500k users for the last 3 months.

“Jacta alea est!” – Julius Caesar

It’s a war. Develop your own supply lines so you don’t have to get food from the enemy.

Few lines of code: thousands lines of content

The simplest way to create pure orignal spam content, that can bring you millions of visitors if you are using this properly.
The code is really, really simple. You need:
1) webserver, supporting php / or in advanced mode – php+mysql
2) text file with words (you can just copy and paste 100 articles in this file)
3) little technical hand to adjust the code to your specific needs.

The script generates 2 strings:
– title (or the main random keywords, for title, meta keywords, description etc)
– the content (random words)

For 5 minutes this can be integrated into your database – just create a table, name it whatever you want,
add 3 fields (ID, title, content) and you are ready to gamble on the SEs 😉
The coolest thing is that actually the SEs will see the same text when they reffer to specific ID, so the only thing you have to do is some basic on-page seo.

Here is the code
< ?php #define the text file #we will get the words from $content_file = "content.txt"; #split the string, using each space as a pointer $words = split(" ", join("", file($content_file))); srand((double) microtime()*1000000); #define how many words can #be in the title (actually, #thats the string we are going to optimize for) $bh_title = rand(4,13); $bh_real_title = " "; (more…)

The Long Tail of a Black Hat

Chris Anderson’s Long Tail theory:

our culture and economy is increasingly shifting away from a focus on a relatively small number of “hits” (mainstream products and markets) at the head of the demand curve and toward a huge number of niches in the tail.

Kevin Marks looks further down the tail and notes:

A true long tail business is one that copes with the ultimate niches – where there are just one, or even zero customers. You need to be sure that your submission model can cope with these limiting cases and not choke, especially as you do not know a priori which ones are going to garner customers. So, what businesses fit this model?

Answer: Black Hat Search Engine Optimization (AKA Search Engine Spamming).

Many SEO Black Hats create millions of pages over hundreds of domains using some form of automated website creation software. Each page is optimized for an obscure keyword phrase – perhaps one that is only searched for 50 times per month.

Just one of a Search Engine Spammer’s spam sites could have 30,000 pages. This site might receives just one unique visitor on less than 10% of the pages per day. Of those 3,000 visitors if only 2% (60) click on a 10 cent Adsense word, this site would generate $6.00 per day or $180 per month.

So does the smart search engine spammer stop at 1 or 5 site? Of course not. A true SEO Black Hat has hundreds (and in many cases thousands) of these sites operating at once.

As a black hat automates more steps in the site creation process, the time investment decreases dramatically. Recently, I spoke with a Search Engine Spammer who told me that he has automated EVERY step in the process from domain registration to Keyword selection, to new CSS design, to splog indexing. He claimed he can create and index a 50,000+ page website with less than a minute’s worth of incremental keystrokes.

As the cost of page creation, hosting and advertising (production and distribution) approaches zero, the number of customers required for a profitable search engine spamming business drops to much less than one per page (or micro-niche).

Search engine spamming is a business that “copes with the ultimate niches – where there are just one, or even zero customers.” It will therefore be interesting to see if Anderson devotes a few pages to Search Engine Spammers in his upcoming Long Tail Book.