Blogs I Visit Regularly

Here are the 5 blogs that I visit regularly, in alphabetical order:

Aaron Wall’s SEO Book
This blog is focused on the search engines and the SEO world. Aaron usually makes some pretty good points.

Danny Sullivan’s Blog
Danny’s personal blog, where he talks about search engines as well as anything that comes to his mind.

Jensense – Making Sense of Contextual Advertising
This is a must read for anyone involved in Adsense, YPN, or any other similar programs. Jennifer Slegg talks about major happenings with these types of programs. Sometimes I find out about these changes through this site first!

Search Engine Watch
Search Engine Watch has a wealth of information on what’s going on in the search engine world. One nice feature I like is the “Search Engine Forums Spotlight”, where it lists the topics of interest from the various search engine forums.

SEO Black Hat by QuadsZilla
This blog talks about some of the more, shall we say, “interesting”, ways of doing SEO (it’s called SEO Black Hat for a reason, right?) You may or may not agree with some of the techniques discussed there, but it’s always good to know what others are doing.

Research in Search Engine Labs

There is an article on Search Engine Watch that talks about the research going on in major search engine labs. Below is what each search engine lab is focusing on:

Microsoft adCenter Labs

  • Paid search
  • Contextual ads
  • Behavioral targeting
  • Emerging markets

Yahoo Labs

  • How to know what to believe
  • Trust models online and propagation
  • What makes communities thrive and wither
  • Tagging images and videos, and sharing these file types

Google Labs

  • Image processing
  • Fact extraction
  • Statistical machine translation

Out of all the above, the only one that gets me excited is Google Lab’s image processing. I’ve always thought that the tagging mechanism currently used for searching for an image (or video, or other media types) is only a temporary solution. The real end goal is to be able to search an image based on what the image looks like (show me pictures that look very similar to this image). The same thing with goes with video (based on what’s said) and audio (based on what’s said and the action).

There are several companies working on voice recognition and image mapping. One or two of them will make it big.

Number of Browser Bugs

According to Symantec’s twice-yearly Internet Security Threat Report, the number of bugs found by browser during the first 6 months of this year is as follows:

Mozilla browsers: 47
Internet Explorer: 38
Safari: 12
Opera: 7

Interesting that Mozilla browsers had more bugs than IE. I wonder if it’s because Mozilla is gaining in popularity, or if Microsoft is actually doing a better job in designing more secure software? Also, reading this report made me go download Opera right away. :)

To view the entire report, please click here.

Google Webmaster Tools: Part 3, Statistics Reports

This blog talks about the reports available under the Statistics tab in Google Webmaster Tools. The 4 main sections are: Query stats, Crawl stats, Page analysis, and Index stats. Let’s take a look at each one below:

Query stats

This page displays two main reports:
1) Top search queries (queries that most often return a page from your site), and your site’s position for each query.
2) Top search query clicks (queries that generated a click to your site), and your site’s position for each query.

You can also drill down by search time (web, image, etc) or search location (google.com, google.co.uk, googl.ca, etc — note that does not represent where your user is coming from. It simply indicates which Google search engine the user used).

Crawl stats

This report shows PageRank distribution of the pages in your site, as well as which page has the highest PageRank. This is a functionality that I can do without — Google should either provide PR information on all the pages, or just get rid of this report.

Page analysis

This page has two sections: Content, which shows the type of documents Google found on your site, as well as the distribution of language encoding for the pages on your site. Common Words, which shows the words most commonly found on your site, as well as the anchor text most often found in links pointing to your site.

Index stats

This is simply a list of commands that you can use to find more information about your site. For example, site:www.yoursite.com can be used to find the indexed pages from www.yoursite.com.

Google Webmaster Tools: Part 2, Diagnostic Reports

Once you have set up your site with Google Webmaster Tools, you’ll be able to view two types of reports: Diagnostic Reports and Statistics Reports. In this post, I’ll review the information available in the Diagnostic Tab.

Under the Diagnostic tab, there are three main sections: Summary, Crawl errors, and Tools. Let’s take a look at each one below:

Summary

The summary page showing the following:
1. Whether pages from your site are included in the Google index.
2. The date Googlebot last successfully accessed the home page of your site.
3. Whether you have submitted a sitemap to Google.
4. Crawl errors Google found, including:

  • HTTP errorsĀ
  • Not found
  • URLs not followed
  • URLs restricted by robots.txt
  • URLs timed out
  • Unreachable URLs

This error report is useful in helping you identifying incorrect links to your site, especially from internal links.

Crawl errors

The Web crawl report under Crawl errors shows any crawl errors in more detail. The Mobile Web report shows any crawl errors for your mobile site in CHTML, WML/XHTML.

Tools

robots.txt analysis: This report shows whether Google found a robots.txt file in your site. You can also experiment changing the content of robots.txt file and see how that affect Google’s crawlers.

Manage site verification: This report displays information webmasters need for verifying that they are indeed the owner of the site.

Preferred domain: Google allows you to specify whether you want Google to think www.sitename.com and sitename.com are the same. This is the one functionality that I think is the most valuable for Google Webmaster Tools. Since you have no control on how other people link to your site, you’ll want to make sure that Google knows that links to www.sitename.com and sitename.com are the same (this should usually be the case). This way, your site can get full credit for all the incoming links.

Google Webmaster Tools: Part 1, Setting Up

In this post, I’ll talk about how to set up your site in Google Webmaster Tools. In subsequent posts, I’ll look at the reports available in Google Webmaster Tools.

To add a site to Google Webmaster Tools, do the following:

1. Go to http://www.google.com/webmasters/sitemaps.

2. Login with your Google account.

3. Type in your site (starting with http://) into the text box and click on the OK button.

4. Google will show you some initial information it has on the site, such as whether pages from this site are included in Google’s index, and the date Googlebot last accessed your home page.

5. Click on the “Verify your site” link to verify that you are the owner of the site.

6. There are two ways to verify: Add a meta tag to the site’s homepage (Google will tell you what the meta tag should look like), or upload a HTML file to the site’s root directory (Google will tell you the file name to use). Choose your method, and you’ll be given directions to set up properly.

7. Once you have either added a meta tag to the site’s homepage, or upload a HTML file to the site’s root directory, you can click on the “Verify” button. You don’t need to do this right away — you can always come back later when you are ready.

8. You may also submit a sitemap to Google. To add a sitemap, click on the “Add a Sitemap” link for the site after you log in to Google Webmaster Tools.

9. Next, select whether you are submitting a regular web sitemap or a mobile sitemap.

10. Specify the location of the sitemap in the textbox.

11. Before you click on the “Add Web Sitemap” button, you’ll then generate a sitemap for your site. A simple way to generate a sitemap is covered in an earlier article titled Creating a Simple Google Sitemap. Once it’s generated and uploaded to your site, click on the “Add Web Sitemap” button.

12. That’s it. In the next post, I’ll take a look at the reports you can see in Google Webmaster Tools.

Yahoo Site Explorer

Yahoo Site Explorer (http://siteexplorer.search.yahoo.com) is a service provided by Yahoo that shows what the Yahoo search engine knows about your site, specifically which pages are indexed, and the number of inbound links. If you register, you can submit a feed to Yahoo to ensure that Yahoo knows all your pages.

The most useful part of Yahoo Site Explorer is the number of inbound links. As we all know, the link: command provided by Google is notoriously inaccurate, and it appears to me that the count returned by Yahoo Site Explorer is more reliable. In addition, Yahoo has made it easy to exclude inbound links from the same domain or subdomain. This is helpful as webmasters often want to know both the total number of inbound links, as well as the inbound links from external sites. It’s true that the above information was already available on Yahoo before Site Explorer, but it is much easier to get at the information now.

One nice feature about Yahoo Site Explorer is that it’s easy to explore different URL’s. For example, as you are looking through all the pages that link to a particular site, a “Explore URL” button appears as you mouse over each page. You can click on that button and instantly get the information for that page. That was very convenient for me.

I also authenticated my site (you’ll need to use your Yahoo ID) to see what additional information I can get. I found out basically the only benefit of authenticating your site is that you can send a feed (basically a sitemap) to Yahoo, and Yahoo tells you when its crawler last accessed that feed. This is less than what I was hoping for. For example, Yahoo does not tell you it cannot find a page listed in the feed, nor does it tell you how many times your site was clicked on from within Yahoo Search.

Overall, Yahoo Site Explorer is a useful product for users to find out more information about a site/page. As a webmasters tool, it lags behind Google’s Webmasters Tool product. I would recommend that you authenticate your site through Yahoo Site Explorer so that you can submit your feed to ensure Yahoo picks up your new pages, but that’s pretty much it.

Comparing Major Search Engines

When it comes to the complexity of search engine algorithms, it is known that MSN is the least sophisticated, Yahoo (Inktomi) is better than MSN, and Google is the most advanced. With that in mind, it is slightly surprisingly to me that I’d find the following rankings for my SQL Tutorial site:

Query Term = SQL Tutorial
Google rank: 3
Yahoo rank: 9
MSN rank: 12

Note that those rankings are all coming from the .com version of the site.

The relative rankings were somewhat unexpected because I had applied all the basic SEO techniques to this site, which should lead to the site ranking well on MSN and Yahoo, which focus more on page content than Google does. But as you can see, this is not the case.

I ran across an article by Aaron Wall of SEO Book.com on search engine relevancy, which shed some light on this matter. Aaron pointed out that Yahoo and MSN results tend to favor commercial sites, while Google favors information/content sites. As my site is clearly content-oriented (confirmed by doing a search at Yahoo Mindset), this explains why it ranks better on Google than on Yahoo and MSN.