Search Engine Tricks
  Home arrow Search Engine Tricks arrow Top Reasons Why You May Not Be Indexed
Affiliate Promotion  
Blog Help  
Domain Name Tips  
How To  
Newsletter Marketing  
Online Business Help  
Search Engine Tricks  
Web Development  
Web Hosting  
Website Advertising  
Website Content  
Website Marketing  
 Webmaster Tools
 
Base64 Encoding 
Browser Settings 
CSS Coder 
CSS Navigation Menu 
Datetime Converter 
DHTML Tooltip 
Dig Utility 
DNS Utility 
Dropdown Menu 
Fetch Content 
Fetch Header 
Floating Layer 
htaccess Generator 
HTML to PHP 
HTML Encoder 
HTML Entities 
IP Convert 
Meta Tags 
Password Encryption
 
Password Strength
 
Pattern Extractor 
Ping Utility 
Pop-Up Window 
Regex Extractor 
Regex Match 
Scrollbar Color 
Source Viewer 
Syntax Highlighting 
URL Encoding 
Web Safe Colors 
Whois
 
Forums Sitemap 
Mobile Linux 
APP Generation ROI 
IBM® developerWorks 
Weekly Newsletter
 
Developer Updates  
Free Website Content 
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
SEARCH ENGINE TRICKS

Top Reasons Why You May Not Be Indexed
By: Developer Shed
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating:  stars stars stars stars stars / 0
    2003-08-09

    Table of Contents:

    Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT


    Top Reasons Why You May Not Be Indexed


    At one time or another you may have used a submission tool, or submitted by hand and then wondered why you had not been indexed. Unfortunately, there are many reasons that may delay or prevent you from being indexed by a search engine. There's rarely one simple answer for why you're not being found. Fortunately, there is generally an explanation and a way to correct the problem if you know what to look for.

    Below are the Top 21 reasons we've compiled over the years as to why you may not be finding your Web site or Web page in one or more search engines:

    1. INDEX TIME: First, make sure you've allowed enough time to become indexed. The amount of time to allow is sometimes listed on the search engine's submission page. Unfortunately, the engine's own advertised times are often inaccurate or out of date. WebPosition's Submitter report and WebPosition's URL Verification report will both tell you how much time you should allow before being concerned about not being indexed. Average index times often range from one to eight weeks depending on the engine. Some engines like AltaVista and Inktomi offer paid options if you wish to be indexed more quickly.

    2. ALREADY INDEXED: Be sure you're not already indexed but just don't know it. Unfortunately, none of the major engines are kind enough to e-mail or notify you as to if and when you've been indexed.

    In addition, you cannot simply do a search on a keyword that applies to your Web site and expect it to pop up at the top. In fact, you must take pro-active steps to optimize your pages for each search engine. If you don't, it's very unlikely you'll find your Web site except on the most specific of searches.

    The method to determine if a page or domain has been indexed varies from one engine to another, and in many cases, it's difficult to tell for sure whether your pages are in fact in there. Never assume that you're not indexed just because you searched for a bunch of keywords and you never came up in the first few pages of results. You could be there (i.e., indexed) but be buried near the bottom.

    In addition, it's not very practical to check the status of a number of pages on each major engine each week. Fortunately, WebPosition Gold has a URL verification feature in the Reporter that makes this process much easier. Each time you run a mission, it will report which URLs exist and do not exist in each engine. If you're using WebPosition and are not finding your URLs after submitting, be sure to see this page for common pitfalls to watch out for:

    http://www.webposition.com/urlnotfoundhelp.htm

    3. MISSING PAGE: Make sure you have uploaded the pages to your site before submitting them. This one will seem obvious to many people, but submitting a page that does not exist or submitting with a subtle typo in the URL is a goof we might all make at one time or another. If you're using WebPosition's Submitter, there's a checkbox on tab 2 labeled "Verify that each page exists on Web site before submitting." This option defaults so that WebPosition will verify that all your URLs are valid and actually exist before submitting them. This is important since not all search engines will notify you if the URL does not exist when you submit.

    4. ROADMAP FROM HOME PAGE: Some engines have been known to drop pages that cannot be traveled to from the home page. HotBot has been rumored to do this. You may want to consider submitting your home page that links either directly or indirectly to your doorway pages. Think of your Web site as a series of roads (i.e., links) from one page to another. If there's no road from your home page to the page you want indexed, a search engine may decide the page is unimportant or of low-quality. You could submit the page directly, but the engine may reject it or may drop it at a later date when it finds no "road."

    5. EXTERNAL LINKS: Some search engines such as Google and HotBot have been known to refuse to index Web sites that do not have any other Web sites linking to them. Or, they may index your home page but refuse to index any other pages until you achieve at least one or more links from another domain. Or, they may index you for awhile but then "prune" their database later of all Web sites that did not achieve any external links within a certain period of time. However, do not worry! You simply need to establish some links and when that's done, resubmit both your pages and the pages that link to you. Once you have links to your Web site, it becomes much easier to get indexed, stay indexed, and to achieve top rankings.

    To check your link popularity and for tips on how to increase the links to your Web site see our free service at:

    http://www.linkpopularitycheck.com

    6. FRAMES: If you have content inside HTML frames, this can cause problems with submissions. For example, the search engine may index the main content of the page, but not the surrounding menu frame. Visitors to your site will then find some information but may not see the associated menu! It's generally best if you can create non-framed versions of your pages. You should then submit the non-frames versions of your pages, which can of course link to your framed Web site. Alternatively, you can enter your relevant text within the NOFRAMES area of a framed page that most search engine spiders will read. However, don't expect to achieve high rankings while optimizing the NOFRAMES area. Optimizing a NON-framed page will often achieve better results.

    7. SPIDER BLOCKS: Search engine spiders cannot index sites that require any kind of registration or password. A spider cannot fill out a form of any kind. The same rule applies regarding indexing of content from a searchable database. That's because the spider cannot fill out a form to query that database. The solution is to create static pages that the engines will be able to find and index without performing a special action on your site. Depending on the database system you have, there are utility programs out there that help you do this, as well as companies that can assist you.

    8. FREE SITES: Many engines no longer index pages from free web sites or they limit the number of pages they will index from these hosts. Sometimes they will get too many "junk" submissions from free web site domains such as Geocities or others. Therefore, some engines choose not to index anyone from some of these domains. Or, more commonly, they limit the number of pages they will accept.

    It's always best to buy your own domain name (very important) and place it on a respected, paid service to avoid being discriminated against. The free traffic you can generate from the search engines is just too valuable to be sacrificed for the small savings a free hosting service provides. In addition, free hosts are often unreliable or force you to display banners that send valuable visitors away from your Web site soon after arriving. That can cost you sales.

    9. GUILT THROUGH ASSOCIATION: If your Web site shares the same IP address as many other Web sites on your host's Web server, then you may find your IP quietly banned from something another Web site on the same server did! It's always best to ask your hosting service if your domain name has its own unique IP assigned to it. If not, ask them to move it to its own IP to avoid the potential of having your submissions ignored because of something that a site sharing your IP did. We've heard from many people who tried everything to be indexed only to find it was a snap once they changed hosting services.

    10. SUBMISSION LIMITS: Make sure you're submitting within the recommended limits. Some engines do not like more than a certain number of submissions per day for the same domain. If you exceed the limit, you may find that all your submissions are ignored. Fortunately, WebPosition's submitter will warn you regarding current limits and help keep you within them. Some submission consultants feel it is dangerous to submit more than ONE page a day to an engine for a given Web site. For those who wish to be ultra-conservative in their approach, the WebPosition Submitter includes a checkbox to limit submissions to one URL per day per engine.

    11. DYNAMIC PAGES: Dynamic pages are often ignored by the search engine spiders. In fact, any URL containing special symbols like a question mark (?) or an ampersand (&) will be ignored by many engines. Pages generated on the fly from a database often contain these symbols. In this situation, it's important to generate "static" versions of each page you wish to be indexed. In regard to the search engines, the simpler the page is, the better. Does this mean, for example, having a javascript to count visits to the page will prevent you from being indexed, or lower your rankings? No. It simply means that the search engine will most likely ignore the javascript and index the remaining areas of the page. There is evidence that going too far with fancy scripts and code on a page can hurt your rankings if the bulk of your page consists of java or VB scripts.

    12. NON-INDEXABLE CONTENT: It's important to know the types of content that the average search engine cannot index. Most engines cannot index text that is embedded in images. Text that appears in multimedia files (audio and video) will not be indexed. Most engines cannot index information that is generated by Java applets or in XML coding.

    13. LARGE PAGES: If your site has a slow connection or the pages are very complex and take a long time to load, it might time out before the spider can index all the text. For the benefit of your visitors and the search engines, limit your page size to 50K or less. In fact, most Webmasters recommend that your page size PLUS the size of all your images on the page should not exceed 50K-70K total. If it does, many people on dial up connections will leave before the page fully loads.

    14. DEEP LINKS: If you submit just your home page, don't expect a search engine to travel more than one or two links away from the home page or from the page that you submitted. Over time they may venture deeper into your site, but don't count on it happening quickly. You'll often need to submit pages individually that appear further down into your site or create more direct links from the home page (either visible links or hidden links). This way the search engines can find them. Visible links are preferable, but when that's not practical, there are methods to create links that won't be seen by the average visitor:

    http://www.webposition.com/hiddenlinks.htm

    The technique of submitting one page that then links to multiple other pages you want found is called creating a "hallway page." In many cases, this will not only get you indexed in cases where they are ignoring you, but it will often improve your rankings. That's because many engines assign "bonus points" to pages they find on their own versus pages that were specifically submitted to them.

    15. UNRELIABLE HOSTS: If your Web site fails to respond when the search engine spider pays a visit, you will not be indexed. Even worse, if you are indexed and they pay a visit when your site is down, you could be removed from their database! Consequently, it pays to have a reliable hosting service that is up 99.5% of the time. However, at some point a spider is going to hit that other 0.5% and end up yanking your pages by mistake. Therefore, it pays to keep a close eye on your listings and resubmit when needed.

    16. SPAM: If you have ever used any questionable techniques that might be considered an overt attempt at spamming (i.e., excessive repetition of keywords, same color text as background, or other things that the WebPosition Page Critic warns you about), an engine may ignore or reject your submissions. If you're having trouble getting indexed in the expected amount of time, make sure your site is spam-free.

    17. REDIRECTS: If your site contains redirects or meta refresh tags these things can sometimes cause the engines to have trouble indexing your site. Generally they will index the page that it is redirecting TO, but if it thinks you are trying to "trick" the engine by using "cloaking" or IP redirection technology that it can detect, there is a chance that it may not index the site at all.

    18. PROPER DIRECTORY SUBMISSIONS: If you're submitting to a directory site like Yahoo, Open Directory, Looksmart, or others, then a human being will review your site. They must decide if the site is of sufficient "quality" before they will list it. I recommend you read the submission guide on the directory tab of the WebPosition Submitter. It contains tips on how to improve your chances of obtaining a good listing in these directories. Getting listed in major directories first can help you get listed elsewhere.

    19. INDEX TIMES CAN FLUCTUATE: WebPosition will tell you the average index time of each search engine. However, this is only an average. Sometimes engines will index sites every 30 days fairly consistently and then suddenly stop indexing most sites for several months. This can be frustrating, but it does happen. Generally a major engine will not go more than three to four months without refreshing its index. If you're wondering if others are experiencing trouble getting indexed on a particular engine, try asking around. You can often find valuable real-world feedback on a search engine discussion forum such as the one at:

    http://www.marketposition.com

    20. PAGE LIMITS: If you have many pages indexed but are having a hard time getting new ones recognized, be aware that there are limits. Each search engine will only spider so many pages of your Web site. This may range from a few dozen or three or four hundred depending on the engine. Some people have even been successful in getting far more pages indexed depending on the engine. Google is one engine that tends to crawl deeper into your site. However, how deep they go may depend on factors like your link popularity. Sites with higher link popularity are deemed "worthier" of more thorough indexing.

    21. RANDOM ERRORS: Last but not least, sometimes the engines just lose submissions at random through technical errors and bugs. After all, they are managing a database of hundreds of millions of pages. Therefore, some people like to resubmit once or twice a month for good merit in case they do lose a submission. Certainly if you've followed all the "rules" and are still not listed, by all means, re-submit! Sometimes a little persistence is all that's needed.

    TIP: Once your page achieves a desirable ranking, it's best not to continue submitting it. You risk the engine re-evaluating the page and possibly reducing your ranking.

    If any of the above scenarios apply to your submission, you should take the appropriate actions and then re-submit. If that still does not work, you should consider e-mailing or calling the search engine and asking them politely why you have not been indexed yet. Sometimes they will reply back with "Sorry, there was a problem with our system and I've now made sure you'll be indexed within the next couple days." Or, sometimes they'll tell you why you were not indexed so you might correct it. In other cases, they will ignore your e-mail and you'll have to keep e-mailing or calling them until they respond. Still, it's definitely worth the effort to get your site listed with the major engines. This assumes you also take the time to optimize your pages so you'll achieve top rankings.

    Robin Nobles is the Director of Training of the Academy of Web Specialists, which teaches online training in search engine marketing. She is also a trainer with Search Engine Workshops, which presents on location workshops in search engine marketing at various locations across the country. Please visit our site for more information about online training and other resources.

    http://www.marketposition.com/

     


    DISCLAIMER: The content provided in this article is not warranted or guaranteed by Developer Shed, Inc. The content provided is intended for entertainment and/or educational purposes in order to introduce to the reader key ideas, concepts, and/or product reviews. As such it is incumbent upon the reader to employ real-world tactics for security and implementation of best practices. We are not liable for any negative consequences that may result from implementing any information covered in our articles or tutorials. If this is a hardware review, it is not recommended to open and/or modify your hardware.

    More Search Engine Tricks Articles
    More By Developer Shed

     

    IBM® developerWorks developerWorks - FREE Tools!


    Check out the new Jazz space on developerWorks

    <a href="http://zeus.developershed.com/shonuff.php?blackbird=3853&zoneid=442&source=&dest=http%3A%2F%2Fwww.ibm.com%2Fdeveloperworks%2Fspaces%2Fjazz%3FS_TACT%3D105AGY31%26S_CMP%3DDEVSHED&ismap="><img src="http://images.devshed.com/corp/img/news/jazz01.gif" alt="developerWorks Jazz space" align="left"></a>You've heard the buzz about Jazz... want to know more about it from a developer's perspective? Check out the Jazz space on developerWorks. This space is an up-to-date resource for developers, including technical information about Jazz and products built on Jazz, like Rational Team Concert Express. The Jazz space includes content from a wide variety of sources, including links, feeds, and comments from experts.
    FREE! Go There Now!


    NEW! Accelerating Software Innovation on i on Power Systems

    Attend this launch webcast with Scott Hebner, Vice President of IBM Rational Marketing and Strategy, for an overview of Rational’s new software offerings and resources to help modernize and accelerate software innovation on i on Power Systems – while ensuring past application investments are protected and continue to grow. Learn how these solutions are helping customers extend their core i5/OS solutions toward modern architectures such as SOA and web technologies to deliver business improvements that stand the test of time.
    FREE! Go There Now!


    NEW! Evaluate IBM Rational Software Analyzer V7.0

    Download a free trial version of IBM Rational Software Analyzer Developer Edition V7.0 to identify bug defects earlier in the software development cycle. Rational Software Analyzer is an extensible software development solution that reduces the expense of bug-fixes by enabling static analysis code reviews and bug identification very early in the development cycle.
    FREE! Go There Now!


    NEW! Improve your build process with IBM Rational Build Forge, Part 1: Create a continuous build and integration environment

    Learn how to implement a build management system that uses and extends your existing automation technologies. This tutorial shows, step-by-step, how to install and configure IBM Rational Build Forge to manage builds for Jakarta Tomcat from source code.
    FREE! Go There Now!


    NEW! Improve your build process with IBM Rational Build Forge, Part 2: Automate builds for a real-world Tomcat project

    Learn how Rational Build Forge can extend a simple compile and package build process by adding customization and deployment capability. Go from a manual method to automating: checking for code changes; getting the latest source; compiling and packaging; customizing; copying to and restarting a deployment server; and sending e-mail notification that a new version is available.
    FREE! Go There Now!


    NEW! Rational 'Talks to You' Teleconference Series

    This Fall, IBM Rational talks to you directly through a special teleconference series giving you access to the best minds in IBM Rational - product experts and market thought leaders who will answer your questions during these pre-scheduled telephone conference calls. Register today!
    FREE! Go There Now!


    NEW! Run your first CICS application on a PC using TXSeries for Windows

    Learn the basics of the IBM Customer Information Control System (CICS). With a hands-on exercise, learn how to get your first CICS application up and running on your desktop using TXSeries V6.1 for Windows. The tutorial shows you how to download and install a free trial version of TXSeries V6.1.
    FREE! Go There Now!


    NEW! The dirty dozen: preventing common application-level hack attacks

    As organizations have grown increasingly dependent on online software, the risk of malicious attacks has also become far more serious. Fortunately, well-governed organizations can protect their Web applications by injecting vulnerability assessments and ethical hacks into their software development and delivery processes. This paper describes 12 of the most common hacker attacks and provides basic rules that you can follow to help create more hack-resistant Web applications.
    FREE! Go There Now!


    NEW! Trial download: IBM Rational Method Composer V7.2

    Get a free trial download of the latest version of IBM Rational Method Composer V7.2 which helps you deliver customized yet consistent process guidance to your project teams and IT organization, and includes the latest version of IBM Rational Unified Process (RUP), which has provided process guidance to teams since 1996.
    FREE! Go There Now!


    NEW! Try the IBM SOA Sandbox for People

    Visit IBM developerWorks to try the IBM SOA Sandbox for people. The SOA Sandbox for people provides a trial environment with the necessary tooling and components required to enable consistent human and process interaction and collaboration, showing how you can improve user experience and business productivity.
    FREE! Go There Now!



    All FREE IBM® developerWorks Tools!

       

    SEARCH ENGINE TRICKS ARTICLES

    - Your Top 10 SEO Don`ts
    - Search Engine Nightmares: Grammatical Errors...
    - Identifying Keywords
    - Crafting Perfect Keyword Phrases
    - Why Are Search Engines So Popular?
    - Write SEO-Perfect Articles
    - What Does Google Want?
    - Can`t Find the Right Keywords?
    - A Guide to Spamdexing
    - Make it Searchable
    - Search Engine Optimization (SEO) in Internet...
    - Google Adsense - Ads That Make You Money!
    - A Hard Look at PPC, Click Fraud and the Alte...
    - The Net`s New Information Highway
    - Gerrymandering The Google Search Results

     
    Create the Optimal Architecture for your Critical Applications
    Warburton's the largest independently owned bakery in the UK faced a number of d....

     
    Five Best Practices for Deploying a Successful Service-Oriented Architecture
    This white paper describes the benefits you can expect with SOA, and how IBM can....

     
    Gartner Magic Quadrant for Application Delivery Controllers
    Gartner summarizes its view on Application Delivery Controllers, evaluates stren....

     
    Knowledge is Power
    What you don't know can hurt you, and is likely costing you money and increasing....

     
    Rationalizing the Multi-Tool Environment
    The rationalized multi-tool approach is flexible, scalable and cost effective. I....

     




    © 2003-2009 by Developer Shed. All rights reserved. DS Cluster 2 Hosted by Hostway
    For more Enterprise Application Development news, visit eWeek