  1. #1
    Junior Member
    Join Date: Dec 2009
    Posts: 18
    Member #: 20714
    I am using a cron job to submit my sitemap to Google, as suggested by http://sitemap-generators.googlecode...generator.html.

    My crontab is:
    29 6 * * * python /var/www/oilchange/sitemap_gen.py --config=/var/www/oilchange/oilchange_config.xml > /var/log/httpd/oilchange_sitemap_log


    It works, and the log is:

    Reading configuration file: /var/www/oilchange/oilchange_config.xml
    Opened ACCESSLOG file: /var/log/httpd/access_log
    Sorting and normalizing collected URLs.
    Writing Sitemap file "/var/www/oilchange/sitemap.xml.gz" with 664 URLs
    Notifying search engines.
    Notifying: www.google.com
    Count of file extensions on URLs:
    9 .css
    140 .gif
    50 .html
    3 .jpg
    7 .js
    450 .php
    2 .png
    1 .txt
    2 /
    Number of errors: 0
    Number of warnings: 0

    My question is: how can I notify other search engines, such as Yahoo and Bing, in addition to Google?


    I noticed the following in 'sitemap_gen.py', which comes from Google:

    # Search engines to notify with the updated sitemaps
    #
    # This list is very non-obvious in what's going on. Here's the gist:
    # Each item in the list is a 6-tuple of items. The first 5 are "almost"
    # the same as the input arguments to urlparse.urlunsplit():
    # 0 - schema
    # 1 - netloc
    # 2 - path
    # 3 - query <-- EXCEPTION: specify a query map rather than a string
    # 4 - fragment
    # Additionally, add item 5:
    # 5 - query attribute that should be set to the new Sitemap URL
    # Clear as mud, I know.
    NOTIFICATION_SITES = [
    ('http', 'www.google.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap')
    ]
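
    To make the comment above more concrete, here is a minimal Python 3 sketch of how such a 6-tuple could be turned into a ping URL. It is an assumption based only on what the comment describes, not a copy of the code in sitemap_gen.py. The helper name build_ping_url and the example sitemap URL are hypothetical, and the comment refers to Python 2's urlparse module, whereas the sketch uses the Python 3 equivalents in urllib.parse.

```python
from urllib.parse import urlencode, urlunsplit


def build_ping_url(site, sitemap_url):
    """Build a notification URL from a 6-tuple like those in NOTIFICATION_SITES.

    Hypothetical helper: it follows the layout described in the comment
    (scheme, netloc, path, query map, fragment, query attribute).
    """
    scheme, netloc, path, query_map, fragment, query_attr = site
    params = dict(query_map)           # copy so the shared tuple is not mutated
    params[query_attr] = sitemap_url   # item 5 names the sitemap parameter
    return urlunsplit((scheme, netloc, path, urlencode(params), fragment))


# The Google entry from the list above; the sitemap URL is a placeholder.
GOOGLE = ('http', 'www.google.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap')
ping_url = build_ping_url(GOOGLE, 'http://www.example.com/sitemap.xml.gz')
print(ping_url)
```

    Seen this way, items 1, 2, and 5 (host, path, and parameter name) are the parts that would have to match whatever each search engine expects.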


    I tried adding the following lines to the file:

    ('http', 'www.yahoo.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap')
    ('http', 'www.bing.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap')
    ('http', 'www.ask.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap')

    However, it does not work. With the added lines, the log file records nothing and the sitemap is not updated. After removing the added lines, the cron job works fine again.


    Any ideas? Thanks in advance.

  2. #2
    Junior Member
    Join Date: Jan 2010
    Location: Alabama
    Posts: 11
    Member #: 21024
    I'm no expert on this subject, but the other search engines may have minor differences in the code and sitemaps they accept. It might be worth checking out.
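
    One thing that might be worth ruling out first, assuming the extra lines went inside the NOTIFICATION_SITES list exactly as posted: there are no commas between the tuples. Inside square brackets Python joins the lines, so a tuple followed directly by another parenthesized group is read as a call on the first tuple, which raises a TypeError as soon as the list is built. That could also explain the empty log: the crontab's `>` truncates the log file and redirects only standard output, while a traceback goes to standard error. The sketch below reproduces the effect in Python 3. The host search.example.com and its path are placeholders, and load_sites is a hypothetical helper, not part of sitemap_gen.py.

```python
import warnings

# List as it might look with a line pasted in without separating commas.
BROKEN = """
NOTIFICATION_SITES = [
    ('http', 'www.google.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap')
    ('http', 'search.example.com', 'ping', {}, '', 'sitemap')
]
"""

# Same entries, each followed by a comma.
FIXED = """
NOTIFICATION_SITES = [
    ('http', 'www.google.com', 'webmasters/sitemaps/ping', {}, '', 'sitemap'),
    ('http', 'search.example.com', 'ping', {}, '', 'sitemap'),
]
"""


def load_sites(source):
    """Execute a snippet and return the NOTIFICATION_SITES list it defines."""
    namespace = {}
    with warnings.catch_warnings():
        # Recent Python versions also warn about the missing comma at compile time.
        warnings.simplefilter("ignore", SyntaxWarning)
        exec(source, namespace)
    return namespace["NOTIFICATION_SITES"]


try:
    load_sites(BROKEN)
    broken_error = None
except TypeError as exc:
    broken_error = exc  # raised because a tuple is being "called"

sites = load_sites(FIXED)
print("broken list raised:", type(broken_error).__name__)
print("fixed list has", len(sites), "entries")
```

    Even with the commas in place, reusing Google's path and parameter name for other hosts is only a guess; each engine's own documentation would be the place to confirm the host, path, and query attribute for its entry.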

