RE: [WSG] [OT] Google search/index/webmaster help

2009-11-02 Thread michael.brockington
-Original Message-
From: li...@webstandardsgroup.org [mailto:li...@webstandardsgroup.org]
On Behalf Of Philippe Wittenbergh
Sent: 01 November 2009 23:05
To: wsg@webstandardsgroup.org
Subject: Re: [WSG] [OT] Google search/index/webmaster help

>  Because that file is being served as 'text/html' instead 
>  of 'text/xml' as it should. 
>  That is server misconfiguration. 
I agree that this is technically incorrect, but hardly unusual.


>  I'm not surprised Googlebot doesn't pick it up.
I would be absolutely flabbergasted if Google ignored it purely because
of that.
Google has a strong history of being pragmatic; they _want_ to use this
file; why would you expect them to ignore a file with the right name,
the right kind of content, in the right place?

As an aside, how many robots.txt do you think get served up as text/html
?

Regards,
Mike


Mike Brockington
Web Development Specialist

www.calcResult.com
www.stephanieBlakey.me.uk
www.edinburgh.gov.uk

This message does not reflect the opinions of any entity other than the
author alone.


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



Re: [WSG] [OT] Google search/index/webmaster help

2009-11-01 Thread Andrew Harris
How I love this community!
I haven't solved my problems yet, but based on the comments and ideas
I've gathered in the past few days, the site has improved
substantially. This latest comment from Philippe...

On Mon, Nov 2, 2009 at 10:04 AM, Philippe Wittenbergh  wrote:
> Because that file is being served as 'text/html' instead of 'text/xml' as it
> should. That is server misconfiguration.

I dismissed at first, thinking our CMS wouldn't allow me to tweak such
fundamental settings, but it led me into the bowels of the support
forums where I dredged up the little slice of code I needed. Now, the
sitemap.xml as well as the kml and gpx feeds are all served correctly
as text/xml - did I say how I love this community? - and I've a
grudging respect for MySource Matrix too!

-- 
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



Re: [WSG] [OT] Google search/index/webmaster help

2009-11-01 Thread Andrew Harris
> Because that file is being served as 'text/html' instead of 'text/xml' as it
> should. That is server misconfiguration. I'm not surprised Googlebot doesn't
> pick it up.

yes, quite right, unfortunately, I don't think I can get the CMS to
serve it correctly as xml, however google digested it happily enough -
just failed to spider all URLs - something which I now know is normal.

The only puzzle I still have is with the search results in our Custom
Search Engine (still off topic!) but why would the public search
return a different amount to the custom search? I have to admit, after
5 months of no change, this week it's gone from 1 result to 21 - go
figure!

Thanks again to all who replied.

It's just reinforced to me that if you want an internal search engine
that really works and is controllable, that leaving it up to the magic
donkeys at google is really not an option. Still trying to convince
our fine institution of that ;-)

-- 
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



Re: [WSG] [OT] Google search/index/webmaster help

2009-11-01 Thread Philippe Wittenbergh


On Nov 2, 2009, at 6:58 AM, Adam Smith wrote:

Swami; I'll hazard a guess here and assume you're using Firefox; and  
you've done what I did and gone tohttp://maps.unimelb.edu.au/sitemap.xml 
, seen a mass of test on screen


Because that file is being served as 'text/html' instead of 'text/xml'  
as it should. That is server misconfiguration. I'm not surprised  
Googlebot doesn't pick it up.

Safari shows the same issue.

Philippe
---
Philippe Wittenbergh
http://l-c-n.com/







***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



Re: [WSG] [OT] Google search/index/webmaster help

2009-11-01 Thread Adam Smith
Actually, I think you're both right!
 
Swami; I'll hazard a guess here and assume you're using Firefox; and you've 
done what I did and gone to http://maps.unimelb.edu.au/sitemap.xml, seen a mass 
of test on screen and saved the file to be confronted with: 
http://www.sitemaps.org/schemas/sitemap/0.9";> - at the top of the file. 
 
I almost shudder to say this, but use... internet explorer (I feel like 
I need a wash!) for this; and the file's true structure is seen on screen. 
 
If some time presents today, I'll dig around and see if I can find out why 
there's something Firefox can't do better than IE (7 by the way) or if it's one 
of the plugin's I'm running.

>>> On 31/10/2009 at 2:33 PM, Andrew Harris  wrote:

ahh - no.

I did change some stuff on the site, but not the xml file - I suspect
whatever you were looking at it with the first time had to put the
html tags around it just to make sense of it.

On Sat, Oct 31, 2009 at 10:00 AM, Swami Neelamber  wrote:
> Sorry Hassan! It would seem it's been changed.
> Andrew's been beavering away, as one does.
>
> His original "XML" file I downloaded from the same URI as you
> did: , right onto my desktop (it's
> still there), and I believe I probably did that a number of hours before you
> looked.
> Sorry for the confusion. Keep breathing.
>
> Swami  :)
>
>
>
> www.blueskyzen.com/design
>
>
> On Sat, Oct 31, 2009 at 4:35 AM, Hassan Schroeder 
> wrote:
>>
>> Swami Neelamber wrote:
>>
>>> I'm not totally sure about that  you've used top and bottom
>>> of your *sitemap.xml* file?
>>
>> Don't know what you're looking at but there are no such tags in the
>> document at 
>>
>> --
>> Hassan Schroeder - has...@webtuitive.com
>> webtuitive design ===  (+1) 408-621-3445   === http://webtuitive.com
>> twitter: @hassan
>>  dream.  code.
>>
>>
>> ***
>> List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
>> Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
>> Help: memberh...@webstandardsgroup.org
>> ***
>>
>
>
> ***
> List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
> Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
> Help: memberh...@webstandardsgroup.org
> ***



-- 
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***


Network Ten Pty Ltd ABN 91 052 515 250

Network Ten Disclaimer.
This e-mail (including all attachments) is intended solely for the 
named addressee. If you receive it in error, please let us know by 
reply e-mail, delete it from your system and destroy the copies. 
This e-mail is also subject to copyright. No part of it should be 
reproduced, adapted or transmitted without the written consent of 
the copyright owner. E-mails may be interfered with, may contain 
computer viruses or other defects and may not be successfully 
replicated on other systems. We give no warranties in relation to 
these matters. If you have any doubts about the authenticity of an 
e-mail purportedly sent by us, please contact us immediately.



***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***


Re: [WSG] [OT] Google search/index/webmaster help

2009-10-30 Thread Andrew Harris
ahh - no.

I did change some stuff on the site, but not the xml file - I suspect
whatever you were looking at it with the first time had to put the
html tags around it just to make sense of it.

On Sat, Oct 31, 2009 at 10:00 AM, Swami Neelamber  wrote:
> Sorry Hassan! It would seem it's been changed.
> Andrew's been beavering away, as one does.
>
> His original "XML" file I downloaded from the same URI as you
> did: , right onto my desktop (it's
> still there), and I believe I probably did that a number of hours before you
> looked.
> Sorry for the confusion. Keep breathing.
>
> Swami  :)
>
>
>
> www.blueskyzen.com/design
>
>
> On Sat, Oct 31, 2009 at 4:35 AM, Hassan Schroeder 
> wrote:
>>
>> Swami Neelamber wrote:
>>
>>> I'm not totally sure about that  you've used top and bottom
>>> of your *sitemap.xml* file?
>>
>> Don't know what you're looking at but there are no such tags in the
>> document at 
>>
>> --
>> Hassan Schroeder - has...@webtuitive.com
>> webtuitive design ===  (+1) 408-621-3445   === http://webtuitive.com
>> twitter: @hassan
>>                          dream.  code.
>>
>>
>> ***
>> List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
>> Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
>> Help: memberh...@webstandardsgroup.org
>> ***
>>
>
>
> ***
> List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
> Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
> Help: memberh...@webstandardsgroup.org
> ***



-- 
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



Re: [WSG] [OT] Google search/index/webmaster help

2009-10-30 Thread Swami Neelamber
Sorry Hassan! It would seem it's been changed.

Andrew's been beavering away, as one does.

His original "XML" file I downloaded from the same URI as you did: <
http://maps.unimelb.edu.au/sitemap.xml>, right onto my desktop (it's still
there), and I believe I probably did that a number of hours before you
looked.

Sorry for the confusion. Keep breathing.


Swami  :)




www.blueskyzen.com/design



On Sat, Oct 31, 2009 at 4:35 AM, Hassan Schroeder wrote:

> Swami Neelamber wrote:
>
>  I'm not totally sure about that  you've used top and bottom of
>> your *sitemap.xml* file?
>>
>
> Don't know what you're looking at but there are no such tags in the
> document at 
>
> --
> Hassan Schroeder - has...@webtuitive.com
> webtuitive design ===  (+1) 408-621-3445   === http://webtuitive.com
> twitter: @hassan
>  dream.  code.
>
>
>
> ***
> List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
> Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
> Help: memberh...@webstandardsgroup.org
> ***
>
>


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***

Re: [WSG] [OT] Google search/index/webmaster help

2009-10-30 Thread Hassan Schroeder

Swami Neelamber wrote:

I'm not totally sure about that  you've used top and bottom 
of your *sitemap.xml* file?


Don't know what you're looking at but there are no such tags in the
document at 

--
Hassan Schroeder - has...@webtuitive.com
webtuitive design ===  (+1) 408-621-3445   === http://webtuitive.com
twitter: @hassan
  dream.  code.


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



[WSG] [OT] Google search/index/webmaster help

2009-10-30 Thread Swami Neelamber
Hi Andrew

I'm not totally sure about that  you've used top and bottom of
your *sitemap.xml* file?

It looks suspiciously like regular HTML to me, albeit with an XML extension
...

I'm a bit of a newbie here, but I've had good success with *this* structure:


http://www.google.com/schemas/sitemap/0.84"; xmlns:xsi="
http://www.w3.org/2001/XMLSchema-instance"; xsi:schemaLocation="
http://www.google.com/schemas/sitemap/0.84
http://www.google.com/schemas/sitemap/0.84/sitemap.xsd";>
   
 .. URI .
   monthly
0.5
   
|
etc. ...
|
   
 .. URI .
   monthly
0.5
   


I'm sure too that there's a wealth of info over at w3c.org ...

Cheers


Swami Neelamber






www.blueskyzen.com/design


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***

Re: [WSG] [OT] Google search/index/webmaster help

2009-10-29 Thread Andrew Harris
Bother! that last reply was supposed to be off list!
Oh well, the discussion had got around to web standards by that point,
so it's fair game.

...and it's a friday afternoon, cut me some slack!!

Thanks to all those who have replied off list. By way of reporting
back to the list, I'll say...

1) Sitemaps are not the magic fix I thought they were.
2) Inbound links and organic indexing are vital.
3) My map pages are pretty short on text - google likes text.

One thing that no-one picked up on was that I still haven't inserted
some common metadata tags - I know they say google doesn't look at the
metadata tags, but it makes me wonder.

Funny how asking your peers to check your work suddenly makes you
aware of basic things you'd missed...
 yes, my pages weren't valid - but they are now!!!  ;-p

--
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***



Re: [WSG] [OT] Google search/index/webmaster help

2009-10-29 Thread Andrew Harris
Craig,
OK - that's a really interesting comment.

I had, as far as I knew, used the right formatting, the sitemap validates as
XML and Google's webmaster tools accepted it as a valid feed (after a few
tweaks!)

I followed this document, which I understand is the definitive source.
http://www.sitemaps.org/protocol.php
and my sitemap looks pretty much like that - apart from a couple of
whitespace discrepancies.

The fact that it worked for some of the URLs makes me think it's not a
problem with the sitemap, but it's all interesting stuff.

Thanks for taking the time to reply.

On Fri, Oct 30, 2009 at 3:24 PM, Craig Jones
wrote:

>  Hi Andrew,
> This is my firts time trying to help...
> It doesn't appear that your sitemap is written in xml
> The sitemap should look like this
> http://www.sitemaps.org/schemas/sitemap/0.9
>
> http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd";>
> 
> www.unimelb.edu.au/campuses/maps.html
> 
> 
> www.unimelb.edu.au/campuses/maps2.html
> 
> 
> then submit you new sitemap in google webmaster tools
> Goodluck
> Craig
>
>
> Andrew Harris wrote:
>
> Yes, I know it's off topic, but I really need a hand with a mystifying
> problem. I've tried the google forums, but have received no replies.
> If there are any listers who understand the free Google Custom Search
> Engine, webmaster tools, sitemaps and indexing problems, then I'd
> really appreciate you contacting me directly in regards to some
> problems I'm having.
>
> Just to give an idea of my 
> quandary...http://maps.unimelb.edu.au/http://maps.unimelb.edu.au/sitemap.xml 
> (100+ URLs submitted 5 months 
> ago)http://www.google.com/search?q=site:maps.unimelb.edu.au (34 results = 
> pathetic!)http://go.unimelb.edu.au/6t6 (1 result = totally pathetic!)
>
> Hopefully, it's nothing completely bleeding obvious that will
> humiliate me in front of my peers ;-)
>
>
>
>
> --
>
> ***
> List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
> Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
> Help: memberh...@webstandardsgroup.org
> ***
>



-- 
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***


[WSG] [OT] Google search/index/webmaster help

2009-10-29 Thread Andrew Harris
Yes, I know it's off topic, but I really need a hand with a mystifying
problem. I've tried the google forums, but have received no replies.
If there are any listers who understand the free Google Custom Search
Engine, webmaster tools, sitemaps and indexing problems, then I'd
really appreciate you contacting me directly in regards to some
problems I'm having.

Just to give an idea of my quandary...
http://maps.unimelb.edu.au/
http://maps.unimelb.edu.au/sitemap.xml (100+ URLs submitted 5 months ago)
http://www.google.com/search?q=site:maps.unimelb.edu.au (34 results = pathetic!)
http://go.unimelb.edu.au/6t6 (1 result = totally pathetic!)

Hopefully, it's nothing completely bleeding obvious that will
humiliate me in front of my peers ;-)

-- 
Andrew Harris
and...@woowoowoo.com
http://www.woowoowoo.com

~~~ <*>< ~~~


***
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: memberh...@webstandardsgroup.org
***