Google is going to include our website in its news service. But they warned
me about a problem with the newspaper section URL: the format http://www.server.com/section/language/date/ will make the google spider to
index only the actual edition of the site and not the following days.
____________________________________________________________________________________
Don't get soaked. Take a quick peek at the forecast
with the Yahoo! Search weather shortcut. http://tools.search.yahoo.com/shortcuts/#loc_weather
Thanks for your reply, Mugur, and sorry for my mistake in the url taxonomy.
The problem is Google needs the sections URL to categorize every article,
and campsite sections have the language/issue in it (they are not 'clean'
URL's that the spider can follow).
Maybe the solution is to create some kind of alias in the VirtualHost (
httpd.conf). Or maybe to mod_rewrite the URL to put the section in the first
place and remove the rest. I don't know and I'm a bit worried.
I attached here a PHP script named section_relocation.php. Please copy this script to your
instance HTML directory:
[campsite_dir]/www/[instance_name]/html.
E.g. for the default campsite install and default campsite instance this directory is:
/usr/local/campsite/www/campsite/html
Open a shell and type the following commands:
1. cd [campsite_dir]/www/[instance_name]/html
2. ln -s section_relocation.php [section_URL_name]
Repeat step 2 for each section in your publication. Replace [section_URL_name] with the value of
the field "URL Name" from section configuration screen.
This will redirect requests like:
http://[site_name]/[section_URL_name]/
to
http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/
That's all.
Mugur
--- digitalnegocio@gmail.com wrote:
> Thanks for your reply, Mugur, and sorry for my mistake in the url taxonomy.
> The problem is Google needs the sections URL to categorize every article,
> and campsite sections have the language/issue in it (they are not 'clean'
> URL's that the spider can follow).
>
> Maybe the solution is to create some kind of alias in the VirtualHost (
> httpd.conf). Or maybe to mod_rewrite the URL to put the section in the first
> place and remove the rest. I don't know and I'm a bit worried.
>
> Thanks.
>
>
>
>
> 2007/3/15, Mugur Rus :
> >
> > Hi,
> >
> > The correct Campsite URL is:
> > - http://www.server.com/
> > - http://www.server.com/language/
> > - http://www.server.com/language/issue/
> > - http://www.server.com/language/issue/section/
> > - http://www.server.com/language/issue/section/article
> > depending on the what are you viewing. There IS NO date in this URL. The
> > shortest URL -
> > http://www.server.com/ - ALWAYS displays the latest published issue. So if
> > yesterday the latest
> > issue was 10 and today you published issue 11 then today
> > http://www.server.com/ will display issue
> > 11.
> >
> > Hope this helps,
> > Mugur
> >
> > --- digitalnegocio@gmail.com wrote:
> > > Google is going to include our website in its news service. But they
> > warned
> > > me about a problem with the newspaper section URL: the format
> > > http://www.server.com/section/language/date/ will make the google spider
> > to
> > > index only the actual edition of the site and not the following days.
> > >
> > > Is there is any way to get URL's like http://www.server.com/section1,
> > > http://www.server.com/section2 , http://www.server.com/section3... that
> > > poins to the actual edition of every section? Did someone front this
> > problem
> > > before? What are the best options to solve it -redirection,
> > mod_rewrite...-?
> > >
> > > Thanks.
> > >
> >
> >
> >
> >
> >
> > ____________________________________________________________________________________
> > Don't get soaked. Take a quick peek at the forecast
> > with the Yahoo! Search weather shortcut.
> > http://tools.search.yahoo.com/shortcuts/#loc_weather
> >
>
Thanks for addressing this issue. Our marketing guy has been on me about
this for months ....
I may be wrong about this, but I don't think the proposed solution will
work. My understanding is that a redirect will cause crawlers from
Google and other search engines to index the page under the new target
URL, not the URL from which the page is being redirected. If this is the
case, there would need to be content at
http://[site_name]/[section_URL_name]/ in order to make Google happy.
As an interim solution, we've been playing with the idea of simply
creating the desired directories and then including the relevant section
pages from the current issue using php. Aside from being kind of messy,
this results in slower page loads and the pages don't recognize logins.
It does produce Google-indexable pages at the desired URLs, but if
there's a better way to do this in Campsite, I'd be grateful.
And again, my apologies if I've misunderstood something about how Google
works!
Best,
Evan
Mugur Rus wrote:
> Hi,
>
> I attached here a PHP script named section_relocation.php. Please copy this script to your
> instance HTML directory:
> [campsite_dir]/www/[instance_name]/html.
>
> E.g. for the default campsite install and default campsite instance this directory is:
> /usr/local/campsite/www/campsite/html
>
> Open a shell and type the following commands:
>
> 1. cd [campsite_dir]/www/[instance_name]/html
> 2. ln -s section_relocation.php [section_URL_name]
>
> Repeat step 2 for each section in your publication. Replace [section_URL_name] with the value of
> the field "URL Name" from section configuration screen.
>
> This will redirect requests like:
> http://[site_name]/[section_URL_name]/
> to
> http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/
>
> That's all.
>
> Mugur
>
> --- digitalnegocio@gmail.com wrote:
>
>> Thanks for your reply, Mugur, and sorry for my mistake in the url taxonomy.
>> The problem is Google needs the sections URL to categorize every article,
>> and campsite sections have the language/issue in it (they are not 'clean'
>> URL's that the spider can follow).
>>
>> Maybe the solution is to create some kind of alias in the VirtualHost (
>> httpd.conf). Or maybe to mod_rewrite the URL to put the section in the first
>> place and remove the rest. I don't know and I'm a bit worried.
>>
>> Thanks.
>>
>>
>>
>>
>> 2007/3/15, Mugur Rus :
>>
>>> Hi,
>>>
>>> The correct Campsite URL is:
>>> - http://www.server.com/
>>> - http://www.server.com/language/
>>> - http://www.server.com/language/issue/
>>> - http://www.server.com/language/issue/section/
>>> - http://www.server.com/language/issue/section/article
>>> depending on the what are you viewing. There IS NO date in this URL. The
>>> shortest URL -
>>> http://www.server.com/ - ALWAYS displays the latest published issue. So if
>>> yesterday the latest
>>> issue was 10 and today you published issue 11 then today
>>> http://www.server.com/ will display issue
>>> 11.
>>>
>>> Hope this helps,
>>> Mugur
>>>
>>> --- digitalnegocio@gmail.com wrote:
>>>
>>>> Google is going to include our website in its news service. But they
>>>>
>>> warned
>>>
>>>> me about a problem with the newspaper section URL: the format
>>>> http://www.server.com/section/language/date/ will make the google spider
>>>>
>>> to
>>>
>>>> index only the actual edition of the site and not the following days.
>>>>
>>>> Is there is any way to get URL's like http://www.server.com/section1,
>>>> http://www.server.com/section2 , http://www.server.com/section3... that
>>>> poins to the actual edition of every section? Did someone front this
>>>>
>>> problem
>>>
>>>> before? What are the best options to solve it -redirection,
>>>>
>>> mod_rewrite...-?
>>>
>>>> Thanks.
>>>>
>>>>
>>>
>>>
>>> ____________________________________________________________________________________
>>> Don't get soaked. Take a quick peek at the forecast
>>> with the Yahoo! Search weather shortcut.
>>> http://tools.search.yahoo.com/shortcuts/#loc_weather
>>>
>>>
>
>
>
>
> ____________________________________________________________________________________
> Sucker-punch spam with award-winning protection.
> Try the free Yahoo! Mail Beta.
> http://advision.webevents.yahoo.com/mailbeta/features_spam.html
>
Mugur, thanks for your solution, but I can't find any attached php script in
your message. Can you please send it again?
2007/3/17, Mugur Rus :
>
> Hi,
>
> I attached here a PHP script named section_relocation.php. Please copy
> this script to your
> instance HTML directory:
> [campsite_dir]/www/[instance_name]/html.
>
> E.g. for the default campsite install and default campsite instance this
> directory is:
> /usr/local/campsite/www/campsite/html
>
> Open a shell and type the following commands:
>
> 1. cd [campsite_dir]/www/[instance_name]/html
> 2. ln -s section_relocation.php [section_URL_name]
>
> Repeat step 2 for each section in your publication. Replace
> [section_URL_name] with the value of
> the field "URL Name" from section configuration screen.
>
> This will redirect requests like:
> http://[site_name]/[section_URL_name]/
> to
>
> http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/
>
> That's all.
>
> Mugur
>
> --- digitalnegocio@gmail.com wrote:
> > Thanks for your reply, Mugur, and sorry for my mistake in the url
> taxonomy.
> > The problem is Google needs the sections URL to categorize every
> article,
> > and campsite sections have the language/issue in it (they are not
> 'clean'
> > URL's that the spider can follow).
> >
> > Maybe the solution is to create some kind of alias in the VirtualHost (
> > httpd.conf). Or maybe to mod_rewrite the URL to put the section in the
> first
> > place and remove the rest. I don't know and I'm a bit worried.
> >
> > Thanks.
> >
> >
> >
> >
> > 2007/3/15, Mugur Rus :
> > >
> > > Hi,
> > >
> > > The correct Campsite URL is:
> > > - http://www.server.com/
> > > - http://www.server.com/language/
> > > - http://www.server.com/language/issue/
> > > - http://www.server.com/language/issue/section/
> > > - http://www.server.com/language/issue/section/article
> > > depending on the what are you viewing. There IS NO date in this URL.
> The
> > > shortest URL -
> > > http://www.server.com/ - ALWAYS displays the latest published issue.
> So if
> > > yesterday the latest
> > > issue was 10 and today you published issue 11 then today
> > > http://www.server.com/ will display issue
> > > 11.
> > >
> > > Hope this helps,
> > > Mugur
> > >
> > > --- digitalnegocio@gmail.com wrote:
> > > > Google is going to include our website in its news service. But they
> > > warned
> > > > me about a problem with the newspaper section URL: the format
> > > > http://www.server.com/section/language/date/ will make the google
> spider
> > > to
> > > > index only the actual edition of the site and not the following
> days.
> > > >
> > > > Is there is any way to get URL's like
> http://www.server.com/section1,
> > > > http://www.server.com/section2 , http://www.server.com/section3...
> that
> > > > poins to the actual edition of every section? Did someone front this
> > > problem
> > > > before? What are the best options to solve it -redirection,
> > > mod_rewrite...-?
> > > >
> > > > Thanks.
> > > >
> > >
> > >
> > >
> > >
> > >
> > >
> ____________________________________________________________________________________
> > > Don't get soaked. Take a quick peek at the forecast
> > > with the Yahoo! Search weather shortcut.
> > > http://tools.search.yahoo.com/shortcuts/#loc_weather
> > >
> >
>
>
>
>
>
> ____________________________________________________________________________________
> Sucker-punch spam with award-winning protection.
> Try the free Yahoo! Mail Beta.
> http://advision.webevents.yahoo.com/mailbeta/features_spam.html
>
Evan, as far as I know, the google spider only needs one fixed URL for
section, just to know where it should begin to crawl. So Mugur's solution
seems to be effective (anyway, I will tell you what they tell me about it).
2007/3/20, Evan Mellander :
>
> Hi,
>
> Thanks for addressing this issue. Our marketing guy has been on me about
> this for months ....
>
> I may be wrong about this, but I don't think the proposed solution will
> work. My understanding is that a redirect will cause crawlers from Google
> and other search engines to index the page under the new target URL, not the
> URL from which the page is being redirected. If this is the case, there
> would need to be content at http://[site_name ]/[section_URL_name]/
> in order to make Google happy.
>
>
>
> As an interim solution, we've been playing with the idea of simply
> creating the desired directories and then including the relevant section
> pages from the current issue using php. Aside from being kind of messy, this
> results in slower page loads and the pages don't recognize logins. It does
> produce Google-indexable pages at the desired URLs, but if there's a better
> way to do this in Campsite, I'd be grateful.
>
>
> And again, my apologies if I've misunderstood something about how Google
> works!
>
> Best,
>
> Evan
>
>
> Mugur Rus wrote:
>
> Hi,
>
> I attached here a PHP script named section_relocation.php. Please copy this script to your
> instance HTML directory:
> [campsite_dir]/www/[instance_name]/html.
> E.g. for the default campsite install and default campsite instance this directory is:
> /usr/local/campsite/www/campsite/html
>
> Open a shell and type the following commands:
>
> 1. cd [campsite_dir]/www/[instance_name]/html
> 2. ln -s section_relocation.php [section_URL_name]
>
> Repeat step 2 for each section in your publication. Replace [section_URL_name] with the value of
> the field "URL Name" from section configuration screen.
>
> This will redirect requests like:
> http://[site_name ]/[section_URL_name]/
> to
> http://[site_name ]/[default_pub_language]/[current_issue]/[section_URL_name]/
>
> That's all.
>
> Mugur
>
> --- digitalnegocio@gmail.com wrote:
>
>
> Thanks for your reply, Mugur, and sorry for my mistake in the url taxonomy.
> The problem is Google needs the sections URL to categorize every article,
> and campsite sections have the language/issue in it (they are not 'clean'
> URL's that the spider can follow).
>
> Maybe the solution is to create some kind of alias in the VirtualHost (httpd.conf). Or maybe to mod_rewrite the URL to put the section in the first
> place and remove the rest. I don't know and I'm a bit worried.
>
> Thanks.
>
>
>
>
> 2007/3/15, Mugur Rus :
>
>
> Hi,
>
> The correct Campsite URL is:
> - http://www.server.com/
> - http://www.server.com/language/
> - http://www.server.com/language/issue/
> - http://www.server.com/language/issue/section/
> - http://www.server.com/language/issue/section/article
> depending on the what are you viewing. There IS NO date in this URL. The
> shortest URL -http://www.server.com/ - ALWAYS displays the latest published issue. So if
> yesterday the latest
> issue was 10 and today you published issue 11 then todayhttp://www.server.com/ will display issue
> 11.
>
> Hope this helps,
> Mugur
>
> --- digitalnegocio@gmail.com wrote:
>
>
> Google is going to include our website in its news service. But they
>
>
> warned
>
>
> me about a problem with the newspaper section URL: the formathttp://www.server.com/section/language/date/ will make the google spider
>
>
> to
>
>
> index only the actual edition of the site and not the following days.
>
> Is there is any way to get URL's like http://www.server.com/section1,http://www.server.com/section2 , http://www.server.com/section3... that
> poins to the actual edition of every section? Did someone front this
>
>
> problem
>
>
> before? What are the best options to solve it -redirection,
>
>
> mod_rewrite...-?
>
>
> Thanks.
>
>
>
>
> ____________________________________________________________________________________
> Don't get soaked. Take a quick peek at the forecast
> with the Yahoo! Search weather shortcut.http://tools.search.yahoo.com/shortcuts/#loc_weather
>
>
>
> ____________________________________________________________________________________
> Sucker-punch spam with award-winning protection.
> Try the free Yahoo! Mail Beta.http://advision.webevents.yahoo.com/mailbeta/features_spam.html
>
>
>
> --
>
> Evan W. Mellander, Ph.D.
>
> Monitor CE / Prague Daily Monitor
>
> P
--- digitalnegocio@gmail.com wrote:
> Mugur, thanks for your solution, but I can't find any attached php script in
> your message. Can you please send it again?
>
>
> 2007/3/17, Mugur Rus :
> >
> > Hi,
> >
> > I attached here a PHP script named section_relocation.php. Please copy
> > this script to your
> > instance HTML directory:
> > [campsite_dir]/www/[instance_name]/html.
> >
> > E.g. for the default campsite install and default campsite instance this
> > directory is:
> > /usr/local/campsite/www/campsite/html
> >
> > Open a shell and type the following commands:
> >
> > 1. cd [campsite_dir]/www/[instance_name]/html
> > 2. ln -s section_relocation.php [section_URL_name]
> >
> > Repeat step 2 for each section in your publication. Replace
> > [section_URL_name] with the value of
> > the field "URL Name" from section configuration screen.
> >
> > This will redirect requests like:
> > http://[site_name]/[section_URL_name]/
> > to
> >
> > http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/
> >
> > That's all.
> >
> > Mugur
> >
> > --- digitalnegocio@gmail.com wrote:
> > > Thanks for your reply, Mugur, and sorry for my mistake in the url
> > taxonomy.
> > > The problem is Google needs the sections URL to categorize every
> > article,
> > > and campsite sections have the language/issue in it (they are not
> > 'clean'
> > > URL's that the spider can follow).
> > >
> > > Maybe the solution is to create some kind of alias in the VirtualHost (
> > > httpd.conf). Or maybe to mod_rewrite the URL to put the section in the
> > first
> > > place and remove the rest. I don't know and I'm a bit worried.
> > >
> > > Thanks.
> > >
> > >
> > >
> > >
> > > 2007/3/15, Mugur Rus :
> > > >
> > > > Hi,
> > > >
> > > > The correct Campsite URL is:
> > > > - http://www.server.com/
> > > > - http://www.server.com/language/
> > > > - http://www.server.com/language/issue/
> > > > - http://www.server.com/language/issue/section/
> > > > - http://www.server.com/language/issue/section/article
> > > > depending on the what are you viewing. There IS NO date in this URL.
> > The
> > > > shortest URL -
> > > > http://www.server.com/ - ALWAYS displays the latest published issue.
> > So if
> > > > yesterday the latest
> > > > issue was 10 and today you published issue 11 then today
> > > > http://www.server.com/ will display issue
> > > > 11.
> > > >
> > > > Hope this helps,
> > > > Mugur
> > > >
> > > > --- digitalnegocio@gmail.com wrote:
> > > > > Google is going to include our website in its news service. But they
> > > > warned
> > > > > me about a problem with the newspaper section URL: the format
> > > > > http://www.server.com/section/language/date/ will make the google
> > spider
> > > > to
> > > > > index only the actual edition of the site and not the following
> > days.
> > > > >
> > > > > Is there is any way to get URL's like
> > http://www.server.com/section1,
> > > > > http://www.server.com/section2 , http://www.server.com/section3...
> > that
> > > > > poins to the actual edition of every section? Did someone front this
> > > > problem
> > > > > before? What are the best options to solve it -redirection,
> > > > mod_rewrite...-?
> > > > >
> > > > > Thanks.
> > > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > ____________________________________________________________________________________
> > > > Don't get soaked. Take a quick peek at the forecast
> > > > with the Yahoo! Search weather shortcut.
> > > > http://tools.search.yahoo.com/shortcuts/#loc_weather
> > > >
> > >
> >
> >
> >
> >
> >
> > ____________________________________________________________________________________
> > Sucker-punch spam with award-winning protection.
> > Try the free Yahoo! Mail Beta.
> > http://advision.webevents.yahoo.com/mailbeta/features_spam.html
> >
>
____________________________________________________________________________________
Don't pick lemons.
See all the new 2007 cars at Yahoo! Autos. http://autos.yahoo.com/new_cars.html
My budget is still closed (I hope it will be open soon), so I will try your
solution, which seems to work fine. I've only had a problem with the
no-extension sections, that were misinterpreted by apache (showing the
source code of section_relocation.php). This lines inserted in .htaccess did
the trick:
ForceType application/x-httpd-php
ForceType application/x-httpd-php
ForceType application/x-httpd-php
ForceType application/x-httpd-php
ForceType application/x-httpd-php
I will let you know what Google has to say about this system.
Thanks again.
2007/3/20, Mugur Rus :
>
> here it is
>
> --- digitalnegocio@gmail.com wrote:
> > Mugur, thanks for your solution, but I can't find any attached php
> script in
> > your message. Can you please send it again?
> >
> >
> > 2007/3/17, Mugur Rus :
> > >
> > > Hi,
> > >
> > > I attached here a PHP script named section_relocation.php. Please copy
> > > this script to your
> > > instance HTML directory:
> > > [campsite_dir]/www/[instance_name]/html.
> > >
> > > E.g. for the default campsite install and default campsite instance
> this
> > > directory is:
> > > /usr/local/campsite/www/campsite/html
> > >
> > > Open a shell and type the following commands:
> > >
> > > 1. cd [campsite_dir]/www/[instance_name]/html
> > > 2. ln -s section_relocation.php [section_URL_name]
> > >
> > > Repeat step 2 for each section in your publication. Replace
> > > [section_URL_name] with the value of
> > > the field "URL Name" from section configuration screen.
> > >
> > > This will redirect requests like:
> > > http://[site_name]/[section_URL_name]/
> > > to
> > >
> > >
> http://[site_name]/[default_pub_language]/[current_issue]/[section_URL_name]/
> > >
> > > That's all.
> > >
> > > Mugur
> > >
> > > --- digitalnegocio@gmail.com wrote:
> > > > Thanks for your reply, Mugur, and sorry for my mistake in the url
> > > taxonomy.
> > > > The problem is Google needs the sections URL to categorize every
> > > article,
> > > > and campsite sections have the language/issue in it (they are not
> > > 'clean'
> > > > URL's that the spider can follow).
> > > >
> > > > Maybe the solution is to create some kind of alias in the
> VirtualHost (
> > > > httpd.conf). Or maybe to mod_rewrite the URL to put the section in
> the
> > > first
> > > > place and remove the rest. I don't know and I'm a bit worried.
> > > >
> > > > Thanks.
> > > >
> > > >
> > > >
> > > >
> > > > 2007/3/15, Mugur Rus :
> > > > >
> > > > > Hi,
> > > > >
> > > > > The correct Campsite URL is:
> > > > > - http://www.server.com/
> > > > > - http://www.server.com/language/
> > > > > - http://www.server.com/language/issue/
> > > > > - http://www.server.com/language/issue/section/
> > > > > - http://www.server.com/language/issue/section/article
> > > > > depending on the what are you viewing. There IS NO date in this
> URL.
> > > The
> > > > > shortest URL -
> > > > > http://www.server.com/ - ALWAYS displays the latest published
> issue.
> > > So if
> > > > > yesterday the latest
> > > > > issue was 10 and today you published issue 11 then today
> > > > > http://www.server.com/ will display issue
> > > > > 11.
> > > > >
> > > > > Hope this helps,
> > > > > Mugur
> > > > >
> > > > > --- digitalnegocio@gmail.com wrote:
> > > > > > Google is going to include our website in its news service. But
> they
> > > > > warned
> > > > > > me about a problem with the newspaper section URL: the format
> > > > > > http://www.server.com/section/language/date/ will make the
> google
> > > spider
> > > > > to
> > > > > > index only the actual edition of the site and not the following
> > > days.
> > > > > >
> > > > > > Is there is any way to get URL's like
> > > http://www.server.com/section1,
> > > > > > http://www.server.com/section2 , http://www.server.com/section3.
> ..
> > > that
> > > > > > poins to the actual edition of every section? Did someone front
> this
> > > > > problem
> > > > > > before? What are the best options to solve it -redirection,
> > > > > mod_rewrite...-?
> > > > > >
> > > > > > Thanks.
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > >
> ____________________________________________________________________________________
> > > > > Don't get soaked. Take a quick peek at the forecast
> > > > > with the Yahoo! Search weather shortcut.
> > > > > http://tools.search.yahoo.com/shortcuts/#loc_weather
> > > > >
> > > >
> > >
> > >
> > >
> > >
> > >
> > >
> ____________________________________________________________________________________
> > > Sucker-punch spam with award-winning protection.
> > > Try the free Yahoo! Mail Beta.
> > > http://advision.webevents.yahoo.com/mailbeta/features_spam.html
> > >
> >
>
>
>
>
>
> ____________________________________________________________________________________
> Don't pick lemons.
> See all the new 2007 cars at Yahoo! Autos.
> http://autos.yahoo.com/new_cars.html
>