[ic] Removing '?' and '&' from the URL for search engine submission, Generate Static

Dan Browning interchange-users@interchange.redhat.com
Fri Apr 26 21:39:01 2002


At 09:05 PM 4/26/2002 -0400, you wrote:
>On Fri, Apr 26, 2002 at 03:34:39PM -0700, Dan Browning wrote:
> > At 02:43 PM 4/26/2002 -0400, you wrote:
> > >On Fri, Apr 26, 2002 at 11:00:59AM -0700, Dan Browning wrote:
> > >> At 09:17 PM 4/25/2002 -0400, you wrote:
> > >> >On Thu, Apr 25, 2002 at 03:45:08PM -0700, Dan Browning wrote:
> > >> >> Fellow Interchangers,
> > >> >>
> > >> >> I recently filed a bug for the Generate Static feature (
> > >> >> http://interchange.redhat.com/bugs/show_bug.cgi?id=419 ).  (It is 
> also
> > >> >> pasted below).  Please comment on that if you would 
> like.  Otherwise,
> > >I
> > >> >> would like to ask what you are doing to submit sites to search 
> engines
> > >> >that
> > >> >> require that '?' and '&' are not in the URL.  Are you using Static
> > >> >> generation?
> > >> >
> > >> >I'm not convinced it's an issue.  Google doesn't seem to care, at least
> > >> >for top level pages.  If you get static pages indexed, you then lose
> > >> >the referer and that seems more of an issue.  We use web server
> > >> >rewrites and include minivend in ssi where a client gets really
> > >> >concerned about bcentral reports.  :-)
>.....
> > >
> > >Anyway, you will probably get more out of web server rewrites over the
> > >long haul.
> >
> > I agree, has anyone seen a posted Rewrite regex complex enough to do what
> > we're discussing?
>
>There are **lots** of ways to do that.  Don't get sucked into thinking
>that filtering an IC page is the way to go.  Maybe.
>
>FWIW, consider:
>
>use DBI
>for (@all_products)
>   print qq`<A HREF="vlink/sku">product</A><BR>`
>}
>
>Run that from cron once a night and redirect any
>robots to it rather than your catalog.  Just how will
>depend on your server.

We must be on the same page because my backup plan was basically that.

Build a static index.html and strip all "?id=...".  Add in a [item-list] 
for links to all products (without ?id=... of course) and submit that to 
the search engine.  It isn't very elegant, however, and other catalogs with 
content hidden a little deeper would have to be manually linked in.

+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
| Dan Browning, Kavod Technologies <db@kavod.com>
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
No good deed goes unpunished.
                 -- Clare Boothe Luce