This post was last edited by owenlee on 2009-4-8 11:29
Google indexes dynamically generated webpages, including .asp pages, .php pages, and pages with question marks in their URLs. However, these pages can cause problems for our crawler and may be ignored. If you're concerned that your dynamically generated pages are being ignored, you may want to consider creating static copies of these pages for our crawler. If you do this, please be sure to include a robots.txt file that disallows the dynamic pages in order to ensure that these pages aren't seen as having duplicate content.
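I did a rough translation of this; the gist is as follows:
Google can index dynamically generated pages, including ASP pages, PHP pages, and pages whose URLs contain question marks. However, such pages can cause problems for the crawler, which may then skip them. If you think your dynamically generated pages are being ignored, you may want to create static copies of them for the crawler. If you do, make sure your robots.txt file disallows crawling of the dynamic pages, so that the crawler does not treat the two versions as duplicate content.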
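As I understand it, the static-page configuration in SS7.0 was not fully thought through. Once a site admin turns on the static configuration, every thread ends up with two pages: a dynamic one and a static one. If robots.txt contains no matching rule that disallows the dynamic pages, Google's crawler cannot decide which of the two pages it should index. That is where the problem comes from.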
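To make the idea concrete, here is a minimal sketch of how one might check that a robots.txt rule blocks the dynamic pages while leaving the static copies crawlable. It uses Python's standard `urllib.robotparser`. The script name `viewnews.php`, the `/html/` directory and the `example.com` URLs are placeholders I made up for illustration; they are not the actual SS7.0 paths, so substitute whatever your own site uses.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. "/viewnews.php" is a placeholder for
# whatever script serves the dynamic pages on your site (an assumption,
# not a real SS7.0 path).
ROBOTS_TXT = """\
User-agent: *
Disallow: /viewnews.php
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def can_crawl(parser: RobotFileParser, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given user agent is allowed to fetch the URL."""
    return parser.can_fetch(agent, url)


parser = build_parser(ROBOTS_TXT)

# Placeholder URLs: one dynamic page and its assumed static copy.
dynamic_url = "http://example.com/viewnews.php?itemid=42"
static_url = "http://example.com/html/42.html"

print("dynamic page crawlable:", can_crawl(parser, dynamic_url))
print("static page crawlable:", can_crawl(parser, static_url))
```

Note that `urllib.robotparser` matches rules by plain path prefix, so this sketch deliberately avoids wildcard patterns. Whether a real crawler interprets your rules the same way is something to confirm with the search engine's own tools.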