Xtreeme - web development software
Site Search:      

 German version  
Home Products Download Buy Now Affiliates Forum
 Search Engine Software
 Search Engine Studio home page
 Requirements
 Download
 Features
 Screenshots
 Compare web search engine solutions
 Sample web site search engines
Check out some sample search engines generated by Search Engine Studio
 Buy Now / Pricing
Place your order online
 SES Service
Schedule automatic search engine updates without having to be logged on using an NT/XP service
 Technical Support
  Frequently asked questions
  Discussion Forum
 How does it work?
 Creating a web site search engine
 Creating an intranet search engine
 Creating an offline search engine (CD-ROM/DVD)
Our Customers
Web sites using Xtreeme search engine script
Question ID: Q2029
Question: How can I specify an exact list of URLs/documents to be indexed? / What's the format of indexer's input XML file?
Product: Search Engine Studio


You should use the XML mode of the indexer if you want to specify a precise list of URLs/documents to be indexed.

Below is more information about the format of the XML file to be used. Please note that this is the same format that's used to import structures to Xtreeme SiteXpert. The difference is that, unlike in SiteXpert, no structure needs to be created, the URLs should be simply listed in a long list.

The XML document should conform to the following DTD (document type definition):

<!DOCTYPE sitemap [
  <!ELEMENT sitemap (node+)>
  <!ELEMENT node (node*|text)*>
  <!ATTLIST node
    href CDATA #IMPLIED>
  <!ELEMENT text (#PCDATA)>
]>

Here's a sample XML file that can be used:

<sitemap>
<node href="http://www.xtreeme.com/"/>
<node href="http://www.xtreeme.com/sitexpert/"/>
<node href="http://www.xtreeme.com/search-engine-studio/"/>
<node href="http://www.xtreeme.com/dhtml/"/>
<node href="http://www.xtreeme.com/mailxpert/"/>
<node href="http://www.xtreeme.com/followupxpert/"/>
</sitemap>

If you have a different format of XML data, you can easily convert it to the above format by specifying an XSLT transformation file.

 Save This Page       Copyright 1998-2007 Xtreeme GmbH