Navigation mode
Step 2 - Navigation options
In this step you define where and how Layout Wizard should scan for files. The most important part of this step is setting the start URL.
This is the file from which the scanning process will begin. Usually this is the index file located in the root folder of a website, eg. http://www.xtreeme.com/index.html.
You may enter either a local file (or click Browse to select it), or a remote URL on the web.
Note: Setting start URL to a local file may not work if you don't have Internet Explorer 4.0 or better installed.
You can go ahead and skip rest of the options of this step as well as the next step of Layout Wizard. Default options set for you should work fine with most websites. In case you want to customize the process, here are the other options of this step:
Include this URL in site map - Select it if you want the start URL to be included in site map's layout
Use relative paths - This option determines whether site map links will be absolute or relative.
For instance, given the start URL: http://www.xtreeme.com/, the URL 'http://www.xtreeme.com/sitexpert/sxsupport.html' can be expressed in two ways:
Ignore bookmarks - If a file has bookmarks (eg. index.html#sales and index.html#contact), selecting this option will
prevent the file from being listed more than once. Leaving this option unselected will list the file once for every bookmark referred to from another file.
Ignore parameters in parameterized links - Select this option if you want Layout Wizard to ignore parameters of parameterized links, eg. <a href="goto.asp?target=new.htm"> would be interpreted as <a href="goto.asp">.
Match layout with the original directory structure - Selecting this option will work similar to using the file structure mode of Layout Wizard. The resulting layout will reflect the directory structure of scanned URLs.
For example, assuming that http://www.xtreeme.com/ and http://www.xtreeme.com/sitexpert/ will be scanned, http://www.xtreeme.com/sitexpert/ will be a direct child of http://www.xtreeme.com/ no matter if http://www.xtreeme.com/ directly links to http://www.xtreeme.com/sitexpert/.
If a URL is referred to more than once use... - Specify what Layout Wizard should do if it finds two or more links to one file.
Get URL descriptions from - Setting this option determines the source of item descriptions. This can be:
When a link to a different web site is found - Specify what Layout Wizard should do if it finds a link to an external web site.
If for example the start URL is http://www.xtreeme.com, a link to 'http://www.xtreeme.com/sitexpert' will be treated as internal but a link to
'http://www.yahoo.com' will be external. Note that a link to 'http://shop.xtreeme.com' would also be external.
Warning: Selecting the first option may result in an infinite scanning process.
If you want to allow scanning only selected external web sites, select the second option and define allowed external links in the next step of Layout Wizard.
The third option will only list external links without further scanning for nested links.
Treat domain with a different prefix as external web site -
When this option is selected, links to the same web site, but with a different
domain prefix (eg. for the www.microsoft.com web site, msdn.microsoft.com would be such domain)
are treated as links to external web sites, and depending on
other options may be ignored. Note: In the next step of Layout Wizard you may specify additional domains that will be
interpreted as internal and scanned for links.
Step 3 - Include and Exclude files
In this step you specify filters that determine which file groups should be automatically included in the site map. By default HTML documents are
added to the Include File Filters. You may add or remove filters by clicking on buttons with a plus or cross icon. You may also select one of standard choices:
Allowed external web sites allow you to specify which external web sites Layout Wizard is allowed to scan (eg. microsoft.com, yahoo.com). This feature is useful if you want your site map
to show a structure of documents located on a few different web sites. Note: Links such as <A HREF="../folder/file.html"> will be treated as
external if such a link is found in an HTML page located in the same folder as the start URL. However if this link refers to a subfolder of start URL's parent folder
it will be an internal link. Also, depending on the "Treat domain with a different prefix as external web site" setting from the previous step of Layout Wizard,
if your start URL is eg. www.microsoft.com, then msdn.microsoft.com might need to be added to this list if you want pages from the
msdn.microsoft.com server to be mapped as well.
There are three more options on the bottom of the window. The first one - 'Erase old layout before starting Layout Wizard' determines whether Layout Wizard
should delete all items from the layout tree before it begins scanning. Unselect this option if you want to update the layout tree (for example
if your website has changed and you want to add newly created pages to the site map without destroying previous layout structure).
The next option -- 'Sort alphabetically' will sort all items in the layout tree alphabetically, after Layout Wizard's job is done.
The last option -- 'Do not include files above level' tells Layout Wizard to skip all files located deeper in the hierarchy than the given level number.
Selecting this option and entering 1 will only create a list of files referenced from the base URL.
You can also set advanced options for Layout Wizard by clicking on the 'Advanced' button. In the advanced options dialog you will find the following options:
Default display name for files - This option determines how site map items found by Layout Wizard will be formatted.
Use the Layout view to manually change item names. You may choose files to be represented by their file name (with or without extension), or by their description.
The last choice (custom) can be used to customize the display name format according to the text entered in the text field. This text can contain one or more of the following special entries:
Add/remove suffix - Clicking this button will take you to Suffix Builder for Descriptions -- a tool that performs automatic processing of item text
by adding or removing suffixes (prefixes or postfixes) for certain items, or even removing items from the layout based on their description
Click here to go to a detailed description of Suffix Builder
Do not include SiteXpert-generated files - This option prevents Layout Wizard from including custom icons and files previously generated by SiteXpert.
Ignored links - This is a list of links ignored by Layout Wizard. Whenever Layout Wizard finds a link during the scanning process
it checks the link against all items from this list. If there is a match, the link is ignored. This allows you to remove a part of the website from the site map.
The items in this list can contain wildcard characters (? and *). For instance, adding '*products*' will disable the products section from being included in the site map.
Placing the star character '*' before and after 'products' is necessary to include links like:
Connection timeout - Selecting this option and setting a value will change the default timeout option for the internet connection.
Use larger timeout value if your internet connection is very slow.
Connection retries - Number of times Layout Wizard will try to reconnect if connection to an URL fails.
Discover redirected URLs - Selecting this option will slow down the scanning process a little, but when
a link refers to a URL that is automatically redirected to a different URL, this option will add the correct (redirected)
URL to the layout tree.
Skip directories with extensions - Selecting this option will speed up the scanning process. You should not select this option if at least one of the directories on your web site contains extensions (e.g. http://www.domain.com/dir.ext/). This setting does not affect files with extensions, e.g. page.htm
Don't list any excluded files - Selecting this option will prevent all files matching one or more exclude file filters
from being listed in the layout tree. Otherwise, some of the excluded files will be listed in the layout tree
(if they are HTML files with links to other files matching one or more include file filters).
Ask for password when required - Select this option if you want Layout Wizard to display a login/password window whenever authentication is required (on password protected pages or for proxy authentication). A lock icon will be displayed next to a page that authentication.
Submit HTML forms - Select this option if you want Layout Wizard to submit HTML forms. You will be prompted each time a form is encountered. This is especially useful if you want to scan password-protected pages accessible through HTML forms. Clicking the 'more' button will allow you to define HTML Form submit rules -- rules that determine which forms should be processed or ignored by SiteXpert. Note: you can also choose not to display form parameters in sitemap. This is important if form parameters include login and password information.
Accelerate scanning process - SiteXpert offers a very quick scanning mode that simultaneously opens multiple URLs using separate connections. You should disable this mode only if you encounter problems (such as timeouts with a very slow connection).
HTML extensions - This is a list of HTML extensions. In some situations (when content-type of document is unavailable) SiteXpert needs to know if a file is an HTML file. Make sure all possible extensions on your website are listed here.
Frame options - Choose how you want frameset and nested frame pages to be processed by Layout Wizard. The default option lists both the frameset page and nested frame pages. No matter which option you choose,
Layout Wizard will still scan frame pages for links.
When you're done setting advanced options, click OK to go back to Layout Wizard.
Please note there are two SiteXpert-specific tags you may insert into your HTML documents:
To allow Layout Wizard to start scanning for files click the Finish button. The scanning process might take a while so please be patient. If you wish, you can stop or pause the
process at any time by clicking the 'Stop' button. You will be notified by a message on the top-right corner of SiteXpert's main window
whether all links were found or not. Press the 'More' button to see a list of invalid links. Some of these links might be valid but since they are external links
SiteXpert has not even checked them. This list is also saved in file ScanLog.txt in subfolder 'Logs' of SiteXpert's program folder.

Fig 1. Layout Wizard - Navigation mode options
Relative links are more flexible if the site map file will be moved to a different location together with the whole site. If you want to move the site map file only, you should
use absolute links. Relative links have another advantage: when a visitor downloads your whole website to a local disk, all site map links will point to his/her local disk
instead of the original location.

Fig 2. Layout Wizard - Include and exclude files
A filter can contain wildcard characters. Here are some sample filters:
Exclude File/URL Filters are file groups that, although they match one or more include filters, should not be added to the site map. For example, adding 's*' to
exclude filters will prevent files starting with an 's' from being included.
Please note: In navigation mode of Layout Wizard, you can also specify absolute URLs which should be
excluded. For example, specifying http://www.xtreeme.com/ as the start URL and adding http://www.xtreeme.com/support/* to 'Exclude File/URL Filters' will
filter out all URLs placed inside the support directory and its subdirectories. As a result, http://www.xtreeme.com/support/ and http://www.xtreeme.com/support/sitexpert/index.htm will be both filtered out.
Fig 3. Layout Wizard - Advanced options
Description comes from one of a few available sources (defined in the previous step of Layout Wizard) such as the TITLE or META tag. SiteXpert will also look for the description between <SX-DESCRIPTION> and </SX-DESCRIPTION> tags (which should be located inside a comment).
If a description cannot be found, file name will be used. You may also click on 'Add/remove suffix' to change this formatting in an intelligent way.
An alternative way to ignore links is inserting the <!--SX-DONT-SCAN--> tag into an HTML page.