This chapter contains the following sections:
The Desktop Service consists of global and dynamic attributes. The values applied to the global attributes are applied across the Sun Java System Sun Java System Access Manager configuration and are inherited by every configured organization. They cannot be applied directly to roles or organizations as the goal of global attributes is to customize the Sun Java System Access Manager application. Values applied to the dynamic attributes are assigned to a role or organization. When the role is assigned to a user or a user is created in an organization, the dynamic attribute then becomes a characteristic of the user.
This section describes the global attributes for the Desktop Service.
The table below describes the desktop service global attributes and their description.
Table 1–1 Desktop Service - Global Attributes
This section describes the dynamic attributes for the Desktop Service.
The table below describes the desktop service dynamic attributes and their description.
Table 1–2 Desktop Service - Dynamic Attributes
The Rewriter service provides a Java class library for rewriting URI (Uniform Resource Identifier) references in Web languages such as HTML, JavaScript, and WML (Wireless Markup Language), and in HTTP Location headers (redirections). For example, the Rewriter rewrites relative URIs of image tags so that the browser requests the images from the appropriate server. The Rewriter also functions in the Portal Gateway. For example, the Rewriter rewrites HTML links to provide access to an internal server. The Rewriter service is available as a Rewriter Rulesets tab listed in the Secure Remote Access (SRA) feature in the Portal Server Administration console.
The topics in this chapter that describe the Rewriter Service are:
You can perform the following tasks from the Rewriter tab in the Gateway Profile menu bar:
Select the Enable option in the Rewriting of All URIs attribute to enable the Gateway and to rewrite all URIs.
Click Save to record the change.
The prerequisite to map a URI to a Ruleset, to specify MIME (Multi Purpose Internet Mail Extensions) mappings, to specify the URIs not to rewrite, and to configure Rewriter proxies, specify one or more configuration attributes. See the http://docs.sun.com/app/docs/coll/1303.1 for more information on configuration attributes.
Navigate to the URIs Not to Rewrite attribute, and specify the URI in the text box.
Click Add to add the URI in the URIs Not to Rewrite list.
Click Save to record the change.
Navigate to the Map URIs to RuleSets attribute and map a URI to the appropriate RuleSet.
Click Save to record the change.
Navigate to the Map Parser to MIME Types field, and map a Parser to the appropriate MIME type.
Click Save to record the change.
Select the Enable option in the Rewriter Proxy attribute to enable the Rewriter proxy.
Specify the Rewriter Proxy value in the Rewriter Proxy List attribute text box.
Click Add to add the value in the Rewriter Proxy List.
The Rewriter Proxy attribute is available in the Rewriter Proxy and Netlet Proxy option in the Deployment tab of Gateway Profile.
Click Save to record the change.
Rewriter Rulesets is a service, which is available as the Rewriter Rulesets tab in the Secure Remote Access tab of Portal Server Administration console. When you click the tab, the Rewriter Rulesets page is displayed. The Rewriter Rulesets page lists the available rulesets in the Gateway. Each ruleset is a file that defines how the contents in a Web page should be rewritten by the Rewriter attributes. You can download an existing ruleset, upload a new ruleset to the list, and delete an available ruleset from the list. You can perform these tasks as follows:
The prerequisite to perform these tasks is to specify one or more configuration attributes . See the http://docs.sun.com/app/docs/coll/1303.1 for more information on configuration attributes.
Select a ruleset from the Gateway Rulesets list.
Click Download.
You can save the ruleset or open and view in the default Web browser.
Click Upload New.
The Upload New Ruleset page displays.
Click Browse to choose the ruleset to upload.
Click Upload.
This chapter explains the attributes listed in the Search Servers page. The following section introduces the Search Server attributes and describes the attributes in Search Server Settings.
When you select Search Servers tab in the Portal Server console, the Search Servers page appears. This page lists the existing Search Servers. To view an existing Search Server, click on the search server name. The Search Server Settings page for the selected Search Server is displayed.
Table 3–1 Server Settings Attributes
This chapter explains about the attributes available in the Search Robot. The properties for the robot are quite complex. You can select the sites to be searched, check to see if a site is valid, define what types of documents should be picked up, and schedule when the searches take place.
This chapter contains the following sections:
The Robot Overview panel is where you can see what the robot is doing. If it is Off, Idle, Running, or Paused; and if it is Running, what progress it is making in the search since the panel is refreshed about every 30 seconds. The refresh rate is defined using the robot-refresh parameter in the search.conf file.
If the robot is Off, the buttons are Start and Clear Robot Database . The Start button is at the top and the Clear Robot Database button is at the bottom of the panel. If the robot is Running or Idle, the two buttons are Stop and Pause. If it is Paused, the two buttons are Stop and Resume. By selecting on any of the Attributes, you go to the Reports section where you can get a detailed up-to-the-minute report of that Attribute.
The table below lists the Robot Overview attributes and their description.
Table 4–1 Robot Overview Attributes
Attribute |
Default Value |
Description |
---|---|---|
The Robot is |
Current activity |
The Robot’s state. Value can be Idle, Running, Paused, or Off |
Last Updated at |
Date and time last refreshed. |
This page is refreshed to keep you aware of what progress the robot is making. |
Starting Points |
Number defined |
Displays the sites that the robot crawls to generate resource descriptions. The robot does not index resources from disabled sites. |
URL Pool |
Number URLs waiting |
Number of URLs (Uniform Resource Locator) yet to be investigated. When you begin a search, the starting point URLs are entered into the URL pool. As the search progresses, the robot discovers links to other URLs. These URLs get added to the pool. After all the URLs in the pool have been processed, the URL pool is empty and the robot is idle. |
Extracting |
Number connections per second |
Number of resources looked at in a second. Extracting is the process of discovering or locating resources, documents or hyperlinks to be included in the database and filtering out unwanted items. |
Filtering |
Number URLs rejected |
Total number of URLs that are excluded. |
Indexing |
Number URLs per second |
Number of resources or documents turned into a resource description in a second. Indexing is the phase when all the information that has been gathered on a document is turned into a resource description for inclusion in the search database. |
Excluded URLs |
Number URLs excluded by filters |
Number of URLs that did not meet the filtering criteria. |
Number URLs excluded by errors |
Number of URLs where the robot encountered errors as file not found. |
|
Resource Descriptions |
Number RDs contributed |
Number of resource descriptions added to the database. |
Number Bytes of RDs contributed |
Number of bytes added to the database. |
|
General Stats |
Number URLs retrieved |
Number of URLs retrieved during run. |
Number Bytes average size of RDs | ||
Time in days, hours, minutes, and seconds running |
The amount of time the robot has been running. |
When you click the Sites tab, the Manage Sites page is displayed. This page displays the list of Site Names and the Status of each site (enabled or disabled) that the robot crawls to generate resource descriptions. When you select the checkbox, the Delete, Enable, and Disable buttons become active. Select the Delete button to delete a selected site. You can enable or disable the selected site by clicking the Enable or Disable buttons. A disabled site is not searched when the robot is run.
The table below provides the attributes and their description in the Manage Sites page.
Table 4–2 Manage Sites Attributes
Attribute |
Default Value |
Description |
---|---|---|
Lock or cluster graphic |
Status of site |
Lock open means that the URL is accessible. The closed lock means that the site is a secure web server and uses SSL. The cluster means that the site is a domain. |
Enabled/Disabled |
Enabled |
Choose to search this site or not when the robot is run. |
You can create a new site, by clicking the New button. When you click the New button, the New Robot Site page appears. This page allows you to set up a new Robot site. The table below provides the attributes available in the New Robot Site page and their description.
Table 4–3 New Robot Site Attributes
Attribute |
Default Value |
Description |
---|---|---|
Type |
URL |
Select URL or Domain from the list box. |
Site |
Blank |
If you have selected the Type as URL, enter the URL of the site you want to create. The URL format is: http://www.sesta.com If you have selected the Type as Domain, enter the domain of the site you want to create. The Domain format is: *.sesta.com |
Depth |
10 |
You have a choice of 1 for this URL only, 2 for this URL and first links, 3 - 10, 100 or unlimited. The default value is set in the Robot —> Manage Properties page. |
Destination Database |
Use Internal Default |
Select the database that you want to use from the list box showing the available databases. |
Click on the Site name to navigate to the Edit a Site page. You can use this page to define the search site more completely. You can specify what type of server it is, redefine the depth of the search, and select what type of files to add to the search database. The attributes for URL and Domain sites are mostly the same. The additional column in this table shows which attributes are shared and which are unique.
You can verify the server name for the search site you entered. In the Server Group section, click the New button to add more servers to the server group. In the Starting Points section, click the New button to add more starting points. In the Filter Definition section, you can add or delete, exclude or include certain types of files as well as change the order the filters for these files are applied.
The table below provides the attributes and their description in the Edit a Site page.
Table 4–4 Edit a Site Attributes
Under the Filters tab, there is a Manage Filters page, which lists all the defined Filter Rules, Status of each Filter Rule, Default value for New Site, and Used in Sites. Each Filter Rule is preceded by a checkbox. To delete a Filter Rule, you need to select the corresponding checkbox and click the Delete button. To create a new filter:
Click the New button.
The New Robot Filter Wizard appears. As a first step, the Specify Filter Name and Description page is displayed.
Enter the filter name in the Filter Name text box.
Enter the description for the filter in the Filter Description text box.
Click the Next button.
The Specify Filter Definition and Behavior page appears. This page provides the Filter Definition — Matching Rules section. The table below lists the attributes and their description provided in the Filter Definition and Behavior section.
Click the Finish button.
Attribute |
Default Value |
Description |
---|---|---|
Filter Source |
URL |
Choose an option from the list box to specify the source of the filter. The available values are: URL, protocol, host, path, and MIME type. |
Filter By |
is |
Choose an option from the list box to specify the how you want to filter the source. The available values are: is, contains, begins with, ends with, and regular expression. |
Filter String |
Blank |
You can enter the string to define the filter. |
Filter Default |
Selected |
Assign this filer to new sites when they are created. |
Filter Behavior |
Exclude documents that match this filter when Robot runs |
The default option excludes documents that match this filter when robot runs. The other unselected option includes documents that match this filter when Robot runs. |
Click on the Filter Rule to navigate to the Edit a Filter page. The table below lists the attributes and their description in the Edit a Filter page. The default value for these attributes are same as provided in the previous table.
Table 4–6 Edit a Filter Attributes
Attribute |
Description |
---|---|
Filter Name |
A descriptive name that reflects the type of file the filter applies to. |
Drop down list of Filter Sources |
URL, protocol, host, path, MIME type |
Drop down list of positions |
is, contains, begins with, ends with, regular expression |
Text box for type (directory, protocol, file extensions) specifics |
In this text box, list what you want to match. What would match in this example - http://docs.sesta.com/manual.html protocol is http; host contains sesta; file ends with html. |
Filter Description |
Describe the filter rule for yourself. The robot does not use it. |
Filter Default |
Use this as one of the default filters when creating new sites. If you do not check this, you can still add this filter to a new site by editing the site on the Robot, Sites page. |
Filter Behavior |
This attribute provides two options: Exclude documents that match this filter when Robot runs. Include documents that match this filter when Robot runs. By default, the first option is selected. |
Click the Robot —> Properties tab. The Manage Properties page appears. The settings on this page control the robot’s operational parameters and defaults. It is divided into these sections: Crawling Speed, Completion Actions, Logfile Settings, Standard Compliance, Authentication Parameters, Proxy Settings, Link Following, Advanced Settings, and Indexing Settings.
The table below lists the attributes and their description in the Manage Properties page.
Table 4–7 Manage Properties Attributes
The robot searches sites and collects documents based on the filters you have selected. The documents collected are in many different formats. To make them uniform and easily readable they need to be in one format, which is HTML. This page controls some of the parts that go into each resource description.
You can find the simulator attributes in the Robot Utilities page under the Utilities tab. The Robot Utilities page is a debugging tool that performs a partial simulation of robot filtering on a URL. You can type in a new URL to check. It checks the URL, DNS translations (including Smart Host Heuristics), and site redirections. It does not check the contents of the document specified by the URL, so it does not detect duplications, MIME types, network errors, permissions, and the like. The simulator indicates whether the listed sites would be accepted by the robot (ACCEPTED) or not (WARNING).
The table below provides the attributes and their description in the Simulator section in the Robot Utilities page.
Table 4–8 Robot Simulator Attributes
Attribute |
Default Value |
Description |
---|---|---|
Run Simulator on |
URLs you have already defined and one blank text box. |
You can check access to a new site by typing its URL in the blank text box. This checks to see if the new site accepts crawling. Format http://www.sesta.com:80/ |
Show advanced DNS information |
Unselected |
Selected displays more information about the site. |
Check for server redirects |
Selected |
Selected checks for any server redirects. |
The site probe attributes are also available in the Robot Utilities page. This page is a debugging tool that checks for DNS aliases, server redirects, and virtual servers. This tool returns information about site but does not test its acceptance of crawling.
The table below provides the site Probe attributes and their description.
Table 4–9 Robot Site Probe Attributes
Attribute |
Default Value |
Description |
---|---|---|
Run Site Probe on |
Blank |
Type in URL in format http://www.sesta.com:80 |
Show advanced DNS information |
Unselected |
Selected displays more information about the site including IP addresses. |
This chapter explains the attributes provided for the search database. The Database attributes are divided as follows:
Before knowing about the Search database, you need to know how to partition the database. To partition the database, use the run-cs-cli rdmgr -G command, because stopping the search server is required.
The initial Manage Databases page lists the available databases. You can select a database by selecting the checkbox preceding to it. Click the New, Reindex, Purge, Analyze, Manage, or Expire resource descriptions button to perform the necessary action on the selected database.
You should reindex the database if you have edited the schema to add or remove an indexed field (as author), or if a disk error has corrupted the index. You need to restart the server after you change the schema.
Because the time required to reindex the database is proportional to the number of RDs in the database, a large database should be reindexed when the server is not in high demand.
When you purge the contents of the database, disk space used for indexes will be recovered, but disk space used by the main database will not be recovered; instead, it is reused as new data is added to the database.
Expiring a database deletes all RDs that are deemed out-of-date. It does not decrease the size of the database. By default, an RD is scheduled to expire in 90 days from the time of creation.
The table below lists the Database Management attributes and their description.
Table 5–1 Database Management Attributes
Attribute |
Default Value |
Description |
---|---|---|
Name |
True or False |
Name for the database used by Search. |
Federated |
True or False |
For a Federated database, this value is True. Otherwise, the value is False. |
Import agents are the processes that bring resource descriptions from other servers or databases and merge them into your search database.
The initial Manage Import Agents page lists the available import agents. You can select an import agent by selecting the checkbox preceding to it. Click the New, Enable, Disable, Delete, or Run All Enabled Import Agents to perform the necessary action on the selected import agent. To schedule the import agents, select Scheduling on the lower menu bar.
If you choose to create a new import agent or edit or modify an existing import agent, the following database import agent attributes are displayed.
The table below lists the Database Import Agent attributes and their description.
Table 5–2 Database Import Agent Attributes
The initial Resource Descriptions page allows you to search the Resource Descriptions in the database. For example, you can correct a typographical error in an RD or manually assign RDs discovered by the robot to categories.
The table below lists the Resource Descriptions attributes and their description.
Table 5–3 Resource Descriptions Attributes
Attribute |
Default Value |
Description |
---|---|---|
New |
Opens up the New Resource Description page where you can enter the URL to create a new search RD. |
|
Edit |
Opens up the Edit URL page where you can modify only the attributes of a search RD, which can be edited. |
|
Edit All |
Opens up the Edit Resource Descriptions page where you can modify a group of search RD. |
|
Delete |
Deletes the selected search RD. |
|
Filter |
All |
The options available are Categorized (to list Categorized RDs), Uncategorized (to list Uncategorized RDs), and Custom Filter. |
Custom Filter |
This attribute provides the following options: Query (Selected by default) URL Category Text box — To enter the search string. On selecting the Category option, the Choose button appears. Click the Choose button to go to the Select a Category page where you cab select the category. |
A successful search displays the Number of RDs found and a list box with the RDs found. If you navigate to the Edit page for a resource description, you can modify only the attributes of a resource description, which can be edited. By default, you cannot edit some of the RD attributes listed in the table below. To edit all these attributes except the Classification attribute, change the settings in the Database/Schema/Edit schema attribute page.
The table below lists the Database RD Editable attributes and their description. The default value for these attributes depends on the selected RD.
Table 5–4 Database RD Editable Attributes
Attribute |
Description |
---|---|
Author |
Author(s) of the document. |
Author e-mail |
Email address to contact the Author(s) of the document. |
Classification |
Category name if classified; No Classification if not classified. |
ReadACL |
Related to document level security. |
Content-Charset |
Content-Charset information from HTTP Server. |
Content-Encoding |
Content-Encoding information from HTTP Server. |
Content-Language |
Content-Language information from HTTP Server. |
Content-Length |
Content-Length information from HTTP Server. |
Content-Type |
Content-Type information from HTTP Server. |
Description |
Description from RD. |
Expires |
Date on which resource description is no longer valid. |
Full-Text |
Entire contents of the document. |
Keywords |
Keywords taken from meta- tags. |
Last-Modified |
Date when the document was last modified. |
Partial-text |
Partial selection of text from the document |
Phone |
Phone number for Author contact |
Title |
Title of RD |
URL | |
virtual-db |
Used to implement virtual database. |
When you click the Schema tab under Databases, you will get the Manage Search Schema page. This page lists the available Search Schema attributes. The schema determines what information is in a resource description and what form that information is in. You can add new attributes or fields to an RD and set which ones can be edited and which ones can be indexed. When importing new RDs, you can convert schemas embedded in new RDs into your own schema.
The table below lists the Search Schema attributes and their description.
Table 5–5 Search Schema Attributes
Attribute |
Description |
---|---|
Author |
Author(s) of the document. |
Author-EMail |
Email address to contact the Author(s) of the document. |
Content-Charset |
Content-Charset information from HTTP Server. |
Content-Encoding |
Content-Encoding information from HTTP Server. |
Content-Language |
Content-Language information from HTTP Server. |
Content-Length |
Content-Length information from HTTP Server. |
Content-Type |
Content-Type information from HTTP Server. |
Description |
Brief one-line description for document. |
Expires |
Date on which resource description is no longer valid. |
Full-Text |
Entire contents of the document. |
Keywords |
Keywords that best describe the document. |
Last-modified |
Date when the document was last modified. |
Partial-Text |
Partial selection of text from the document. |
Phone |
Phone number for Author contact. |
ReadACL |
Used by Search servers to enforce security. |
Title |
Title of the document. |
URL |
Uniform Resource Locator for the document |
virtual-db |
Used to implement virtual database. |
When you select the checkbox preceding to a search schema attribute and click on it, the Edit search schema name page appears. This page displays all the attributes to edit a search schema attribute. The table below lists the attributes and their description to edit a search schema attribute.
Table 5–6 Edit Search Schema Attribute Attributes
Attribute |
Default Value |
Description |
---|---|---|
Name Description Aliases |
Author Author(s) of the document Blank |
When you import new RDs, you can convert schemas embedded in new RDs into your own schema. You would use this conversion when there are discrepancies between the names used for fields in the import database schema and the schema used for RDs in your database. An example would be if you imported RDs that used Writer as a field for the author and you used Author in your RDs as the field for the author. The conversion would be Writer to Author, so you would enter Writer in this text box. |
Editable |
false |
If true (checked), the selected attribute (field) appears as Editable attribute in the Edit page for a resource description. Description, Keywords, Title and ReadACL are editable. |
Indexable |
true |
If true (checked), the selected attribute (field) can be used as a basis for indexing. Author, Title and URL appear in the menu in the Advanced Search screen for the end user. This allows end users to search for values in those particular fields. Author, Expires, Keywords, Last Modified, Title, URL and ReadACL can be used as the basis for indexing. |
Score Multiplier |
Blank |
A weighting field for scoring a particular element. Any positive value is valid. |
Data Type |
String |
Defines the data type. You need to choose the data type from the list box. |
The Analysis page shows a sorted list of all sites and the number of resources from that site currently in the search database. Select Update Analysis to update the analysis on file.
The table below lists the Database Analysis attributes and their description.
Table 5–7 Database Analysis Attributes
Attribute |
Default Value |
Description |
---|---|---|
Number of RDs |
Current number of RDs retrieved from the URL. |
Lists current number of RDs from that URL. |
URL |
URL that the robot has successfully searched. |
A URL that has added. |
Protocol |
Protocol it uses to retrieve the RDs from that URL. |
Lists the protocol used while collecting the RDs from a web site. |
This chapter explains the attributes available for Search Categories. This chapter contains the following sections:
End users interact with the search database in two distinct ways: They can type direct queries to search the database, or they can browse through the database contents using a set of categories you design. You assign resources in a search database to categories to clarify complexity. If a large number of items are in the database, it is helpful to group related items together. Your primary concern in setting up your categories should be usability so that end users can more quickly locate specific kinds of items.
The search server uses a hierarchy of categories called a taxonomy. The term taxonomy in general describes any system of categories. In the context of a networked resource database such as the search server database, it describes any method you choose of categorizing network resources to facilitate retrieval.
When you click the Browse/Search tab under Categories, you will get the Manage Categories page. This page displays the categories in the taxonomy allowing you to browse the categories. You can also use this page to search for a category.
The attributes present in the Manage Categories page are shown in two tables. The first table lists the Manage Categories Attributes and the second table lists the Edit Category Attributes.
Table 6–1 Manage Categories Attributes
Attribute |
Default Value |
Description |
---|---|---|
New |
Opens up the New Search Category page which you can use to create a new category. |
|
Edit |
Opens up the Edit Category page for that category where you can modify attributes of a Category. |
|
Delete |
Deletes the selected Category. |
|
Reindex |
Reindexes the database. If you have just created your taxonomy, you need to index the database to make category search available to your end users. If you have changed your categories, you need to reindex the database to make it up-to-date. Save the categories tree before you reindex the database. Load the new taxonomy. |
|
Search |
Blank |
Used to search for a Category. |
When you select the checkbox preceding to a Category and click the Edit button, the Edit selected Category name page appears. You can use the attributes in this page to edit a Category. The following table lists the attributes available in the Edit page and their description.
Table 6–2 Edit Category Attributes
Attribute |
Description |
---|---|
Category |
Displays the Category name. |
Description |
Displays the description of the selected category. |
Matching Rule |
Displays the matching rule to use with the selected category. |
After you set up the categories for your database, Click New to set or change the rules the robot for selected categories to assign resources to categories.The table below lists the Classification Rules attributes and their description.
Table 6–3 Classification Rules Attributes
This chapter describes how you can schedule the Robot and Import Agents in Portal server 7.1 console application. This chapter contains the following sections:
In the Sun Java System Portal Server 7.1 Administration console, click the Scheduling tab. By default, the Robot tab is enabled and shows the Scheduling Robot page. This page is where you set up the automatic search schedule for the robot. Use the attributes in the page to start and stop the robot at the times you specify.The table below lists the robot schedule attributes and their description.
Table 7–1 Robot Schedule Attributes
Attribute |
Default Value |
Description |
---|---|---|
Start Robot Time in hours and minutes |
none selected |
This is the time that the robot starts to search. You need to specify the robot start time to schedule. The format is hh : mm : am or pm. For example, 12 : 00 : am. |
Days |
none selected |
Sun, Mon, Tue, Wed, Thu, Fri, or Sat Select at least one day. |
Stop Robot Time in hours and minutes |
none selected |
If you plan to run the robot continuously, it is recommended that you stop and restart it at least once per day. This gives the robot a chance to release resources and re-initialize itself. You need to specify the time in the format: hh : mm : am or pm. For example, 12 : 00 : am. |
Days |
none selected |
The Import Agents tab is located under the Scheduling tab. When you click the Import Agents tab, the Scheduling Import Agents page appears. This page is where you set up the schedule for running the import agents. The table below lists the database import schedule attributes and their description.
Table 7–2 Database Import Schedule Attributes
Attribute |
Default Value |
Description |
---|---|---|
Start Import Time in hours and minutes |
none selected |
Time that the import agent starts to import. You need to specify the start import time to schedule. The format is hh : mm : am or pm. For example, 12 : 00 : am. |
Days |
none selected |
Sun -Sat Select at least one day. |
When you click the Autoclassify tab under Scheduling, the Scheduling Autoclassify page appears. This page lists the available Start Autoclassify attributes, which you can use to schedule autoclassify to start at the times you specify. The table below lists the attributes Start Autoclassify attributes and their description.
Table 7–3 Scheduling Autoclassify Attributes
Attributes |
Default Value |
Description |
---|---|---|
Day of the Week |
None selected |
You need to choose the particular day in a week to schedule autoclassify. The listed days are Sunday, Monday, Tuesday, Wednesday, Thursday, Friday, and Saturday. |
Time |
None selected |
You need to specify the time to schedule autoclassify. The format is hh : mm : am or pm. For example, 12 : 00 : am. |
This chapter explains the attributes available for Search Reports. This chapter contains the following sections:
The Reports section allows you to monitor your search server. You can see a summary of its activity: what sites were searched, what URLs were excluded and why, detailed information about URLs visited by the robot, and what your end users are interested in.
This page shows a list of robot runs. To display a list of reasons URLs were excluded, set a date, select Show, then select one of the Reasons for Exclusion. Displayed is a list of the excluded URLs for that reason. Duplicate and warning exclusions have been removed.
The table below lists Reports Excluded URLs attributes and their description.
Table 8–1 Reports Excluded URLs Attributes
Attribute |
Default Value |
Description |
---|---|---|
Count |
Numbers |
List of numbers with reasons for exclusion. |
Reason for Exclusion |
List of reasons sites have not been allowed. Each reason is linked to a list of all the URLs that were excluded for that reason. |
Filter rules, file not found, site not allowed, protocol not allowed, errors, duplication are some of the reasons URLs were excluded. |
When you click the Advanced Robot Reports tab under Reports, you will get the Robot Report Files page. This page gives you access to a number of different reports from the robot. Select from a drop down list to get information for chosen report to show up. The Refresh button gets the current information.
The table below lists the Reports Robot Report Files attributes and their description.
Table 8–2 Robot Report Files Attributes
Attribute |
Default Value |
Description |
---|---|---|
Reports |
Blank |
Overview, Version, Configuration, Performance, Database, Libnet, Modules, DNS Cache Dump, All Servers Found, URLs-Ready For Extraction, URLs-Ready For Indexing, URLs- Waiting For Filtering (URL pool), URLs- Waiting for Indexing. |
This page allows you to view the entries or specific lines from a log file. Drop down list of log files. Enter the number of lines you want to be displayed when you select Show button.
The table below lists the Reports View Log Files attributes and their description.
Table 8–3 Reports —> Log Files Attributes
Attribute |
Default Value |
Description |
---|---|---|
Log File |
Blank |
Excluded URLs (filter), RD Manager (rdmgr), RDM Server (rdmsvr), Robot Activities (robot), Search Engine (searchengine), User Queries (rdm). |
Lines to show |
25 |
A number you can enter to display the most current entries in the log file. |
When you click the Popular Searches tab under Reports, you will get the Manage Popular Searches page. This page lists the information on your popular searches. This page allows you to see what users are searching for. The most frequent searches appear first in the report. The table lists the Popular Searches attributes and their description.
Table 8–4 Popular Searches Attributes
Attribute |
Default Value |
Description |
---|---|---|
Filter |
All Items |
All Items includes what categories users browse in. Exclude Browse excludes browsing statistics. |
You can set the log level for all the server log settings attributes listed in the table below in the /var/opt/SUNWportal/searchservers/search1/config/SearchLogConfig.properties file.The table lists the Search Server Log Settings attributes and their description.
Table 8–5 Search Server Log Settings Attributes
The possible levels of all the above logs are SEVERE, WARNING, INFO, FINE, FINER, and FINEST.
A Subscription is a profile of interest covering many sources of information such as categories, discussions, and searchable documents. The Subscriptions Service consists of root level, organization and user attributes. These attributes are available in the Sun Java System Portal Server Console. This chapter explains the following Subscriptions Service attributes:
This section describes the root level attributes for the Subscriptions Service with which you can set the system-wide default maximum number of subscriptions for each type, categories, discussion, and saved searches. You can find these root level attributes in the Subscriptions page, which is in the Subscriptions Settings tab, when you select the TopLevel (Global) DN from the Select DNs drop down list. You can use this Subscriptions page to edit the subscription service attributes at root level.
The table below lists the subscription service attributes at root level and their description.
Table 9–1 Subscriptions Service - Root Level Attributes
Attribute |
Default Value |
Description |
---|---|---|
Maximum Category Subscriptions |
5 |
Specifies the maximum number of subscriptions on categories that can be defined and stored in the Sun Java System Access Manager. |
Maximum Discussion Subscriptions |
5 |
Specifies the maximum number of subscriptions on discussions that can be defined and stored in the Sun Java System Access Manager. |
Maximum Saved Search |
5 |
Specifies the maximum number of saved search subscriptions that can be defined and stored in the Sun Java System Access Manager. |
This section describes the organization attributes for the Subscriptions Service.
You can find these organization attributes in the Subscriptions page, which is in the Subscriptions Settings tab, when you select the Organization name [Org] DN from the Select DNs drop down list. You can use this Subscriptions page to edit the organization attributes. The table below lists the subscription service organization attributes and their description.
Table 9–2 Subscriptions Service — Organization Attributes
This section describes the user attributes for Subscriptions Service.
You can find these User attributes in the Subscriptions page, which appears under the Settings tab, when you select the Username [User] DN from the Select DNs drop down list. You can use this Subscriptions page to edit the subscription service user attributes. The table below lists the subscription service user attributes and their description.
Table 9–3 Subscriptions Service - User Attributes
This section describes the scheduling profiler attributes for Subscriptions Service.
You can find the scheduling profiler attributes in the Subscriptions page, which is in the Subscriptions Scheduling Profiler tab. You can use this Subscriptions page to start and stop times for the profiler. The following tables provide attributes for profiler run.
Click the New tab to enable the Start Profiler attributes. You can use these Start Profiler attributes shown in the table to setup the automatic start time for the profiler.
Table 9–4 Attributes for Start Profiler
Attribute |
Default Value |
Description |
---|---|---|
Day of the Week |
None Selected |
Sun, Mon, Tue, Wed, Thu, Fri, or Sat |
Time |
None Selected |
This is the time when the profiler starts to search. You need to select the hours : minutes : am or pm. For example, 12 : 00 : pm |
Click the New tab to enable the Stop Profiler attributes. You can use these Stop Profiler attributes shown in the table to setup the automatic stop time for the profiler.
Table 9–5 Attributes for Stop Profiler
Attribute |
Default Value |
Description |
---|---|---|
Day of the Week |
None Selected |
Sun, Mon, Tue, Wed, Thu, Fri, or Sat |
Time |
None Selected |
This is the time when the profiler stops. You need to select the hours : minutes : am or pm. For example, 12 : 00 : pm |