6
The Page Dimension
The Page dimension contains a single record for each logical page hosted on a Web site. An impressionable page is identified by its URI stem combined with a set of content-identifying query string parameters.
This chapter contains the following topics:
- Page Dimension Hierarchies
- Page Dimension Levels
Page Dimension Hierarchies
The Page dimension contains the following hierarchies:
The Page Category Hierarchy
This hierarchy contains the following levels, listed from top to bottom. For details about the attributes of each level, see Page Dimension Levels.
- Page Category 6
- Page Category 5
- Page Category 4
- Page Category 3
- Page Category 2
- Page Category 1
- Page
The Page Resource Hierarchy
This hierarchy contains the following levels, listed from top to bottom. For details about the attributes of each level, see Page Dimension Levels.
- Resource Type
- Resource
- Page
Figure 6-1 The Page Category and Page Resource Level Hierarchies
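The two hierarchies share the Page level as their lowest level. As a minimal sketch, they can be written down as ordered lists, top to bottom. The constant names and the `ancestors` helper are hypothetical and are not part of the product.

```python
# Levels listed from top to bottom, as in the hierarchy descriptions above.
PAGE_CATEGORY_HIERARCHY = [
    "Page Category 6",
    "Page Category 5",
    "Page Category 4",
    "Page Category 3",
    "Page Category 2",
    "Page Category 1",
    "Page",
]
PAGE_RESOURCE_HIERARCHY = ["Resource Type", "Resource", "Page"]


def ancestors(hierarchy, level):
    """Return the levels above `level`, nearest parent first."""
    position = hierarchy.index(level)
    return hierarchy[:position][::-1]
```

For example, `ancestors(PAGE_RESOURCE_HIERARCHY, "Page")` lists Resource and then Resource Type, which is the roll-up path for a page in the Page Resource hierarchy.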
Page Dimension Levels
The Page dimension consists of the following levels. For each level, information about each attribute (column name) is provided in the following format:
- Attribute: Description (example value)
Levels are presented in descending order. In addition to the predefined attributes, each level contains five "generic" attributes that can be defined by the user.
CLK_L_PAGE_CAT6
- Page_Cat6_Code : Identifying code of the highest-level page content category; natural key. (User-defined)
- Page_Cat6_Name : Name of the highest-level page content category. (User-defined)
- Page_Cat6_Description : Description of the highest-level page content category. (User-defined)
- Page_Cat6_Attributes 1-5 : Page content category level user-defined attributes.
CLK_L_PAGE_CAT5
- Page_Cat5_Code : Identifying code of the intermediate page content category; natural key. (User-defined)
- Page_Cat5_Name : Name of the intermediate page content category. (User-defined)
- Page_Cat5_Description : Description of the intermediate page content category. (User-defined)
- Page_Cat5_Attributes 1-5 : Page content category level user-defined attributes.
CLK_L_PAGE_CAT4
- Page_Cat4_Code : Identifying code of the intermediate page content category; natural key. (User-defined)
- Page_Cat4_Name : Name of the intermediate page content category. (User-defined)
- Page_Cat4_Description : Description of the intermediate page content category. (User-defined)
- Page_Cat4_Attributes 1-5 : Page content category level user-defined attributes.
CLK_L_PAGE_CAT3
- Page_Cat3_Code : Identifying code of the intermediate page content category; natural key. (User-defined)
- Page_Cat3_Name : Name of the intermediate page content category. (User-defined)
- Page_Cat3_Description : Description of the intermediate page content category. (User-defined)
- Page_Cat3_Attributes 1-5 : Page content category level user-defined attributes.
CLK_L_PAGE_CAT2
- Page_Cat2_Code : Identifying code of the intermediate page content category; natural key. (User-defined)
- Page_Cat2_Name : Name of the intermediate page content category. (User-defined)
- Page_Cat2_Description : Description of the intermediate page content category. (User-defined)
- Page_Cat2_Attributes 1-5 : Page content category level user-defined attributes.
CLK_L_PAGE_CAT1
- Page_Cat1_Code : Identifying code of the lowest-level page content category; natural key. (User-defined)
- Page_Cat1_Name : Name of the lowest-level page content category. (User-defined)
- Page_Cat1_Description : Description of the lowest-level page content category. (User-defined)
- Page_Cat1_Attributes 1-5 : Page content category level user-defined attributes.
CLK_L_RESOURCE_TYPE
- Resource_Type_Code : Identifying code of the type of object served by a resource; natural key. Only CONTENT and DOWNLOAD types are included in the impression fact table. (AUDIO, CODE, CONTENT, DESIGN, DOWNLOAD, IMAGE, VIDEO, OTHER, UNKNOWN)
- Resource_Type_Name : Name of the type of object served by a resource. (Audio, Code, Content, Design, Download, Image, Video, Other, Unknown)
- Resource_Type_Attributes 1-5 : Resource type level user-defined attributes.
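The rule that only CONTENT and DOWNLOAD resource types are included in the impression fact table can be sketched as a simple membership check. The constant and function names below are hypothetical and chosen only for illustration; the product's own loading logic is not described in this chapter.

```python
# Resource type codes that the text above says feed the impression fact table.
IMPRESSION_RESOURCE_TYPES = frozenset({"CONTENT", "DOWNLOAD"})


def counts_as_impression(resource_type_code):
    """Return True when a resource of this type would be an impression."""
    return resource_type_code in IMPRESSION_RESOURCE_TYPES
```

Under this sketch, a request for an IMAGE or CODE resource would still have a record at the resource level but would not produce an impression.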
CLK_L_RESOURCE
The resource level contains a single record for each physical object hosted on a Web site and could include non-page resources. A resource may represent a static server object such as an HTML document or an image. A resource could also be a CGI program. Resources are assumed to be uniquely identified by the path component of a URI, that is, the non-query portion of the URI. A given CGI program therefore has only one record in the resource level, regardless of the number of distinct pages generated by the program.
- Resource_Site_ID : Site dimension foreign key for the site to which this resource belongs; natural key. (1, 2, 3, ...)
- Resource_URI_Stem : Portion of the URI that is not part of the query string and that identifies this resource; natural key. Any 'URL data' must not appear in this attribute. (/cgi/index.pl, /home.html, /images/navbar.gif)
- Resource_Description : User-defined description of the resource. (For example, a description of what a script does in the case of a CGI program.)
- Resource_Indentifies_Page: Indicates whether this resource is used to identify pages. (Y,N)
- Resource_Results_Page : Indicates if this resource is a search engine results page. (Y,N)
- Resource_Search_Param : When this resource is a search engine results page, this is the query string parameter containing the search expression. (s, search)
- Resource_File_Directory : Full directory path, excluding the file name. (/users/dmso)
- Resource_File_Extension : File extension minus the leading period character. If there is no extension, then this is simply a period (.). (html, pl, .)
- Resource_File_Name : File name portion of the URI stem. (Index.html, index.pl, index)
- Resource_MIME_Type: MIME type of this resource. (text/plain, text/html, text/xml)
- Resource_Delivery_Method : Delivery method used for this resource, such as 'dynamic', if this resource is a CGI script. (static, dynamic, unknown)
- Resource_Attributes 1-5 : Resource level user-defined attributes.
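The following minimal sketch shows how the stem, directory, file name, and extension values described above could be derived from a request URI. It assumes the standard Python `urllib.parse` and `posixpath` modules; the function name `resource_fields` is hypothetical, and the product's actual parsing rules may differ.

```python
import posixpath
from urllib.parse import urlsplit


def resource_fields(uri):
    """Derive resource-level values from a request URI (illustrative only)."""
    # The stem is the path component; the query string is excluded.
    stem = urlsplit(uri).path
    directory, file_name = posixpath.split(stem)
    _, extension = posixpath.splitext(file_name)
    return {
        "Resource_URI_Stem": stem,
        "Resource_File_Directory": directory,
        "Resource_File_Name": file_name,
        # Extension without the leading period; a lone period when absent.
        "Resource_File_Extension": extension[1:] if extension else ".",
    }
```

Because the query string is dropped, `/cgi/index.pl?a=1` and `/cgi/index.pl?a=2` yield the same stem, which matches the statement that a CGI program has only one record in the resource level.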
CLK_L_PAGE
- Page_Site_ID : Site dimension foreign key for the site to which this page belongs; natural key. (1, 2, 3, ...)
- Page_Code : Portion of the URI that is not part of the query string and that identifies the resource corresponding to this page; natural key. (/cgi/index.pl, /home.html, /images/navbar.gif)
- Page_URI_Query : Normalized query string including the leading '?' and containing only content-identifying parameters. Query parameters are sorted alphabetically. If there are no query parameters, then this should be '?'. (?a=123&b=xyz, ?)
- Page_Description : User-defined description of this page.
- Page_Type: User-defined type for this page. (Checkout, Homepage, Order Confirmation)
- Page_Title : The title of this page if it has one, such as the value of the <title> element in HTML documents.
- Page_Analyze_Neighbors: Whether this page is to be included when performing Paths previous and next page analysis. (Y, N)
- Page_Analyze_Paths_From: Whether this page is to be included as a source when performing Paths-From analysis. (Y, N)
- Page_Analyze_Paths_To : Whether this page is to be included as a destination when performing Paths-To analysis. (Y, N)
- Page_Attributes 1-5: Page level user-defined attributes.
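The normalization described for Page_URI_Query can be sketched as follows. This is an illustration under stated assumptions, not the product's implementation: the function name `page_uri_query` and the `content_params` argument (the set of content-identifying parameter names for a resource) are hypothetical, and the percent-encoding applied by `urlencode` is an assumption about how values are written back.

```python
from typing import Iterable
from urllib.parse import parse_qsl, urlencode, urlsplit


def page_uri_query(uri: str, content_params: Iterable[str]) -> str:
    """Return a normalized query string that always starts with '?'."""
    wanted = set(content_params)
    pairs = parse_qsl(urlsplit(uri).query, keep_blank_values=True)
    # Keep only content-identifying parameters, sorted by name (then value).
    kept = sorted((name, value) for name, value in pairs if name in wanted)
    # With nothing kept, urlencode returns '', so the result is just '?'.
    return "?" + urlencode(kept)
```

With `content_params` set to `{"a", "b"}`, the URI `/cgi/index.pl?b=xyz&session=9&a=123` normalizes to `?a=123&b=xyz`, and a URI with no query string normalizes to `?`.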