The CAS Server API allows users to build client programs that invoke the CAS Server to programmatically modify and control a variety of file system and CMS crawling operations.

The CAS Server Java Client (as coded in the CasServerSampleClient.java source file) demonstrates a number of basic crawling operations.

The CAS Server Java Client is intended to provide a working example of a client that communicates with a running CAS Server and issues file system crawling requests. The sample client program is therefore a template that you can use as a basis for your own client program.

The package includes all the libraries needed to build clients. It also includes an Ant build script that can compile and run the sample program, as well as Eclipse .project and .classpath files for the sample client.

The sample client application creates a file system crawl, runs a short probe crawl, updates the crawl to add wildcard filters and enable text extraction, runs a full crawl, and then deletes the crawl. Each of these operations is examined in detail later in this section.

Note that a default time limit of 10 seconds is set on both crawls, which means that in most cases the crawl output will not contain all the files on your file system.

The output files are written to the workspace/output/SampleClientTestCrawl directory, using a non-compressed XML file format. You can use a text editor to view the contents of the output.

The Ant build.xml file provides targets that compile and run the sample client program.

As with any Ant build file, you can run a specific target or run them all at once. Before starting Eclipse, you should run at least the compile target so that Eclipse can find the generated Web service stubs.

The file has the following targets:

To run the Ant build script:

The demo file system crawl (named SampleClientTestCrawl) will use C:\ on Windows and / on UNIX as the seed. When the demo crawl finishes, the CAS Service's workspace/output/SampleClientTestCrawl directory should contain two XML-format output files: CrawlerOutput-FULL.xml will have the content of the second crawl (i.e., the updated crawl with file filters), while the timestamped file in the archive directory will have the content from the first crawl.

Assuming that you have opened the CasServerSampleClient.java source in Eclipse or another editor, you should note certain important operations of the Main class.

  1. The values for the host and port of the CAS Service are set by first reading the commandline.properties file. If they are not set, the defaults of localhost and 8500 are used.

    String host = System.getProperty(CAS_HOST_PROPERTY);
    // The port is read the same way; the constant name is assumed to parallel CAS_HOST_PROPERTY.
    String port = System.getProperty(CAS_PORT_PROPERTY);
    if (host == null || "".equals(host)) {
        host = "localhost";
    }
    if (port == null || "".equals(port)) {
        port = "8500";
    }
  2. Arguments are created for the WSDL URL (the service definition interface) and the QName.

    final URL wsdlUrl = new URL("http://" + host + ":" + port + "/cas?wsdl");
    final QName name = new QName("http://endeca.com/itl/cas/2011-12", "CasCrawlerService");
  3. Using the WSDL URL and QName values, create a Web service locator and then use the CasCrawlerService.getCasCrawlerPort() method to get a handle to the CAS Service port.

    CasCrawlerService service = new CasCrawlerService(wsdlUrl, name);
    CasCrawler crawler = service.getCasCrawlerPort();
  4. Using a CrawlId object, set the name of the crawl.

    CrawlId crawlId = new CrawlId();
    crawlId.setId("SampleClientTestCrawl");
  5. Using the sampleCreateCrawl method, create the new file system crawl. Text extraction is not enabled, which means that a probe crawl will be run. Note that the CasCrawler.createCrawl() method actually creates the crawl.

    System.out.println("Creating Crawl with CrawlId '" + crawlId.getId() + "' ...");
    sampleCreateCrawl(crawler, crawlId);
  6. Using the sampleRunFullCrawl method, run the probe crawl, specifying a maximum of 10 seconds for the crawl duration. The CasCrawler.startCrawl() method is used to start the crawl, and the CasCrawler.stopCrawl() method is used to stop the crawl after 10 seconds have elapsed. (A rough sketch of this start/stop pattern appears after this list.)

    System.out.println("Running probe crawl...");
    sampleRunFullCrawl(crawler, crawlId, 10);
  7. Using the sampleUpdateCrawlAddingFiltersAndTextExtraction method, enable text extraction and set wildcard (htm*) filters that are evaluated against the Endeca.FileSystem.Extension record property. The original crawl configuration is retrieved with the CasCrawler.getCrawlConfig() method, and the updated configuration is sent to the CAS Server with the CasCrawler.updateConfig() method. (A rough sketch of this get/update pattern also appears after this list.)

    System.out.println("Adding filters and enabling text extraction...");
    sampleUpdateCrawlAddingFiltersAndTextExtraction(crawler, crawlId);
  8. Using the sampleRunFullCrawl method, run a second full crawl that does text extraction and uses the added filters. As with the previous crawl, a maximum of 10 seconds is specified for the crawl duration.

    System.out.println("Running full crawl...");
    sampleRunFullCrawl(crawler, crawlId, 10);
  9. Using the sampleDeleteCrawl method, delete the SampleClientTestCrawl demo crawl. Note that the CasCrawler.deleteCrawl() method actually deletes the crawl.

    System.out.println("Deleting crawl...");
    sampleDeleteCrawl(crawler, crawlId);
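The helper methods used in these steps are thin wrappers around CasCrawler calls. The following sketch shows the rough shape of sampleRunFullCrawl (step 6) and sampleUpdateCrawlAddingFiltersAndTextExtraction (step 7). The method names startCrawl(), stopCrawl(), getCrawlConfig(), and updateConfig() come from the steps above; the CrawlMode argument, the CrawlConfig type name, and the omitted filter-building details are assumptions, so treat this as an illustration rather than a copy of the sample source.

    // Rough shape of sampleRunFullCrawl: start the crawl in full mode, wait for
    // the time limit, then stop it. (The CrawlMode argument is an assumption.)
    crawler.startCrawl(crawlId, CrawlMode.FULL_CRAWL);
    Thread.sleep(10 * 1000L); // the 10-second limit used by the demo
    crawler.stopCrawl(crawlId);

    // Rough shape of sampleUpdateCrawlAddingFiltersAndTextExtraction: fetch the
    // current configuration, modify it, and send it back to the CAS Server.
    // (The CrawlConfig type name is an assumption based on getCrawlConfig().)
    CrawlConfig config = crawler.getCrawlConfig(crawlId);
    // ... enable text extraction and add an htm* wildcard filter on the
    // Endeca.FileSystem.Extension record property (details omitted) ...
    crawler.updateConfig(config);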

The sample client program also shows the use of other CAS Server API functions, such as the CasCrawler.listCrawls(), CasCrawler.getStatus(), and CasCrawler.getMetrics() methods.
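For example, a minimal sketch of these status-oriented calls might look like the following; it assumes listCrawls() returns a list of CrawlId objects and that getStatus() and getMetrics() each take a CrawlId (the Status and Metrics type names shown here are assumptions).

    // List every crawl defined on the CAS Server.
    for (CrawlId id : crawler.listCrawls()) {
        System.out.println("Found crawl: " + id.getId());
    }

    // Fetch and print the status and metrics of the demo crawl.
    Status status = crawler.getStatus(crawlId);
    Metrics metrics = crawler.getMetrics(crawlId);
    System.out.println("Status: " + status + ", metrics: " + metrics);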

You can modify the file and add other crawling operations, such as changing the output options (to send output to a Record Store instance), adding other types of filters (including date and regex filters), enabling archive expansion, and even returning information about the CAS Server. You can also use the sample code as a basis for creating and running CMS crawls.

These operations, and the methods that call them, are described elsewhere in this guide.

