Use an RSS Feed as Corpus Document Source

This section is a short guide to uploading documents to your PoolParty corpus by crawling RSS feeds, so that feed content becomes part of the corpus.

  1. Select the Crawl RSS Feed tab in the Upload Documents dialogue.

  2. Enter the feed's URL in the Provide URL field.

    • Enforce Corpus Language: select this check box to have PoolParty use the language you defined when you created the corpus, regardless of the languages used in the feed. (Default: enabled)

  3. Click Crawl to start the upload process.

The content of the provided feed and of the pages it links to is fetched and stored as files in your corpus. When the upload is finished, a message showing the upload's status is displayed.
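To get a feel for what a feed crawl has to work with, the following minimal sketch lists the items of an RSS 2.0 feed and the pages they link to. It is not PoolParty's implementation: the function name `list_feed_items` and the sample feed are hypothetical, and the sketch only parses a feed that is already available as text; it does not download anything.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed, used only for illustration.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item>
      <title>First post</title>
      <link>https://example.org/posts/1</link>
      <description>Summary of the first post.</description>
    </item>
    <item>
      <title>Second post</title>
      <link>https://example.org/posts/2</link>
      <description>Summary of the second post.</description>
    </item>
  </channel>
</rss>"""


def list_feed_items(feed_xml):
    """Return one dict per <item> with its title, link, and description."""
    root = ET.fromstring(feed_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "description": item.findtext("description", default=""),
        })
    return items


if __name__ == "__main__":
    # Each link is a page that a crawler could fetch in addition to the feed text.
    for entry in list_feed_items(SAMPLE_FEED):
        print(entry["title"], entry["link"])
```

Each entry corresponds to one feed item; the `link` values are the linked pages whose content would be fetched alongside the feed text.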

You can view all documents uploaded to your corpus in the Corpus Documents tab.

Note

Unlimited file upload is only available for PoolParty Enterprise server and PoolParty Semantic Integrator. For all other license types, the upload is limited to 100 files or an overall size of 10 MB.
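If you want to estimate in advance whether a set of documents fits within the limited upload, a rough check could look like the sketch below. The names `MAX_FILES`, `MAX_TOTAL_BYTES`, and `within_upload_limit` are hypothetical, and the sketch assumes that both limits apply at the same time and that 10 MB means 10 × 1024 × 1024 bytes; PoolParty may count sizes differently.

```python
MAX_FILES = 100
# Assumption: "10 MB" is interpreted as 10 * 1024 * 1024 bytes.
MAX_TOTAL_BYTES = 10 * 1024 * 1024


def within_upload_limit(file_sizes):
    """Return True if the file count and the total size both stay within the limits.

    file_sizes is a list of file sizes in bytes.
    """
    return len(file_sizes) <= MAX_FILES and sum(file_sizes) <= MAX_TOTAL_BYTES


if __name__ == "__main__":
    sizes = [50_000] * 80  # 80 files of 50 kB each
    print(within_upload_limit(sizes))
```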