Web Service Method: Categorize a URL

Description

[url] Categorizes a given URL by first extracting concepts and then aggregating them by their associated top concepts.

The aggregated top concepts represent the categories. The more concepts from a specific top concept are found in the document at the URL, the higher that top concept's score. The scores of all categories sum to 100%.
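
The aggregation described above can be illustrated with a minimal sketch. It is not the service's actual scoring algorithm: it assumes each extracted concept contributes a plain weight to its top concept, and the function name `aggregate_scores` and the sample labels are hypothetical.

```python
from collections import defaultdict


def aggregate_scores(concepts):
    """Sum concept weights per top concept and normalize to percentages.

    `concepts` is a list of (top_concept_label, weight) pairs. This input
    shape is an assumption made for illustration only.
    """
    totals = defaultdict(float)
    for top_concept, weight in concepts:
        totals[top_concept] += weight

    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}  # nothing extracted, so there are no categories

    # Normalize so that all category scores add up to 100.
    return {top: 100.0 * value / grand_total for top, value in totals.items()}


# Hypothetical extraction result: three concepts under "Animals", one under "Plants".
example = [("Animals", 1.0), ("Animals", 1.0), ("Animals", 1.0), ("Plants", 1.0)]
scores = aggregate_scores(example)
print(scores)
```

With equal weights, a top concept that covers three of four extracted concepts receives three quarters of the total score.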

URL: /extractor/api/categorization

Request

Supported Methods

POST

GET

Content-Type

Content-Type: application/x-www-form-urlencoded

HTTP Parameter

| Parameter | Type | Required | Description |
|---|---|---|---|
| conceptSchemeFilters | Array of String | false | Concept scheme URI filters |
| customClassFilters | Array of String | false | Custom class URI filters |
| disambiguation | boolean | false | Use disambiguation. If not supplied, defaults to false. |
| displayText | boolean | false | Include the text extracted from the URL in the response. If not supplied, defaults to false. |
| language | String | false | Language of the text (en, de, es, fr, ...) |
| projectId | Array of String | true | Thesaurus project IDs |
| scoringAlgorithm | String | false | Scoring algorithm to use (simple or ppxBoost). If not supplied, defaults to simple. |
| url | String | true | URL of the document to be categorized |
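
As an illustration of how the parameters above fit into a POST request, the following sketch builds (but does not send) a form-encoded request with Python's standard library. The host `https://example.org`, the project ID, and the helper name `build_categorization_request` are hypothetical. Encoding array parameters as repeated keys is an assumption, and authentication is not covered here.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://example.org"  # hypothetical host, replace with your server
ENDPOINT = "/extractor/api/categorization"


def build_categorization_request(url, project_ids, **optional):
    """Build a POST request for the categorization method.

    Array parameters are sent as repeated keys (an assumption about how
    the service expects arrays in form-encoded bodies).
    """
    params = [("url", url)]
    params += [("projectId", project_id) for project_id in project_ids]
    for name, value in optional.items():
        if isinstance(value, bool):
            value = "true" if value else "false"  # form fields carry strings
        params.append((name, value))

    body = urlencode(params).encode("utf-8")
    return Request(
        BASE_URL + ENDPOINT,
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


request = build_categorization_request(
    "https://example.com/article.html",
    ["my-project-id"],  # hypothetical project ID
    language="en",
    displayText=True,
)
print(request.full_url)
print(request.data.decode("utf-8"))
```

Passing the resulting object to `urllib.request.urlopen` would send it. The sketch stops before that step so it can be run without a server.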

Response

This method returns execution results in JSON format.

Categorization response

| Attribute | Type | Comment |
|---|---|---|
| categories | Array of Category | Categories found in text |
| text | String | Text as extracted from URL or file |
| title | String | Title as extracted from URL or file |

Categorization result

| Attribute | Type | Comment |
|---|---|---|
| categoryConceptResults | Array of ConceptCategory | Categorized concepts |
| prefLabel | String | Preferred label |
| score | double | Score |
| uri | String | URI |

Categorized concept

| Attribute | Type | Comment |
|---|---|---|
| prefLabel | String | Preferred label |
| score | double | Score |
| uri | String | URI |
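
To show how the three response structures nest, here is a short sketch that parses a response and ranks the categories by score. The sample payload is invented for illustration and is not real service output. The sketch assumes that each entry in `categories` has the attributes listed under "Categorization result" and that scores are expressed on a 0 to 100 scale. The function name `rank_categories` is hypothetical.

```python
import json

# Invented sample payload that follows the attribute tables above.
SAMPLE_RESPONSE = """
{
  "title": "Example page",
  "text": "Example text",
  "categories": [
    {
      "prefLabel": "Plants",
      "score": 25.0,
      "uri": "http://example.org/plants",
      "categoryConceptResults": [
        {"prefLabel": "Oak", "score": 25.0, "uri": "http://example.org/oak"}
      ]
    },
    {
      "prefLabel": "Animals",
      "score": 75.0,
      "uri": "http://example.org/animals",
      "categoryConceptResults": [
        {"prefLabel": "Dog", "score": 50.0, "uri": "http://example.org/dog"},
        {"prefLabel": "Cat", "score": 25.0, "uri": "http://example.org/cat"}
      ]
    }
  ]
}
"""


def rank_categories(payload):
    """Return (prefLabel, score) pairs sorted by descending score."""
    response = json.loads(payload)
    categories = response.get("categories", [])
    ranked = sorted(categories, key=lambda category: category["score"], reverse=True)
    return [(category["prefLabel"], category["score"]) for category in ranked]


ranking = rank_categories(SAMPLE_RESPONSE)
for label, score in ranking:
    print(f"{label}: {score}")
```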