Exclude certain HTML tags from Calais processing

Hi,

I am using the tagging API at "https://api.thomsonreuters.com/permid/calais" and pass HTML content to it for processing. I want to exclude certain parts of the HTML from being processed: for example, tags with the class "robots-nocontent", hidden elements, or elements marked in some other way on the tag itself.

How can I achieve this?

Best Answer

  • Eran S.

    HTML content can be tagged using the following header: Content-Type: text/html

    If there are sections that you do not want tagged, remove them from the HTML before submitting the API request.
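As a rough illustration of that pre-processing step, here is a minimal sketch using only the Python standard library. It drops every element that carries the class "robots-nocontent" or a `hidden` attribute, together with everything nested inside it, and then builds (but does not send) a POST request with the Content-Type header mentioned above. The names `ExcludingFilter`, `strip_excluded` and `build_request` are hypothetical helpers made up for this example, not part of the Calais API. The filter assumes reasonably well-nested markup (every non-void start tag has a matching end tag), and any authentication headers the service requires are left out and would need to be added per its documentation.

```python
from html.parser import HTMLParser
from urllib.request import Request

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}

CALAIS_URL = "https://api.thomsonreuters.com/permid/calais"  # URL from the question


class ExcludingFilter(HTMLParser):
    """Re-emits HTML while skipping excluded elements and their contents."""

    def __init__(self, excluded_class="robots-nocontent"):
        super().__init__(convert_charrefs=False)
        self.excluded_class = excluded_class
        self.out = []
        self.skip_depth = 0  # > 0 while inside an excluded element

    def _is_excluded(self, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        return self.excluded_class in classes or "hidden" in attrs

    def handle_starttag(self, tag, attrs):
        if self.skip_depth:
            if tag not in VOID_TAGS:
                self.skip_depth += 1
        elif self._is_excluded(attrs):
            if tag not in VOID_TAGS:
                self.skip_depth = 1
        else:
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> never change the nesting depth.
        if not self.skip_depth and not self._is_excluded(attrs):
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_depth:
            if tag not in VOID_TAGS:
                self.skip_depth -= 1
        else:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.out.append(f"&#{name};")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def strip_excluded(html, excluded_class="robots-nocontent"):
    """Return html without the excluded elements (comments are dropped too)."""
    parser = ExcludingFilter(excluded_class)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


def build_request(html):
    """Build the POST request; authentication headers are intentionally omitted."""
    return Request(CALAIS_URL, data=html.encode("utf-8"),
                   headers={"Content-Type": "text/html"}, method="POST")


sample = ('<p>Keep this.</p>'
          '<div class="robots-nocontent"><p>Drop <b>this</b>.</p></div>'
          '<span hidden>Drop too</span>'
          '<p>Keep &amp; this.</p>')
cleaned = strip_excluded(sample)
print(cleaned)  # <p>Keep this.</p><p>Keep &amp; this.</p>
request = build_request(cleaned)
```

For real-world pages with omitted closing tags or otherwise messy markup, a tolerant HTML parsing library would likely be a safer choice than this depth counter; the idea of filtering before submission stays the same.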