Release Notes

  • NEW Release

    June 14, 2017

    Transliteration for Arabizi

    • New Endpoint: Transliteration We've added a transliteration endpoint that converts between Arabic written in ASCII, also called Romanized Arabic chat or Arabizi, and native Arabic script.

    Arabic Sentiment Analysis

    • Beta Arabic Support for /sentiment We now return document-level and entity-level sentiment analysis results for Arabic language input.

    Relationship Extraction

    • Personal pronoun resolution for /relationships Building on the pronoun resolution capabilities of our /entities endpoint, pronouns which are resolved to named entities can now be arguments in relationships.


    • Improved Confidence Scoring: Confidence score calculation is improved to correlate well with precision and may be used for thresholding and removal of false positives.


    • New support for emoticons, emoji, @mentions, hashtags, URLs, and email addresses These special characters and character combinations are now kept together as a single token in all languages, greatly improving the accuracy of analysis further downstream.

    Morphological Analysis

    • Improved accuracy for English and Spanish For this release, we updated our English and Spanish dictionaries. We also introduced new, more advanced disambiguation models for these languages, which help Rosette to correctly determine a given word’s part of speech. For example, words like “object” can be either a noun (“this is an object”) or a verb (“I object!”).

    • Lemmatization and normalization of emoticons, emoji, @mentions, hashtags, URLs, and email addresses Rosette now normalizes and lemmatizes these special characters and character combinations to streamline analysis.

    • Improved decompounding for Dutch Dutch language text is now decompounded more accurately, Dutch text is now decompounded more accurately, producing better tokens for search enhancement and other applications.

  • NEW Release

    March 23, 2017

    Relationship Extraction

    • Improved Accuracy of Corporate Relationships: Improvements made to the identification of relationships between corporations. The relationships involved are: ORG-SUBSIDIARY-OF, ORG-COLLABORATORS, ORG-ACQUIRED-BY and ORG-PROVIDER-TO.

    • Removed the ORG-PARTNERSHIPS Relationship: The ORG-PARTNERSHIPS relationship is now subsumed under ORG-COLLABORATORS and is no longer extracted as an independent relationship.

    Entity Extraction and Linking

    • Improved Linking Accuracy via Inclusion of New Context Features: The statistical model for entity linking includes features that measure the vector space similarity between an entity context and the Wikipedia contexts of its potential linking targets. The new features result in higher F-Scores across all supported languages.

    • Entity Linking in Japanese, Chinese and Spanish: Entity linking to Wikidata with QIDs for Japanese, Chinese and Spanish text is supported.

    • Removed Long Text Linking: Entity linking to Wikidata (with QIDs) for long texts is removed, which, as a result, removed entity linking capabilities in Arabic.

    Text Embedding

    • Vector dimension reduced from 512 to 300: We are able to produce smaller vectors that are more efficient and memory friendly without sacrificing overall speed or accuracy.

    • Improved Speed and Accuracy: A number of speed enhancements have been made along with much larger vocabularies to increase accuracy.

    Language Identification

    • Improved Accuracy on Texts with Mixed Scripts: A script specific model is now selected based on the weighted frequency of the different scripts in the input.

    Name Matching

    • Japanese Improvements: Rosette API now has better support for Japanese name matching. This includes the new use of word embeddings, which are used to match words with similar semantic meaning, as well as improved Japanese name segmentation.
  • NEW Functionality

    January 10, 2017

    Targeted Relationship Extraction

    • New Endpoint Functionality: The /relationships endpoint now returns targeted relationships, as opposed to the former open relationships, as its default extracted relationships. Targeted relationships are specifically between two entities, and are labeled by a certain relationship type. You can see the former open relationships by setting the option of "discoveryMode" to "true". Read more and try it out!

    /entities/linked REMOVED

    • Removed Deprecated Endpoint: The /entities/linked endpoint, previously deprecated, is now completely removed. All functionality is available through the /entities endpoint. You will receive a 404 when calling /entities/linked.

    Entity Extraction

    • Social Media Linking in Japanese and Chinese: Our fast short text entity linker to Wikidata is now available for Japanese and Chinese.
    • Removal of long text entity linking: Our long text entity linker has been replaced by our fast short string entity linker. You will now see entity linking results from our short string linker by default. This removes linking support for Arabic.
    • Additional Language Support: The entity extractor now supports Vietnamese.

    CJK Support for Names

    • Name Translation and Similarity CJK Improvements: The /name-similarity and /name-translation endpoints now support matching and translating between Japanese-Chinese, Japanese-Korean, and Korean-Chinese. Japanese accuracy was improved significantly.

    Text Embeddings Improvement

    • Improved Accuracy for Document-level Embeddings: We made some improvements to our algorithm for calculating text embeddings across multi-word input, so you should see more accurate results for document-level vectors.

    Japanese Sentiment Analysis

    • Beta Japanese Support for /sentiment: We now return document-level and entity-level sentiment analysis results for Japanese language input.
  • NEW Release

    October 27, 2016

    Syntactic Dependencies (NEW)

    • New Endpoint: We've added a syntactic dependencies endpoint that returns the parse tree of the input text as a list of labeled directed links between tokens, as well as the list of tokens in the input sentence.

    Relationship Extraction

    • Entities Linked to Wikidata: Where available, Rosette will now link entities extracted within relationships to Wikidata. You'll see this information returned as a QID in the argument ID.
    • Modality Returned: We've also added a "modality" field to Rosette's Relationship Extraction. Modality is the semantic context of the possibility or necessity of the relationship; the values can be “assertion”, “negation”, “uncertainty”, “opinion”, or “question”.

    Starter Plan (NEW)

    • New $99 API Plan: For a limited time, we’re offering a special Starter plan. $99/month gets you 40,000 Rosette API calls. Want to dive deep into Rosette but don’t need a whole 100,000 calls? This plan is for you.
  • NEW Endpoint

    September 15, 2016

    Text Embedding (NEW)

    • New Endpoint: We added a text embedding endpoint that returns a single vector of floating point numbers that represents the document or word in the semantic space. Read more and try it out!

    Sentiment Analysis

    • Additional entities: We changed the /sentiment endpoint to return the sentiment of all entities discovered by Rosette, including Person, Location, Organization, Date, Time, and more entity types.

    Entity Extraction

    • Turn off entity linking: We added an option to disable entity linking in order to improve the call speed. Add "options": {"linkEntities": "false"} to your /entities call. Rosette returns a list of the entities with a temporary ID (TID).

    Global changes

    • Concurrency header: We added the X-RosetteAPI-Concurrency header to return the number of concurrent calls allowed on your plan. If you are receiving 429 errors, Too Many Requests, then Contact us for greater concurrency.

  • Entities Unified

    July 21, 2016

    Global changes

    • Input genre: The genre field is available for /entities and /entities/linked to indicate the input is from social media. Specifying genre=social-media does not affect the output of the other endpoints.
      Applies to: /entities, /entities/linked, /relationships, /categories, /sentiment, /language, /morphology, /tokens, /sentences.

    Entity Linking

    Entity Extraction

    • Entity endpoints unified: We combined the /entities and /entities/linked endpoints into one endpoint, /entities. Rosette now returns the entity mentions and the entityId, if available. The entityId replaced the indocChainId.
      The output of /sentiment has not changed.

    • Entities Linked deprecated: We deprecated the /entities/linked endpoint. It is still available, but we recommend that you adapt your applications to the new /entities endpoint.

    • Additional entities: Rosette now extracts more entity types: Date, Time, Longitude and Latitude, and Distance.

    • Japanese entityId: We added support for linking entities in Japanese (jpn) text to their entityId.

    • Spanish social media: We added support for extracting entities from social-media in Spanish language documents, using the genre field.

    • Malay entities: We added support for extracting entities in Malay (msa).

    Error code

    • 409 Error: We added the 409 error code for when the binding version is out of date. If you receive this error, update your binding to the most recent version.

    Sentiment Analysis

    • Spanish support: We added support for analyzing the sentiment of Spanish language documents.
  • Ruby and R bindings

    June 20, 2016


    • Ruby: We added the Ruby binding to the gray column to the right and on Github. There is a Ruby gem available as well.

    • R: We added the R binding to the gray column to the right and on Github.

    • cURL examples: We changed the shell examples in the gray column on Features and Functions to be cURL code examples.

  • Social media

    May 10, 2016

    Entity Linking

    • Social input: We added a request field, "genre": "social-media", to speed up and improve the accuracy of linking Person, Location, Organization and Product entities in social media posts. English input only.
  • Rosette API V1

    March 29, 2016

    Global changes

      • Language used: We added a Response Header, X-RosetteAPI-ProcessedLanguage, to return the language used by Rosette for processing the call.
        Applies to: /entities, /entities/linked, /relationships, /categories, /sentiment, /language, /morphology, /tokens, /sentences

      • requestId moved: We moved the requestId object from the JSON response body to the Response Header as X-RosetteAPI-Request-Id.
        Applies to: /entities, /entities/linked, /relationships, /categories, /sentiment, /language, /morphology, /tokens, /sentences, /name-translation, /name-similarity

      • Rosette API Key: We changed the user_key header’s name to X-RosetteAPI-Key. The user_key header is deprecated.
        Applies to: /entities, /entities/linked, /relationships, /categories, /sentiment, /language, /morphology, /tokens, /sentences, /name-translation, /name-similarity

      • unit parameter removed: We removed the optional unit request parameter. All input will be handled as a doc.
        Applies to: /entities, /entities/linked, /relationships, /sentiment, /morphology

      • Base64: We removed support for Base64 encoding. You can submit binary files as a multipart/form-data call type.
        Applies to: /entities, /entities/linked, /relationships, /categories, /sentiment, /language, /morphology, /tokens, /sentences

      Entity Extraction

      • Confidence removed: The confidence value has been removed from the response object.

      Relationship Extraction

      • Accuracy mode: We removed the optional accuracy mode. All input will be processed with the precision accuracy mode, so Rosette will return a precise list of accurate relationships.
      • Explanations removed: The explanations value has been removed from the response object.


      • Explanations removed: The explanations value has been removed from the response object.

      Sentiment Analysis

      • Entity sentiment: We added support for entity-level sentiment analysis. The JSON response for the /sentiment endpoint now includes two objects – document and entities. See the interactive documentation for examples of this new response.

      • Neutral result: We added a neutral label for documents and entities with a neutral sentiment.

      • Short strings: Rosette will automatically process short and long content with our proprietary algorithm for sentiment analysis.

      • Explanations removed: The explanations value has been removed from the response object.

      Morphological Analysis

      • Added language support: We added language support for Dari, Persian, Urdu, and Western Farsi for Parts-of-Speech Tags.

      • Universal POS Tags: We return Universal Parts-of-Speech Tags for all supported languages.

      • Tokens list: Rosette returns parallel lists of tokens, lemmas, compound components, parts-of-speech tags, and Han-readings. If a token does not have a lemma, compound component, POS tag, or Han-reading, or if the language is not supported, then Rosette will return “null” in that list.

      Name Translation

      • Renamed to /name-translation: To clarify the endpoint’s function, we renamed /translated-name to /name-translation. /translated-name is no longer available.

      • Removed result layer: Within the response to /name-translation and /name-similarity endpoints, we removed the result layer so the results are in the response object.
        Also applies to: /name-similarity

      • TargetScheme requires uppercase: For advanced users who would like to specify a targetScheme, the scheme must be submitted in uppercase.

      Name Matching

      • Renamed to /name-similarity: To clarify the endpoint’s function, we renamed /matched-name to /name-similarity. /matched-name is no longer available.

  • Limited Public Beta Release

    November 16, 2015

    Limited public release delivering almost all of the Rosette SDK functionality via RESTful calls, with these additional capabilities:

    • Relationship Extraction - for English

    Additional Bindings:

    • Node.js

  • Beta Private Release

    July 13, 2015

    Initial private release delivering almost all of the Rosette SDK functionality via RESTful calls, with these language capabilities:

    • Text Extraction from web pages
    • Language Identification - includes high accuracy for short strings
    • Morphological Analysis - for 20+ languages
    • Entity Extraction - for 17 languages, inc. Indonesian
    • Entity Linking against Wikidata - for English, Spanish and Chinese
    • Sentiment Scoring - for English
    • Categorization - for English
    • Name Translation - between English and 11 languages
    • Name Matching - including entity type and language guessing

    Additional Bindings:

    • C#
    • Java

  • Alpha Private Release

    March 15, 2015

    Private release to a select audience delivering parts of the Rosette SDK functionality via RESTful calls, including:

    • Text Extraction from web pages
    • Language Identification
    • Morphological Analysis, (tokens, lemmas and Universal POS tags)
    • Entity Extraction
    • Entity Linking through Wikidata
    • Sentiment Scoring
    • Categorization (using QAG Taxonomy)
    • Name Translation - including language of origin guessing


    • cURL
    • PHP
    • Python