Assembly: Org.Carrot2.Core.NET (in Org.Carrot2.Core.NET.dll) Version: 22.214.171.124
public sealed class Document
Document content varies with the data available: it may be the document's abstract, its first few paragraphs, a contextual snippet returned by a search engine, or the full document text. Note that full-text clustering may take significantly longer than, for example, snippet- or abstract-based clustering, and does not always produce better results.
Providing document titles is optional, but, if available, highly recommended. Clustering algorithms usually give more weight to document titles to improve the clustering quality.
Optionally, a URL pointing to the source of the document and the document's language can also be provided as hints for clustering algorithms.
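The fields described above can be pictured with a small sketch. The `DocumentSketch` type, its property names, and its constructor signature are hypothetical and exist only for illustration; they are not the members of the Carrot2 `Document` class, whose actual constructors should be checked in the member list for this class. The example URL is also a placeholder.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for illustration only; NOT the Carrot2 Document class.
public sealed class DocumentSketch
{
    public string Content { get; }   // abstract, first paragraphs, snippet, or full text
    public string Title { get; }     // optional, but recommended when available
    public string Url { get; }       // optional hint: source of the document
    public string Language { get; }  // optional hint: e.g. "en" (assumed format)

    public DocumentSketch(string content, string title = null,
                          string url = null, string language = null)
    {
        Content = content;
        Title = title;
        Url = url;
        Language = language;
    }
}

public static class Program
{
    public static void Main()
    {
        var documents = new List<DocumentSketch>
        {
            // Snippet-only document: no title, URL, or language hint.
            new DocumentSketch("Clustering groups similar search results together."),

            // Fully described document: title plus both optional hints.
            new DocumentSketch(
                content: "A short abstract about search results clustering.",
                title: "Search Results Clustering",
                url: "http://example.com/clustering",  // placeholder URL
                language: "en"),
        };

        foreach (var doc in documents)
        {
            // Fall back to a marker when the optional title is missing.
            Console.WriteLine((doc.Title ?? "(untitled)") + ": " + doc.Content);
        }
    }
}
```

Making the title, URL, and language optional parameters mirrors the description above: only the content is central, the title is recommended when available, and the other two values serve as hints.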