Structure
Project descriptor is a JSON file that defines all information required to index and analyze a data set.
The general structure of Lingo4G project descriptor JSON is the following:
{
// General project settings.
"id": "project-id",
"directories": [ ... ],
// Document source specification.
"source": { ... },
// Index fields; their types, attached analyzers, etc.
"fields": [ ... ],
// Tokenization chains (analyzers) and query parsing.
"analyzers": [ ... ],
"queryParsers": [ ... ],
// Project dictionaries.
"dictionaries": [ ... ],
// Indexing settings.
"indexer": { ... },
// Analysis settings (API v1).
"analysis": { ... },
// Analysis settings (API v2).
"analysis_v2": { ... }
}
Most of the properties and configuration blocks in the descriptor file are optional. You can use the show command to display the fully resolved project descriptor (with defaults filled in).
Click the property names in the outline above to follow to the reference page of the specific project descriptor section.