Grammar

An anti-fabrication pipeline turns the Brihaddeśī into three layered artefacts: the named bricks the text talks about, the operative rules attached to them, and an executable formal grammar synthesised per domain.

Conceptsthe bricks
Ruleswhat is said about them
Formal grammarexecutable synthesis

Concepts layer 1 · step 5

Mentions of the same term across pages are clustered into a single concept. Each concept carries its raw mentions, its page footprint, its co-occurrence degree.

1,763concepts
122in Sanskrit dictionary
Browse all concepts →

Generative rules layer 2 · step 6b

For each concept, the operative rules the text asserts. Every rule is backed by ≥1 verbatim quote (anti-fabrication). 10 categories: definition, structural, relation, enumeration, assignment, derivation, classification, composition, validation, transformation.

1,087rules
812concepts covered
What each category means
definitionStates what a concept *is* (e.g. « nyāsa is the svara on which a gīta ends »).
structuralDescribes the internal structure of a thing (e.g. « ṣaḍja-grāma has seven svaras »).
relationAsserts a relation between two concepts (e.g. « vādī and saṁvādī are consonant »).
enumerationLists all members of a closed set (e.g. « the twenty-two śrutis are… »).
assignmentMaps one entity to another (e.g. « this rāga belongs to that grāma »).
derivationSays how something is obtained from something else (e.g. « mūrchanā is generated by cyclic rotation of the grāma »).
classificationStates that X is a kind of Y (taxonomic membership).
compositionDescribes how multiple parts combine into a whole (e.g. « varṇailā is composed of N elements »).
validationForbids or requires a configuration (e.g. « if X then not Y »).
transformationModifies one form into another (e.g. « ṣāḍava is obtained by omitting one svara »).
Browse all rules →

Structures & tables side-channel · step 4e

Visual structures lifted from the scanned pages (tables, ordered sequences, diagrams) and cited verbatim by the rules. These are the *evidence* behind every "the text says N of X" claim. Five kinds — see below.

364structures
119pages with structures
2pages cited by grammar
What each kind means
tableRectangular grid (rows × columns) such as the mūrchanā-maṇḍala.
shifted_sequence_tableSequence repeated with a row-by-row offset — typical of mūrchanā rotations and tāna permutations.
ordered_sequence_listOrdered enumeration without column structure (named svaras, mūrchanā list, etc.).
diagramVisual schema (circular gamut, tree, dependency graph) — captured as a region, not parsed cell-by-cell.
phraseNamed definitional or canonical phrase pulled verbatim from a visual emphasis box.
Browse all structures →

Formal grammar layer 3 · step 6c

Per-domain executable Python modules. Domains derived by Leiden community detection on the concept-edge graph (6c.1); types by formal concept analysis (6c.2); operations + constraints synthesised per domain (6c.3); assembled into a unified package (6c.4).

What types, operations, constants and constraints mean
typesClosed enumerations and structured records — what kinds of things the domain talks about (e.g. Svara, Grāma, Jāti).
constantsNamed values pinned directly to source quotes (numbers, ordered lists, named entities such as the 22 śrutis or the 14 mūrchanās).
operationsPure functions that take types as input and return new values (e.g. derive ṣāḍava form by omitting a svara). Each one cites the rule(s) that justify it.
constraintsValidators that raise an error on configurations the rules forbid (e.g. « niṣāda cannot be aṁśa in Ṣaḍjamadhyamā »). Each one cites the rule it enforces.
11modules
107types
86operations
58constraints
Browse formal grammar →  ·  Unified package (6c.4) →

REST API programmatic access

JSON-over-HTTP exposure of every layer of this grammar : concepts, rules, structures, formal-grammar manifests, domain handbooks. Designed so a downstream tool (a notebook, a paper's analysis pipeline, another encyclopedia) can consume the grammar without scraping the HTML pages.

What each endpoint exposes
/api/v1/sourcesList of source treatises currently ingested (Brihaddeśī as the pilot).
/api/v1/conceptsPaginated list of concepts with all their metadata — same data that backs /grammar/concepts.
/api/v1/concepts/{cluster_id}A single concept with its affirmations, attached rules and edges.
/api/v1/rulesPaginated list of generative rules — filter by category, concept, source.
/api/v1/rules/{rule_id}A single rule with body, evidence quotes, attached concept.
/api/openapi.jsonFull OpenAPI 3.1 specification of the API — drop into Swagger / Postman / generated client.
Open the API explorer →  ·  OpenAPI spec (JSON) →