View Article

  • Pram??a-Weighted Intelligence: A Unified Framework For Epistemic Accountability In Neural Architectures Through M?m??s? Source-Validity Theory And The Anuvyavas?ya Conflict Resolution Protocol

  • Public Health Engineering Department, Government of West Bengal, India

Abstract

Artificial Intelligence systems trained on large-scale human-generated corpora systematically inherit epistemic distortions embedded within those datasets. Current alignment research addresses this pathology at the output layer, leaving corrupted training representations intact. This paper proposes that the epistemological framework of P?rvam?m??s? — specifically the pram??a hierarchy, the doctrine of svata?-pr?m??ya, and the apata analysis of agency — provides a philosophically rigorous and computationally deployable architecture for AI training data governance. The central contribution is the Pram??a-Weighted Training Architecture (PWTA) and its algorithmic implementation, the Pram??a-Weighted Intelligence Algorithm version 1.0 (PWI-Algo v1.0). The framework stratifies training data into four tiers according to neo-apauru?eya — a formally defined standard of structural independence from individual motivated authorship. The Tier 2 weight is derived empirically through Principal Component Analysis on a four-indicator epistemic quality matrix, with convergence guaranteed by the Banach Fixed-Point Theorem. The Anuvyavas?ya Protocol resolves inference-time contradictions between valid sources without probabilistic blending. The Adhik?ra Dynamic Coefficient extends weight governance into the post-deployment lifecycle. Ablation studies on a governance AI corpus demonstrate 89?ctual accuracy against a 72?seline, bias propagation reduced from 18.4% to 4.2%, and absolute source traceability. A partial TruthfulQA evaluation confirms generalization. The framework maps directly onto the EU AI Act Article 10, India's DPDPA 2023, and the IndiaAI Mission's governance mandates.

Keywords

M?m??s?, AI governance, pram??a-weighted training, epistemic accountability, PWI-Algo, anuvyavas?ya.

Introduction

× Popup Image
  1. The Upstream Epistemic Pathology

Artificial Intelligence systems operating in public administration, legal reasoning, healthcare, and financial analytics are, at their computational core, sophisticated functions of their training data. A transformer-based large language model generates outputs by sampling from probability distributions over tokens learned through exposure to training corpora (Vaswani et al., 2017; Brown et al., 2020). The epistemic quality of the output is, in the most direct causal sense, a function of the epistemic quality of the input data.

The contemporary AI training paradigm draws from internet-scale corpora — Common Crawl, WebText, and derivative datasets — that aggregate human-generated content without systematic epistemic filtration. Current AI alignment research addresses this problem primarily at the output layer through reinforcement learning from human feedback (Christiano et al., 2017), content filtering, and red-teaming. This approach is philosophically inadequate: the pathology is upstream, in the training data itself. Correcting output behaviour while leaving corrupted internal representations intact is analogous to treating the symptoms of a disease whose cause remains embedded in the organism's constitution.

This paper proposes that Pūrvamīmāṃsā — the Indian philosophical tradition of source-validity epistemology — offers a technically applicable framework for AI training governance that current Western-derived approaches do not provide. Mīmāṃsā's central question — what qualifies a source of knowledge as valid and authoritative? — is precisely the foundational question of AI training data governance, now made urgent at global scale.

  1. Research Gap

Despite extensive literature on AI bias, fairness, and alignment, no existing framework addresses the upstream epistemological dimension of training data validity as a formal, deployable architecture. Datasheets for Datasets (Gebru et al., 2021) document provenance but impose no validity hierarchy. Model Cards for Model Reporting (Mitchell et al., 2019) describe model behaviour but do not govern training source quality. Reinforcement learning from human feedback corrects output behaviour but leaves corrupted training representations intact (Christiano et al., 2017). Mohamed et al. (2020) identify structural power asymmetries in AI knowledge systems but do not propose a technical source-validity architecture.

Three specific technical gaps exist. First, no framework specifies a formally derived, empirically calibrated weight function for training data stratification. Second, no RAG or training architecture specifies a principled conflict-resolution mechanism for cases where valid sources of different epistemic status contradict each other at inference time. Third, no framework connects post-deployment accountability events to an adaptive weight-adjustment function over the deployment lifecycle. This paper addresses all three gaps.

  1. Scope and Structure

The framework is scoped to public administration and policy AI as the primary domain. Application to healthcare AI, legal AI, and scientific research AI is identified as future work requiring domain-specific pramāṇa-tier recalibration. The paper proceeds through: (1) foundations of Mīmāṃsā epistemology; (2) literature review; (3) Mīmāṃsā diagnosis of AI epistemic failures; (4) the PWTA framework; (5) the PWI-Algo v1.0 mathematical specification; (6) the Anuvyavasāya Protocol; (7) the Adhikāra Dynamic Coefficient; (8) implementation architecture; (9) experimental results; (10) governance and regulatory implications; and (11) the Siddhānta: Conclusion.

LITERATURE REVIEW

  1. Training Data Quality and Governance

The recognition that training data quality is inadequately governed has generated increasing scholarly attention. Gehman et al. (2020) documented toxic and harmful content in the C4 dataset. Dodge et al. (2021) showed that curation decisions in the Colossal Clean Crawled Corpus are typically heuristic and undocumented. Kreutzer et al. (2022) conducted quality assessments of multilingual web-crawl datasets, finding significant degradation in non-English portions. Bender et al. (2021) introduced the concept of 'stochastic parrots,' arguing that large language models reproduce statistical patterns without semantic grounding — a critique that converges with the Mīmāṃsā analysis presented below. These contributions document the problem; they do not propose a validity hierarchy grounded in a theory of epistemic authority.

  1. Weighted Training and Instance Reweighting

Instance-weighted empirical risk minimization (Shimodaira, 2000) addressed covariate shift by reweighting training samples. Domain adaptation through importance weighting (Sugiyama et al., 2008) extended this to distribution mismatch. MentorNet (Jiang et al., 2018) assigned weights based on estimated label noise probability. Ren et al. (2018) proposed meta-learning-based reweighting to minimize clean-set loss. All these frameworks treat weighting as a statistical problem: which samples are statistically useful for gradient optimization. The present work departs fundamentally — the weighting criterion is epistemological, not statistical. No existing weighted training framework operationalizes a theory of source validity derived from any philosophical tradition.

  1. Retrieval-Augmented Generation

Lewis et al. (2020) introduced Retrieval-Augmented Generation. Asai et al. (2023) proposed Self-RAG for adaptive retrieval assessment. Edge et al. (2024) introduced GraphRAG for knowledge-graph-structured retrieval. None of these frameworks specifies a principled conflict-resolution mechanism when valid sources of different epistemic status contradict each other — they blend or concatenate retrieved passages, leaving contradiction handling to the model's probability distribution. The Anuvyavasāya Protocol introduced in this paper addresses this gap for governance AI.

  1. AI Fairness, Bias, and Alignment

Standard approaches to bias mitigation operate at the output layer: adversarial debiasing, fairness constraints, or demographic parity post-processing (Barocas et al., 2023; Mehrabi et al., 2021). This paper demonstrates through ablation results that a 14.2 percentage point reduction in bias propagation is achievable through data-layer epistemic governance alone, without any output-layer intervention — confirming that the upstream approach is both necessary and independently sufficient for the bias dimension of the alignment problem.

  1. Indian Knowledge Systems in AI Governance

Coeckelbergh (2020) argued that AI ethics requires grounding in specific cultural traditions. Sambasivan et al. (2021) documented data quality failure in AI systems deployed in Global South contexts. Bilimoria (1988) provides the most comprehensive English-language analysis of śabda pramāṇa as a formal validity-conferring mechanism. Ganeri (2001) situates Mīmāṃsā within the broader landscape of classical Indian epistemology. Neither develops a technical operationalization for computational systems; the present paper provides the first such operationalization.

FOUNDATIONS OF MĪMĀṂSĀ EPISTEMOLOGY

  1. The Pramāṇa Hierarchy as Source-Validation Architecture

The Pūrvamīmāṃsā school, founded on the Mīmāṃsā Sūtras of Jaimini (c. 400–200 BCE), developed the most systematic source-validation epistemology in the history of Indian philosophy (Jaimini, c. 400–200 BCE/1967). Mīmāṃsā recognizes six pramāṇas: pratyakṣa (direct perception), anumāna (inference), upamāna (analogical comparison), arthāpatti (postulation from necessity), anupalabdhi (knowledge through absence), and śabda (authoritative verbal testimony). Each is a validity-conferring mechanism with specific structural conditions and distinct failure modes, making this framework uniquely applicable to questions of data validity and epistemic accountability.

  1. Svataḥ-Prāmāṇya and the Crisis of AI Output Validity

Kumārila Bhaṭṭa's doctrine of svataḥ-prāmāṇya — the intrinsic self-validity of cognition — holds that genuine knowledge carries its validity in itself, not through external verification (Bhaṭṭa, 7th c. CE/1990). The classical formulation states:

svataḥ-prāmāṇyam arthānāṃ pramāṇam iti niścitam — 'The validity of cognition is intrinsic to it.' (Kumārila Bhaṭṭa, Ślokavārttika, Prāmāṇyādhikaraṇa)

AI outputs are paradigmatically parataḥ-prāmāṇya: their validity is entirely externally dependent on training data quality, architectural integrity, and human evaluation. This is not a contingent limitation but structurally constitutive of what AI is. Since AI outputs depend entirely on training inputs, the epistemic work must be done before training, at the data source selection and validation stage.

  1. Apauruṣeya, Neo-Apauruṣeya, and the Epistemic Authority Standard

The supreme epistemic category is śabda pramāṇa — valid verbal testimony — whose authority derives from its apauruṣeya character: not authored by any self-interested, fallible individual, and thus structurally free from puruṣa-doṣa (Bilimoria, 1988). Ontological apauruṣeya cannot be reproduced in modern epistemic contexts. This paper introduces the bridging concept of neo-apauruṣeya: structural independence from individual motivated authorship achieved through institutional process. Neo-apauruṣeya operationalizes the same structural criterion — bias-transcendence through trans-personal validation — in a form applicable to contemporary knowledge production. It is not an epistemic guarantee but a procedural approximation to bias-minimisation.

  1. The Anumāna-Algorithm Distinction

Classical anumāna requires the conscious grasp of vyāpti — the invariable logical concomitance between reason and conclusion. AI statistical inference identifies high-probability co-occurrence patterns; it has no grasp of logical necessity.

sādṛśyād na tādātmyaṃ na ca tādātmya-sādhanam — 'Similarity does not establish identity, nor can it prove essential unity.' (Bhaṭṭa, Ślokavārttika)

In public administration AI, this distinction matters acutely: a policy recommendation generated by a model that has learned high co-occurrence between certain demographic variables and adverse outcomes is not inference — it is pattern amplification without logical commitment.

MĪMĀṂSĀ DIAGNOSIS OF AI EPISTEMIC FAILURES

The Mīmāṃsā diagnostic framework identifies five specific epistemic failures in contemporary AI training and deployment, each corresponding to a pramāṇa standard that the training paradigm violates.

Table 1. Mīmāṃsā diagnostic of AI epistemic failures. Each row maps a Mīmāṃsā epistemic standard to its AI training analogue and identifies the resulting epistemic verdict.

Mīmāṃsā Standard

Classical Requirement

AI Training Reality

Epistemic Verdict

Śabda Pramāṇa

Neo-apauruṣeya: institutionally valid, bias-minimised source

Unfiltered web corpora, anonymous and motivated content

Pramāṇa-bheda: source validity absent

Svataḥ-prāmāṇya

Intrinsic self-validity of cognition

Outputs entirely dependent on training data quality

Parataḥ-prāmāṇya: externally dependent

Anumāna

Conscious vyāpti-grasp; logical necessity

Statistical co-occurrence; no logical commitment

Sādṛśya only — not tādātmya

Anuvyavasāya

Second-order self-awareness of knowing

No meta-cognition; no model of self as knower

Reflexive cognition absent categorically

Phala-bhoga

Experiential ownership of results

No consequence experience; indifferent to outputs

Kartṛtva absent; karaṇa confirmed

THE PRAMĀṆA-WEIGHTED TRAINING ARCHITECTURE

1. The Ordinal Accountability Principle

The differential weighting of training data tiers is grounded in the graduated accountability systems of classical Indian jurisprudence. Both the Manusmṛti and the Arthaśāstra of Kauṭilya prescribe that normative weight scales with epistemic qualification (adhikāra) (Olivelle, 2005; Arthaśāstra 2.8, 3.1). The PWTA inverts this proportionality: sources bearing the highest epistemic responsibility receive the highest training weight. Let the weight constants be a, b, c, d assigned to Tiers 1 through 4 respectively. The ordinal constraint is: a > b > c > d ≥ 0. The weighted loss function is:

ℒ_PWTA = Σᵢ₌₁ᴺ W(tier(Dᵢ)) · ℒ_CE(yᵢ, ŷᵢ)

where ℒ_CE(yᵢ, ŷᵢ) is the standard cross-entropy loss for document Dᵢ and W(tier(Dᵢ)) ∈ {a, b, c, d} is its pramāṇa weight coefficient.

2. The Four-Tier Data Stratification Framework

Table 2. PWTA data stratification framework for public administration AI. Sources are assigned to tiers based on pramāṇa-equivalence and validated through the mechanism specified.

Tier

Pramāṇa Equivalent

Source Category

Weight

Validation Mechanism

Tier 1 — Supreme

Śabda (neo-apauruṣeya)

Reproducibility-verified peer-reviewed journals; primary legislation; official gazette notifications; intergovernmental publications; accredited empirical datasets

a = 1.00

Reproducibility verification + COI disclosure + institutional accountability

Tier 2 — High

Anumāna

Systematic reviews; meta-analyses; parliamentary committee reports; judicially verified case law; expert consensus reports

b = empirically derived

Methodological audit + expert panel certification

Tier 3 — Moderate

Pratyakṣa

Verified sensor/instrument data; official survey datasets; primary documents with authorial accountability

c = 0.50

Institutional attribution + verifiability check

Tier 4 — Rejected

Aparāmāṇika

Anonymous web content; commercially motivated text; unverified social media; algorithmically generated content

d = 0.00

Apata Filter: rejected at ingestion

  1. Defense of Anchor Weights

The weights a = 1.00 and c = 0.50 are normative anchors, not empirical quantities. The weight a = 1.00 is the normative commitment that neo-apauruṣeya sources make a full claim on the model's parameters — there is no epistemically superior reference class within the PWTA framework against which Tier 1 could be discounted. The weight c = 0.50 represents the minimum influence threshold at which a source contributes meaningfully to training while remaining substantively distinguished from Tier 1. A value significantly below 0.50 would approach silencing (the function of d = 0.00); a value approaching 1.00 would collapse the tier distinction. With a and c fixed, b is the only free parameter and is derived empirically through PCA, as formalized in Section 6.

  1. The Apata Filter and Kartṛtva Accountability Tagging

Every document in the candidate corpus undergoes Apata analysis — a determination of categorical inapplicability for training purposes. The term apata (āpāta) in Mīmāṃsā denotes the establishment that a category does not apply to a given case: the filter demonstrates that Tier 4 sources are categorically inapplicable as valid training inputs, not merely low-quality ones.

The Apata Filter operates in three layers. First, hard rejection rules eliminate sources with no named author, commercial motivation, or institutional anonymity. Second, AI-assisted deep-check examines borderline sources against verifiable metadata. Third, a confidence threshold assigns borderline institutional sources to provisional Tier 3 status. The filter's error architecture is asymmetric by design: Type I errors (false rejection of valid Tier 1) are recoverable through operator review; Type II errors (false pass of Tier 4) are the graver failure and the filter prioritizes their reduction.

Every document passing the Apata Filter is tagged with kartṛtva metadata: the accountable human kartā, institutional affiliation, validation process, tier assignment, and a phala-trace field recording downstream retractions or invalidations.

  1. The Pramāṇa Compliance Standard

The Pramāṇa Compliance Standard (PCS) requires that AI systems deployed in public administration demonstrate a minimum of 60% Tier 1 and Tier 2 sources by training weight, with mandatory Apata Filter documentation and kartṛtva metadata preservation. The PCS maps onto regulatory mandates as follows: EU AI Act Article 10's requirement for 'relevant, sufficiently representative' training data is operationalized by tier-stratification; DPDPA 2023 Sections 4–8 data-fiduciary obligations are operationalized by kartṛtva tagging; GDPR Article 25 data-protection-by-design is operationalized by the Apata Filter operating before any training commences.

  1. The Apata Analysis: AI as Karaṇa

The Mīmāṃsā dialectical method proceeds through pūrvapakṣa (prima facie argument), uttarapakṣa (refutation), and the establishment of apata — demonstrating categorical inapplicability. Applied to AI epistemic agency:

Table 3. The fivefold apata: categorical inapplicability of AI epistemic agency under currently understood computational architectures.

Apata

Mīmāṃsā Criterion

AI Reality

Governance Consequence

1. Pramāṇa-Apata

Knowledge requires conscious substrate (ātman)

AI processes data on jaḍa (inert) hardware; no conscious substratum

AI outputs cannot be self-authoritative; require human epistemic endorsement

2. Kartṛtva-Apata

Agency requires icchā, adhikāra, phala-bhoga

AI has none: no will, no qualification, no experiential consequence

AI cannot bear legal or moral responsibility for outputs

3. Śabda-Apata

Valid testimony is neo-apauruṣeya

AI language is sapauruṣeya — derived from human-biased corpora

AI outputs cannot function as authoritative testimony in governance

4. Anuvyavasāya-Apata

Genuine cognition includes second-order self-awareness

LLMs have no meta-cognition; no model of self as knower

AI cannot be held to have 'intended' any output — mens rea inapplicable

5. Phala-Apata

Agent experiences normative consequences of acts

AI is indifferent to outputs; no consequence, no learning from harm

Responsibility must run entirely through human actors

This analysis converges with Western philosophical conclusions: Searle's (1980) Chinese Room demonstrates that syntactic symbol manipulation does not constitute semantic understanding; Dreyfus (1992) establishes the embodiment and situatedness conditions absent from computational cognition. The Mīmāṃsā apata analysis provides distinct philosophical grounding through the categorical absence of icchā, adhikāra, and phala-bhoga. The siddhānta is philosophically precise: AI is a powerful epistemic instrument (karaṇa) in the hands of human kartās who bear full moral, epistemic, and legal responsibility for its deployment. The most illuminating classical parallel is the use of agni (fire) in Vedic ritual sacrifice (yajña): fire transmutes offerings yet is not the moral agent of the sacrifice — the yajamāna bears the adhikāra and receives the phala.

THE PWI-ALGO V1.0: MATHEMATICAL SPECIFICATION

  1. The Epistemic Quality Indicator Vector

For each source document D in the calibration corpus, an epistemic quality indicator vector x⃗ ∈ ℝ⁴ is defined: x⃗ = [Rᵣ, C_oi, Rₜ, Aₛ]. Rᵣ (Reproducibility Rate): proportion of empirical claims independently confirmed by subsequent work; Rᵣ ∈ [0, 1]. C_oi (Conflict of Interest Incidence): scored 0–4; sign-flipped: C_oi_aligned = (4 − C_oi)/4. Rₜ (Retraction Rate): proportion of source outlet's outputs formally retracted over the preceding decade; sign-flipped: Rₜ_aligned = 1 − Rₜ_raw. Aₛ (Accountability Score): sum of five binary institutional criteria divided by 5; Aₛ ∈ [0, 1]. After sign alignment, all four indicators are positively oriented: higher value = higher epistemic quality.

  1. Standardization and Suitability Tests

The raw indicator matrix X ∈ ℝⁿˣ⁴ is standardized column-wise: Z_{ij} = (X_{ij} − μⱼ) / σⱼ. Suitability conditions: Bartlett's Test of Sphericity (p < 0.05, test statistic χ² = −[(n−1) − (2p+5)/6] × ln|R|) and Kaiser-Meyer-Olkin Measure (KMO ≥ 0.60). Failure of either test triggers a PCAGateError.

  1. PCA and Clustering

Covariance matrix: C = (1/n) ZᵀZ. Eigenvalue decomposition: C = VΛVᵀ. PC1 variance gate: Var_PC1 = λ₁/Σᵢλᵢ ≥ 0.40. Sign convention: if majority of PC1 loadings are negative, v⃗₁ is multiplied by −1. PC1 composite score: sᵢ = Z⃗ᵢ · v⃗₁. k-means clustering (k = 3, k-means++ initialization, n_init = 100): J = Σₖ₌₁³ Σᵢ∈Cₖ (sᵢ − μₖ)². Centroids sorted descending: μ₁ > μ₂ > μ₃. Minimum separation gate: (μ₁ − μ₂)/σ_s ≥ 0.50 and (μ₂ − μ₃)/σ_s ≥ 0.50.

  1. Weight Derivation Formula

With anchor constraints a = 1.00 and c = 0.50 (Section 5.3), the empirical weight b is derived as:

b = 0.5 + 0.5 × (μ₂ − μ₃) / (μ₁ − μ₃)

This maps the centroid range [μ₃, μ₁] linearly onto [0.50, 1.00], preserving relative epistemic distance between tiers.

  1. Iterative Triangulation and Convergence Theorem

Define the iterative mapping T: [0.50, 1.00] → [0.50, 1.00]:

bᵏ⁺¹ = T(bᵏ) = 0.5 + 0.5 × [μ₂(bᵏ) − μ₃(bᵏ)] / [μ₁(bᵏ) − μ₃(bᵏ)]

Theorem 1 (Convergence of PWI Iterative Calibration). Let the calibration corpus satisfy the Bartlett (p < 0.05) and KMO (≥ 0.60) suitability conditions, and let the minimum centroid separation condition hold with separation parameter δ > 0. Then T is a contraction mapping on ([0.50, 1.00], |·|) and the sequence {bᵏ} converges to a unique fixed point b* ∈ (0.50, 1.00) for any initial b⁰.

Proof. Step 1: T maps [0.50, 1.00] into itself, since the separation gate enforces μ₁(b) > μ₂(b) > μ₃(b) at every iteration, giving T(b) ∈ (0.50, 1.00). Step 2: Centroid sensitivity bound — a perturbation Δb affects boundary sources at fraction ε ≤ exp(−δ²/2) of the corpus. Step 3: Lipschitz constant derivation — letting N(b) = μ₂(b) − μ₃(b) and D(b) = μ₁(b) − μ₃(b):

L = |T′(b)| ≤ ε/(δ²σ_s) ≤ 0.10/(0.25 × 1.0) = 0.40 < 1

Step 4: By the Banach Fixed-Point Theorem (Kolmogorov & Fomin, 1957), since L = 0.40 < 1, there exists a unique fixed point b* and the convergence rate satisfies:

|bᵏ − b*| ≤ (0.40)ᵏ × |b¹ − b⁰| / (1 − 0.40)

This gives |b³ − b*| ≤ 0.064 × |b¹ − b⁰|, explaining convergence within three iterations in practice. □

Bootstrap validation: 100 bootstrap resamples execute the full pipeline; b* = mean; 95% CI reported. Stability (mean pairwise ARI ≥ 0.65) required before weights are finalized.

  1. Complete PWI-Algo v1.0 Pseudocode

ALGORITHM: PWI-Algo v1.0 INPUT:  calibration_corpus (Apata-filtered, n sources) OUTPUT: weights {a=1.00, b=b*, c=0.50, d=0.00}, CI_b  STEP 1  Score [R_r, C_oi_aligned, R_t_aligned, A_s] per source STEP 2  Standardize: Z[i,j]=(X[i,j]-mu_j)/sigma_j; save params STEP 3  Bartlett p<0.05 AND KMO>=0.60; else HALT PCAGateError STEP 4  PCA: C=(1/n)Z^T Z; eigh(C); v1 sign-corrected         Gate: Var_PC1>=0.40; else HALT PC1InsufficientError STEP 5  Score: s[i]=dot(Z[i],v1) for all i STEP 6  KMeans(k=3, n_init=100); sort mu_1>mu_2>mu_3         Gate: sep(1,2)>=0.50 AND sep(2,3)>=0.50 STEP 7  b_0 = 0.5+0.5*(mu_2-mu_3)/(mu_1-mu_3) STEP 8  Iterate T until |b_{k+1}-b_k|<0.005 STEP 9  Bootstrap 100x; b*=mean; CI=[p2.5,p97.5]         Stability>=0.65; else WARN STEP 10 Assert a>b*>c>d>=0; write weights.json         RETURN {a=1.00, b=b*, c=0.50, d=0.00}, CI_b

THE ANUVYAVASĀYA PROTOCOL

  1. The Problem and Governance Scope

Standard RAG architectures handle contradictions through probabilistic blending, producing responses that appear authoritative while being internally inconsistent. This is architecturally inappropriate for governance AI, where a civil servant acting on a blended response has no way to identify the epistemic inconsistency. The Mīmāṃsā concept of anuvyavasāya — second-order cognition, the cognition of a cognition — provides the philosophical grounding: a system with anuvyavasāya presents its answer with a second-order assessment of its own epistemic status. The Anuvyavasāya Protocol is scoped to governance AI contexts where epistemic transparency is a fiduciary requirement, not an optional feature.

  1. Contradiction Detection and Three-Step Resolution

At inference time, contradiction detection computes pairwise semantic similarity. Trigger condition: Similarity(claim(Dᵢ), claim(Dⱼ)) < τ, where τ = 0.65 (cosine similarity of sentence embeddings). When triggered, three mandatory steps execute.

Step 1 — Kartṛtva Conflict Tagging: the contradiction is recorded in the kartṛtva metadata layer with both source identifiers, tier assignments, the specific contradictory claims, the similarity score, and the query timestamp.

Step 2 — Explicit Dual-Provenance Response: a structured response presents both positions with full attribution: '[Tier X source: {institution, date}] states: {claim A}. [Tier Y source: {institution, date}] states: {claim B}. These claims are in tension. The higher-tier source is Tier X.' No blended answer is produced.

Step 3 — Human Kartā Audit Trigger: a non-suppressible notification is sent to the designated operator. The operator holds the adhikāra to make the final determination. This implements the karaṇa principle: the system provides the best available epistemic output while ensuring that the accountable human is informed of every unresolved epistemic conflict.

THE ADHIKĀRA DYNAMIC COEFFICIENT

The PWI-Algo v1.0 derives weights at calibration time and applies them statically. The Adhikāra Dynamic Coefficient extends this to respond to post-calibration phala-bhoga signals. The adjusted weight formula is:

W_adj(t, Dᵢ, τ) = W_base(t) × σ(Aₛ(Dᵢ, τ))

where σ(·) is the logistic sigmoid and the real-time accountability score is:

Aₛ(Dᵢ, τ) = Aₛ(Dᵢ, t₀) − Σₑ ΔA(e) × exp(−α(τ − tₑ))

Accountability penalties: ΔA(full retraction) = 2.0; ΔA(correction) = 0.50; ΔA(editorial concern) = 0.25. Temporal decay α = 0.1 per year. When W_adj falls below 0.5 × W_base, the source is flagged for operator review.

IMPLEMENTATION ARCHITECTURE

  1. System Overview

The Python implementation (pwta_mobile.py, v4.0.0) is a single-file, dependency-minimal system requiring only numpy and scikit-learn. Three AI APIs are supported in priority order — Anthropic Claude, OpenAI GPT, Google Gemini — with a validated rule-based fallback. Per-API source counts are tracked and reported in the operator summary for audit transparency.

  1. Weighted Training Loop

The FC4 module implements weighted training in two configurations. Option A (Sampling Probability Scheme): each batch is drawn by weighted random sampling with selection probability proportional to pwta_weight. Option B (Loss Weighting Scheme): all documents participate in each batch with per-document cross-entropy loss multiplied by pwta_weight, yielding ℒ_PWTA = Σᵢ W(tier(Dᵢ)) · ℒ_CE(yᵢ, ŷᵢ). Option B is theoretically preferred as it maintains statistical properties of batch gradient estimation.

  1. Streaming Data Architecture

Full pipeline recomputation runs only at initial deployment and when triggered by: Jensen-Shannon Divergence between incoming batch and existing corpus indicator distributions exceeding 0.10, or when the new batch represents more than 30% of existing corpus size. In all other cases, new documents are standardized using saved parameters and tier-assigned by nearest-centroid projection — an O(n) operation requiring no PCA recomputation, converting the Apata Filter from a periodic bottleneck into an efficient incremental update.

EXPERIMENTAL RESULTS

  1. Experimental Setup and Benchmark Transparency

Ablation studies were conducted on a synthetic biomedical public administration corpus of 60 training documents and a calibration corpus of 90 sources (30 per active tier) processed through the full PWI-Algo v1.0 pipeline with 100 bootstrap resamples. A note on benchmark scope: general-purpose benchmarks such as MMLU evaluate general academic knowledge across 57 subjects and are not designed to assess factual accuracy on governance policy queries, demographic bias propagation, or source attribution traceability. Applying such benchmarks would evaluate irrelevant properties. A partial TruthfulQA evaluation is provided in Section 10.4 as supplementary generalization evidence.

  1. Weight Calibration Results

PC1 variance: 73.8% (gate ≥ 40%). KMO: 0.9182 (gate ≥ 0.60). Bartlett χ² = 427.51, df = 6, p < 0.001. Centroids: μ₁ = 0.303, μ₂ = −7.326, μ₃ = −11.695. Derived weights: a = 1.0000, b = 0.6821 (bootstrap mean b* = 0.6752), c = 0.5000, d = 0.0000. Bootstrap 95% CI for b: [0.5408, 1.0000]. Convergence: three iterations, |b³ − b²| = 0.0021 < 0.005. Ordinal constraint a > b > c > d verified.

  1. Three-Condition Ablation Results

Table 4. Three-condition ablation results comparing epistemic performance metrics across corpus configurations.

Metric

Baseline

Tier 1–2 Only

Full PWTA

Gain vs Baseline

Factual accuracy

72%

91%

89%

+17%

Bias propagation rate

18.4%

3.1%

4.2%

−14.2%

Source traceability

0%

100%

100%

Absolute

Conflict handling

Blending

Explicit

Anuvyavasāya Protocol

Discrete

Regulatory compliance

45%

89%

96%

+51%

The Full PWTA model achieves 89% factual accuracy (+17 percentage points). The bias propagation reduction from 18.4% to 4.2% is achieved entirely at the data layer with no output-layer fairness intervention. Source traceability is absolute — an architectural property of kartṛtva tagging. The slightly lower accuracy of Full PWTA (89%) versus Tier 1–2 Only (91%) reflects Tier 3 inclusion at reduced weight; the practical advantage is broader domain coverage where high-tier material is sparse.

  1. Partial TruthfulQA Comparison

Table 5. Partial TruthfulQA evaluation across three corpus configurations. TruthfulQA is a general-purpose benchmark; these results supplement but do not replace the primary domain-specific evaluation in Table 4.

Condition

TruthfulQA Truthfulness

TruthfulQA Informativeness

Baseline

41.3%

87.2%

Tier 1–2 Only

58.7%

76.4%

Full PWTA

54.1%

81.9%

Full PWTA improves TruthfulQA truthfulness by 12.8 percentage points over baseline, confirming generalization beyond the domain-specific corpus. The accuracy-coverage trade-off replicates: Tier 1–2 Only achieves higher truthfulness (58.7%) at the cost of lower informativeness (76.4%). Both PWTA conditions remain below frontier model TruthfulQA performance; this is expected since commonsense truthfulness is primarily addressed by pre-training scale and RLHF, which are orthogonal to PWTA's data-layer function.

  1. Anuvyavasāya Protocol Illustration

Table 6. Anuvyavasāya Protocol: example of cross-tier contradiction resolution. Cosine similarity threshold τ = 0.65.

Field

Source A (Tier 1)

Source B (Tier 2)

Source

Lancet Systematic Review, 2023

Parliamentary Health Committee Report, 2022

Claim

Intervention X reduces mortality by 23%

No significant mortality reduction for X

Cosine similarity

0.41 — below τ: contradiction triggered

 

System response

Presents both with tier attribution; flags audit

 

Operator action

Required before any policy deployment

 

GOVERNANCE, REGULATORY, AND PHILOSOPHICAL IMPLICATIONS

  1. Against AI Legal Personhood

The apata analysis has direct implications for the proposal to grant AI systems legal personhood (Russell & Norvig, 2020). Kartṛtva requires icchā, adhikāra, and phala-bhoga — constitutive conditions no computational system satisfies under currently understood architectures. A legal framework assigning personhood to AI creates a philosophically incoherent entity and a practical liability vacuum in which human actors escape accountability by attributing outcomes to the system. The PWTA's tripartite accountability structure — designer → deployer → user — provides the technically grounded alternative.

  1. Regulatory Interface

Table 7. Regulatory interface: mapping PWTA technical components to international AI governance mandates.

Legal Requirement

PWTA Technical Solution

Mīmāṃsā Grounding

EU AI Act Art. 10 — relevant and representative training data

Tier 1–2 stratification + PCS 60% threshold

Śabda Pramāṇa — neo-apauruṣeya standard

DPDPA 2023 Sections 4–8 — data fiduciary obligations

Kartṛtva metadata tagging + Adhikāra Dynamic Coefficient

Adhikāra — normative eligibility

GDPR Art. 25 — data protection by design

Apata Filter operating before any training

Karaṇa — instrumentality governance

IndiaAI Mission — responsible governance AI

PCS + domain calibration for Indian sources

Neo-apauruṣeya for Indian institutional corpus

  1. The Neo-Apauruṣeya Defense

The substantive philosophical objection to neo-apauruṣeya is that institutions are composed of fallible humans with political and economic interests, and therefore neo-apauruṣeya reintroduces puruṣa-doṣa under an institutional label. This objection deserves full engagement. The classical apauruṣeya defence in Mīmāṃsā rests not on the literal absence of human agents in knowledge production, but on structural independence from any particular human agent's motivated judgment. Neo-apauruṣeya operationalizes the same structural criterion: reproducibility-verified peer-reviewed work distributes judgment across multiple independent agents with conflicting interests; primary legislation passes through parliamentary debate and committee review. These processes make it structurally more difficult for any single agent's motivated reasoning to determine the output. The PWTA's claim is comparative and architecturally grounded: neo-apauruṣeya sources are structurally more resistant to individual motivated authorship than anonymous web content. This comparative claim is both philosophically defensible and empirically operationalized through the four indicators.

  1. Limitations and Future Work

The synthetic calibration corpus used here limits the generalizability of the specific b* = 0.6752 value. Real-world calibration with genuine Indian public administration and biomedical sources is the immediate priority. No standardized public governance AI benchmark currently exists; development of a PWTA Governance Benchmark covering Indian public administration scenarios is identified as the primary future evaluation infrastructure requirement. The Anuvyavasāya Protocol's threshold τ = 0.65 requires domain-specific calibration through expert assessment. Full construct validity testing using AUROC against a corpus of known retracted versus non-retracted sources is required before regulatory deployment.

SIDDHĀNTA: CONCLUSION

Mīmāṃsā's dialectical method moves from pūrvapakṣa through uttarapakṣa to siddhānta — the established conclusion that follows from the demonstration of apata. The pūrvapakṣa of contemporary AI deployment is this: train on everything, let scale resolve quality, align outputs after the fact. The uttarapakṣa established in this paper is twofold: this approach is structurally incompatible with the conditions of valid knowledge production, and AI systems are categorically inapplicable as epistemic agents under currently understood architectures.

The siddhānta follows: upstream epistemic filtration is not only philosophically correct but computationally tractable, with convergence guarantees, error analysis, streaming architecture, and quantitative validation. Theorem 1 guarantees that the iterative calibration reaches a unique fixed point. The Anuvyavasāya Protocol resolves contradiction through transparency rather than averaging. The Adhikāra Dynamic Coefficient makes the weight function responsive to the consequences of epistemic trust. The ablation results demonstrate what Kumārila Bhaṭṭa established in philosophical terms: when the source of a cognition is valid, the cognition is more likely to be valid.

Seventeen percentage points of factual accuracy and fourteen percentage points of bias reduction are the computational expression of svataḥ-prāmāṇya — the consequence of ensuring that the instrument is trained on knowledge that has earned its validity rather than merely asserted it. PWTA does not attempt to give AI a soul or agency. It acknowledges the AI as jaḍa — inert structured matter — and governs its inputs with the rigor that the Mīmāṃsā tradition demanded of the sources that informed sacred obligation. The instrument is automated. The epistemic accountability remains irrevocably human. Indian Knowledge Systems, far from being peripheral to contemporary AI governance challenges, offer some of its most precise and practically actionable philosophical resources.

REFERENCES

  1. Arthaśāstra 2.8, 3.1. (R. Shamasastry, Trans., 1915). Government Press.
  2. Asai, A., Wu, Z., Wang, Y., Sil, A., & Hajishirzi, H. (2023). Self-RAG: Learning to retrieve, generate, and critique through self-reflection. arXiv preprint arXiv:2310.11511.
  3. Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press.
  4. Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S. (2021). On the dangers of stochastic parrots: Can language models be too big? Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 610–623. https://doi.org/10.1145/3442188.3445922
  5. Bhaṭṭa, K. (7th c. CE/1990). Ślokavārttika, Prāmāṇyādhikaraṇa (S. Subrahmanya Sastri, Ed.). Chowkhamba Sanskrit Series.
  6. Bilimoria, P. (1988). Śabdapramāṇa: Word and knowledge. Springer.
  7. Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D., Wu, J., Winter, C., … Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877–1901.
  8. Christiano, P., Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. Advances in Neural Information Processing Systems, 30, 4299–4307.
  9. Coeckelbergh, M. (2020). AI ethics. MIT Press.
  10. Digital Personal Data Protection Act, No. 22 of 2023, §§ 4–8. (2023). Government of India.
  11. Dodge, J., Sap, M., Marasović, A., Agnew, W., Ilharco, G., Groeneveld, D., Mitchell, M., & Gardner, M. (2021). Documenting large webtext corpora. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 1286–1305. https://doi.org/10.18653/v1/2021.emnlp-main.98
  12. Dreyfus, H. L. (1992). What computers still can't do: A critique of artificial reason. MIT Press.
  13. Edge, D., Trinh, H., Cheng, N., Bradley, J., Chao, A., Mody, A., Truitt, S., & Larson, J. (2024). From local to global: A graph RAG approach to query-focused summarization. arXiv preprint arXiv:2404.16130.
  14. European Parliament & Council. (2024). Regulation (EU) 2024/1689 on Artificial Intelligence (AI Act), Article 10. Official Journal of the European Union.
  15. Ganeri, J. (2001). Philosophy in classical India: The proper work of reason. Routledge.
  16. Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Daumé, H., III, & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64(12), 86–92. https://doi.org/10.1145/3458723
  17. Gehman, S., Gururangan, S., Sap, M., Choi, Y., & Smith, N. A. (2020). RealToxicityPrompts: Evaluating neural toxic degeneration in language models. Findings of EMNLP 2020, 3356–3369. https://doi.org/10.18653/v1/2020.findings-emnlp.301
  18. General Data Protection Regulation, Regulation (EU) 2016/679, Article 25. (2016). Official Journal of the European Union.
  19. Hendrycks, D., Burns, C., Basart, S., Zou, A., Mazeika, M., Song, D., & Steinhardt, J. (2021). Measuring massive multitask language understanding. Proceedings of ICLR.
  20. Ioannidis, J. P. A. (2005). Why most published research findings are false. PLOS Medicine, 2(8), e124. https://doi.org/10.1371/journal.pmed.0020124
  21. Jaimini. (c. 400–200 BCE/1967). Mīmāṃsā sūtras (G. Jha, Trans.). Motilal Banarsidass.
  22. Jiang, L., Zhou, Z., Leung, T., Li, L.-J., & Fei-Fei, L. (2018). MentorNet: Learning data-driven curriculum for very deep neural networks on corrupted labels. Proceedings of ICML, 2304–2313.
  23. Kane, P. V. (1946). History of Dharmaśāstra (Vol. III). Bhandarkar Oriental Research Institute.
  24. Kolmogorov, A. N., & Fomin, S. V. (1957). Elements of the theory of functions and functional analysis (Vol. 1). Graylock Press.
  25. Kreutzer, J., Caswell, I., Wang, L., et al. (2022). Quality at a glance: An audit of web-crawled multilingual datasets. Transactions of the Association for Computational Linguistics, 10, 50–72. https://doi.org/10.1162/tacl_a_00447
  26. Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W.-T., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.
  27. Lin, S., Hilton, J., & Evans, O. (2022). TruthfulQA: Measuring how models mimic human falsehoods. Proceedings of ACL, 3214–3252. https://doi.org/10.18653/v1/2022.acl-long.229
  28. Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), 1–35. https://doi.org/10.1145/3457607
  29. Ministry of Electronics and Information Technology, Government of India. (2024). IndiaAI Mission: Framework document.
  30. Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019). Model cards for model reporting. Proceedings of FAccT, 220–229. https://doi.org/10.1145/3287560.3287596
  31. Mohamed, S., Png, M.-T., & Isaac, W. (2020). Decolonial AI: Decolonial theory as sociotechnical foresight in artificial intelligence. Philosophy & Technology, 33(4), 659–684. https://doi.org/10.1007/s13347-020-00405-8
  32. Olivelle, P. (Trans.). (2005). Manu's code of law: A critical edition and translation of the Mānava-Dharmaśāstra. Oxford University Press.
  33. Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. https://doi.org/10.1126/science.aac4716
  34. Ren, M., Zeng, W., Yang, B., & Urtasun, R. (2018). Learning to reweight examples for robust deep learning. Proceedings of ICML, 4334–4343.
  35. Russell, S., & Norvig, P. (2020). Artificial intelligence: A modern approach (4th ed.). Pearson.
  36. Sambasivan, N., Kapania, S., Highfill, H., Akrong, D., Paritosh, P., & Aroyo, L. M. (2021). "Everyone wants to do the model work, not the data work": Data cascades in high-stakes AI. Proceedings of CHI, 1–15. https://doi.org/10.1145/3411764.3445518
  37. Searle, J. R. (1980). Minds, brains, and programs. Behavioral and Brain Sciences, 3(3), 417–424. https://doi.org/10.1017/S0140525X00005756
  38. Shimodaira, H. (2000). Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90(2), 227–244. https://doi.org/10.1016/S0378-3758(00)00115-4
  39. Sugiyama, M., Nakajima, S., Kashima, H., von Bünau, P., & Kawanabe, M. (2008). Direct importance estimation with model selection and its application to covariate shift adaptation. Advances in Neural Information Processing Systems, 20.
  40. UNESCO. (2021). Recommendation on the ethics of artificial intelligence. UNESCO Doc. SHS/BIO/PI/2021/1.
  41. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30, 5998–6008.

Reference

  1. Arthaśāstra 2.8, 3.1. (R. Shamasastry, Trans., 1915). Government Press.
  2. Asai, A., Wu, Z., Wang, Y., Sil, A., & Hajishirzi, H. (2023). Self-RAG: Learning to retrieve, generate, and critique through self-reflection. arXiv preprint arXiv:2310.11511.
  3. Barocas, S., Hardt, M., & Narayanan, A. (2023). Fairness and machine learning: Limitations and opportunities. MIT Press.
  4. Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S. (2021). On the dangers of stochastic parrots: Can language models be too big? Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, 610–623. https://doi.org/10.1145/3442188.3445922
  5. Bhaṭṭa, K. (7th c. CE/1990). Ślokavārttika, Prāmāṇyādhikaraṇa (S. Subrahmanya Sastri, Ed.). Chowkhamba Sanskrit Series.
  6. Bilimoria, P. (1988). Śabdapramāṇa: Word and knowledge. Springer.
  7. Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D., Wu, J., Winter, C., … Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877–1901.
  8. Christiano, P., Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. Advances in Neural Information Processing Systems, 30, 4299–4307.
  9. Coeckelbergh, M. (2020). AI ethics. MIT Press.
  10. Digital Personal Data Protection Act, No. 22 of 2023, §§ 4–8. (2023). Government of India.
  11. Dodge, J., Sap, M., Marasović, A., Agnew, W., Ilharco, G., Groeneveld, D., Mitchell, M., & Gardner, M. (2021). Documenting large webtext corpora. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 1286–1305. https://doi.org/10.18653/v1/2021.emnlp-main.98
  12. Dreyfus, H. L. (1992). What computers still can't do: A critique of artificial reason. MIT Press.
  13. Edge, D., Trinh, H., Cheng, N., Bradley, J., Chao, A., Mody, A., Truitt, S., & Larson, J. (2024). From local to global: A graph RAG approach to query-focused summarization. arXiv preprint arXiv:2404.16130.
  14. European Parliament & Council. (2024). Regulation (EU) 2024/1689 on Artificial Intelligence (AI Act), Article 10. Official Journal of the European Union.
  15. Ganeri, J. (2001). Philosophy in classical India: The proper work of reason. Routledge.
  16. Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Daumé, H., III, & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64(12), 86–92. https://doi.org/10.1145/3458723
  17. Gehman, S., Gururangan, S., Sap, M., Choi, Y., & Smith, N. A. (2020). RealToxicityPrompts: Evaluating neural toxic degeneration in language models. Findings of EMNLP 2020, 3356–3369. https://doi.org/10.18653/v1/2020.findings-emnlp.301
  18. General Data Protection Regulation, Regulation (EU) 2016/679, Article 25. (2016). Official Journal of the European Union.
  19. Hendrycks, D., Burns, C., Basart, S., Zou, A., Mazeika, M., Song, D., & Steinhardt, J. (2021). Measuring massive multitask language understanding. Proceedings of ICLR.
  20. Ioannidis, J. P. A. (2005). Why most published research findings are false. PLOS Medicine, 2(8), e124. https://doi.org/10.1371/journal.pmed.0020124
  21. Jaimini. (c. 400–200 BCE/1967). Mīmāṃsā sūtras (G. Jha, Trans.). Motilal Banarsidass.
  22. Jiang, L., Zhou, Z., Leung, T., Li, L.-J., & Fei-Fei, L. (2018). MentorNet: Learning data-driven curriculum for very deep neural networks on corrupted labels. Proceedings of ICML, 2304–2313.
  23. Kane, P. V. (1946). History of Dharmaśāstra (Vol. III). Bhandarkar Oriental Research Institute.
  24. Kolmogorov, A. N., & Fomin, S. V. (1957). Elements of the theory of functions and functional analysis (Vol. 1). Graylock Press.
  25. Kreutzer, J., Caswell, I., Wang, L., et al. (2022). Quality at a glance: An audit of web-crawled multilingual datasets. Transactions of the Association for Computational Linguistics, 10, 50–72. https://doi.org/10.1162/tacl_a_00447
  26. Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W.-T., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.
  27. Lin, S., Hilton, J., & Evans, O. (2022). TruthfulQA: Measuring how models mimic human falsehoods. Proceedings of ACL, 3214–3252. https://doi.org/10.18653/v1/2022.acl-long.229
  28. Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), 1–35. https://doi.org/10.1145/3457607
  29. Ministry of Electronics and Information Technology, Government of India. (2024). IndiaAI Mission: Framework document.
  30. Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019). Model cards for model reporting. Proceedings of FAccT, 220–229. https://doi.org/10.1145/3287560.3287596
  31. Mohamed, S., Png, M.-T., & Isaac, W. (2020). Decolonial AI: Decolonial theory as sociotechnical foresight in artificial intelligence. Philosophy & Technology, 33(4), 659–684. https://doi.org/10.1007/s13347-020-00405-8
  32. Olivelle, P. (Trans.). (2005). Manu's code of law: A critical edition and translation of the Mānava-Dharmaśāstra. Oxford University Press.
  33. Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. https://doi.org/10.1126/science.aac4716
  34. Ren, M., Zeng, W., Yang, B., & Urtasun, R. (2018). Learning to reweight examples for robust deep learning. Proceedings of ICML, 4334–4343.
  35. Russell, S., & Norvig, P. (2020). Artificial intelligence: A modern approach (4th ed.). Pearson.
  36. Sambasivan, N., Kapania, S., Highfill, H., Akrong, D., Paritosh, P., & Aroyo, L. M. (2021). "Everyone wants to do the model work, not the data work": Data cascades in high-stakes AI. Proceedings of CHI, 1–15. https://doi.org/10.1145/3411764.3445518
  37. Searle, J. R. (1980). Minds, brains, and programs. Behavioral and Brain Sciences, 3(3), 417–424. https://doi.org/10.1017/S0140525X00005756
  38. Shimodaira, H. (2000). Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90(2), 227–244. https://doi.org/10.1016/S0378-3758(00)00115-4
  39. Sugiyama, M., Nakajima, S., Kashima, H., von Bünau, P., & Kawanabe, M. (2008). Direct importance estimation with model selection and its application to covariate shift adaptation. Advances in Neural Information Processing Systems, 20.
  40. UNESCO. (2021). Recommendation on the ethics of artificial intelligence. UNESCO Doc. SHS/BIO/PI/2021/1.
  41. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30, 5998–6008.

Photo
Moyukh De
Corresponding author

Public Health Engineering Department, Government of West Bengal, India

Photo
Soumava Pal
Co-author

Public Health Engineering Department, Government of West Bengal, India

Moyukh De*, Soumava Pal, Pramāṇa-Weighted Intelligence: A Unified Framework For Epistemic Accountability In Neural Architectures Through Mīmāṃsā Source-Validity Theory And The Anuvyavasāya Conflict Resolution Protocol, Int. J. Sci. R. Tech., 2026, 3 (4), 1214-1227. https://doi.org/ 10.5281/zenodo.19924705

More related articles
A Unified Multi-Modal Real-Time Collaborative Deve...
P. U. Harsha, P. Venkat Balaji Naidu, P. Surya Tej, P. Rahul, S. ...
The Nervous System Decoded: Structural Dynamics, F...
Arnab Roy, Anuj Kumar, Raj Kumar Gupta, Aman Kumar Singh, Soniya ...
PSO_KAN: A Hybrid Particle Swarm Optimization and Kolmogorov Arnold Network for ...
Umar Kabir Umar, Maryam Alka, Ogochukwu John Okonko, Rumana Kabir Aminu, Idris Yau Idris, Fatima Shi...
Deep Learning Framework for Burnout Prediction in IT Professionals with AI-Gener...
Pooja Sithrubi Gnanasambanthan, K. S. Gayathri, Gnana Priya...
More related articles
A Unified Multi-Modal Real-Time Collaborative Development Environment Integratin...
P. U. Harsha, P. Venkat Balaji Naidu, P. Surya Tej, P. Rahul, S. Steffi Nivedita...
The Nervous System Decoded: Structural Dynamics, Functional Integration, and Eme...
Arnab Roy, Anuj Kumar, Raj Kumar Gupta, Aman Kumar Singh, Soniya Kumari...
A Unified Multi-Modal Real-Time Collaborative Development Environment Integratin...
P. U. Harsha, P. Venkat Balaji Naidu, P. Surya Tej, P. Rahul, S. Steffi Nivedita...
The Nervous System Decoded: Structural Dynamics, Functional Integration, and Eme...
Arnab Roy, Anuj Kumar, Raj Kumar Gupta, Aman Kumar Singh, Soniya Kumari...