How InstaTruth Works

Our advanced multi-agent fact-checking system combines DistilBERT NLP classification with comprehensive web verification using Google Custom Search and DeepSeek AI analysis.

STEP 1

Audio Transcription & Analysis

When you submit a video link, our system downloads the content with yt-dlp and uses OpenAI Whisper to transcribe any speech into text.

The transcribed text is then cleaned and prepared for analysis by our DistilBERT-based natural language processing pipeline.
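As a rough illustration of this step, the sketch below downloads audio with yt-dlp, transcribes it with Whisper, and applies a simple cleanup pass. The function names, the output directory, the "base" Whisper model size, and the cleanup rules are assumptions made for the example, not InstaTruth's actual code. Whisper also needs ffmpeg installed to read the downloaded file.

```python
import re


def download_audio(url: str, out_dir: str = "downloads") -> str:
    """Download the best available audio stream and return its local path."""
    import yt_dlp  # imported lazily so clean_transcript works without it

    options = {
        "format": "bestaudio/best",
        "outtmpl": f"{out_dir}/%(id)s.%(ext)s",  # assumed naming scheme
        "quiet": True,
    }
    with yt_dlp.YoutubeDL(options) as ydl:
        info = ydl.extract_info(url, download=True)
        return ydl.prepare_filename(info)


def transcribe(audio_path: str, model_name: str = "base") -> str:
    """Transcribe speech to text with OpenAI Whisper (model size is assumed)."""
    import whisper

    model = whisper.load_model(model_name)
    return model.transcribe(audio_path)["text"]


def clean_transcript(text: str) -> str:
    """Hypothetical cleanup: drop bracketed tags like [Music], collapse spaces."""
    text = re.sub(r"\[[^\]]*\]", " ", text)
    text = re.sub(r"\s+", " ", text)
    return text.strip()
```

A typical call chain would be clean_transcript(transcribe(download_audio(url))).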

Audio Processing
Complete

Transcription Result:

"Studies show that this new treatment has a 90% success rate in clinical trials and could be available as early as next month..."

Source Verification
Complete

Sources Found:

  • Clinical trial data from National Health Institute (Phase 2 results only)
  • No sources confirm availability timeline

STEP 2

Multi-Step Claim Extraction & Verification

DeepSeek AI first extracts individual, verifiable factual claims from the transcription. Each claim is then researched independently through the Google Custom Search API to find credible supporting or contradicting sources.
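The per-claim search could look roughly like the sketch below, which queries the Custom Search JSON API endpoint for one extracted claim. The helper names and the choice of five results are assumptions for illustration. The API key and search engine ID (cx) must come from your own Google project.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

SEARCH_ENDPOINT = "https://www.googleapis.com/customsearch/v1"


def build_search_url(claim: str, api_key: str, engine_id: str,
                     num_results: int = 5) -> str:
    """Build the request URL for one claim (the API allows at most 10 results)."""
    params = {"key": api_key, "cx": engine_id, "q": claim, "num": num_results}
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


def search_claim(claim: str, api_key: str, engine_id: str) -> list:
    """Return title, link, and snippet for each search hit."""
    url = build_search_url(claim, api_key, engine_id)
    with urlopen(url, timeout=10) as response:
        payload = json.load(response)
    return [
        {
            "title": item.get("title", ""),
            "link": item.get("link", ""),
            "snippet": item.get("snippet", ""),
        }
        for item in payload.get("items", [])  # "items" is absent when no hits
    ]
```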

Each claim is fact-checked in detail, with sources weighted by credibility (.gov, .edu, major news outlets). The result is a granular, per-claim analysis with confidence scores and evidence summaries.
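One way to picture source credibility weighting is the sketch below. All weights, the example news domains, the stance encoding, and the confidence formula are hypothetical values chosen for illustration. The source states only that .gov, .edu, and major news outlets are weighted.

```python
from urllib.parse import urlparse

# Hypothetical weights; the real values are not published.
SUFFIX_WEIGHTS = {".gov": 1.0, ".edu": 0.9}
NEWS_DOMAINS = {"reuters.com", "apnews.com", "bbc.com"}  # illustrative list
NEWS_WEIGHT = 0.8
DEFAULT_WEIGHT = 0.4


def source_weight(url: str) -> float:
    """Map a source URL to a credibility weight based on its host name."""
    host = (urlparse(url).hostname or "").lower()
    for suffix, weight in SUFFIX_WEIGHTS.items():
        if host.endswith(suffix):
            return weight
    if any(host == d or host.endswith("." + d) for d in NEWS_DOMAINS):
        return NEWS_WEIGHT
    return DEFAULT_WEIGHT


def claim_score(evidence):
    """Score one claim from (url, stance) pairs.

    stance is +1 (supports), -1 (contradicts), or 0 (neutral, ignored).
    Returns (p_true, confidence), both in [0, 1].
    """
    total = sum(source_weight(u) for u, s in evidence if s != 0)
    if total == 0:
        return 0.5, 0.0  # no usable evidence: undecided, zero confidence
    support = sum(source_weight(u) for u, s in evidence if s > 0)
    confidence = min(1.0, total / 3.0)  # assumed saturation point
    return support / total, confidence
```

Under these assumed weights, one supporting .gov source outweighs one contradicting low-credibility page, but confidence stays low until more evidence accumulates.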

STEP 3

DistilBERT NLP Classification

In parallel, our DistilBERT transformer model (66M parameters) processes the transcribed text to generate semantic embeddings, which are classified by a previously trained Random Forest model for an initial credibility assessment.

This NLP analysis produces probability scores normalized to P(true), the probability that the content is true, together with confidence thresholds. It offers an independent, language-based credibility assessment.
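A minimal sketch of this classification path, assuming the distilbert-base-uncased checkpoint, mean pooling over tokens, and a fitted scikit-learn RandomForestClassifier passed in as forest. None of these choices is confirmed by the source, and the helper names are hypothetical.

```python
MODEL_NAME = "distilbert-base-uncased"  # assumed checkpoint


def embed(text: str):
    """Return a mean-pooled DistilBERT embedding as a 768-dim numpy vector."""
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
    model = AutoModel.from_pretrained(MODEL_NAME)
    model.eval()
    inputs = tokenizer(text, return_tensors="pt", truncation=True,
                       max_length=512)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, tokens, 768)
    mask = inputs["attention_mask"].unsqueeze(-1).float()  # (1, tokens, 1)
    pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
    return pooled[0].numpy()


def p_true_from_proba(proba, classes, true_label=1) -> float:
    """Pick the probability of the 'true' class regardless of column order."""
    return float(proba[list(classes).index(true_label)])


def classify(text: str, forest, true_label=1) -> float:
    """Embed the text and return P(true) from a fitted Random Forest."""
    vector = embed(text).reshape(1, -1)
    proba = forest.predict_proba(vector)[0]
    return p_true_from_proba(proba, forest.classes_, true_label)
```

Looking up the column through classes_ keeps the score oriented as P(true) even if the classifier was trained with the labels in a different order.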

NLP Classification
Complete

Classification Results:

P(true) Score: 0.72
Confidence: 0.81
75% True

Overall Truth Score (scale from False to True)

STEP 4

Dynamic Score Aggregation & Report

The system combines the DistilBERT and web verification scores using dynamic, confidence-based weighting. High-confidence web fact-checking takes precedence when it is available, while the NLP analysis provides a fallback assessment.
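The weighting could be sketched as follows. The 0.8 high-confidence cutoff, the 0.85 web weight, and the proportional blend for lower-confidence cases are hypothetical. The source describes the behavior only qualitatively.

```python
HIGH_CONFIDENCE = 0.8       # hypothetical cutoff for "high-confidence" web results
WEB_PRIORITY_WEIGHT = 0.85  # hypothetical weight given to a confident web score


def aggregate(nlp_p_true, nlp_conf, web_p_true=None, web_conf=0.0):
    """Blend NLP and web P(true) scores into one overall score in [0, 1]."""
    # No usable web verification: fall back to the NLP score alone.
    if web_p_true is None or web_conf <= 0.0:
        return nlp_p_true
    if web_conf >= HIGH_CONFIDENCE:
        # Confident web fact-checking takes precedence.
        web_weight = WEB_PRIORITY_WEIGHT
    else:
        # Otherwise weight each component by its share of total confidence.
        web_weight = web_conf / (web_conf + nlp_conf)
    return web_weight * web_p_true + (1.0 - web_weight) * nlp_p_true
```

With these assumed numbers, a confident web score of 0.2 pulls an NLP score of 0.72 down to about 0.28, while the same web score at low confidence moves the result far less.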

Final reports include individual claim breakdowns, source credibility analysis, confidence scores for each component, and comprehensive summaries with final verdicts: "real", "fake", or "inconclusive".
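To make the report shape concrete, here is one possible structure. The field names and the 0.65 and 0.35 verdict thresholds are assumptions for illustration. Only the three verdict labels come from the description above.

```python
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class ClaimResult:
    """Per-claim breakdown (hypothetical field names)."""
    claim: str
    p_true: float
    confidence: float
    sources: List[str] = field(default_factory=list)


def verdict(score: float, real_at: float = 0.65, fake_at: float = 0.35) -> str:
    """Map an overall score to a verdict label; thresholds are assumed."""
    if score >= real_at:
        return "real"
    if score <= fake_at:
        return "fake"
    return "inconclusive"


def build_report(claims, nlp_p_true, nlp_confidence, final_score) -> dict:
    """Assemble claim breakdowns, component scores, and the final verdict."""
    return {
        "verdict": verdict(final_score),
        "final_score": round(final_score, 3),
        "nlp": {"p_true": nlp_p_true, "confidence": nlp_confidence},
        "claims": [asdict(c) for c in claims],
    }
```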

Ready to verify content?

Try InstaTruth Now