Skip to main content

HumanMark

·506 words·3 mins
Vin Patel
Author
Vin Patel
Your content. Your servers. Your control. HumanMark is the open-source AI detection engine that never sends your data to the cloud.

πŸ€” The Problem
#

Every AI detection service requires you to upload your content to their servers. That’s a dealbreaker for:

Compliance Nightmare: Healthcare (HIPAA), Legal (privilege), Finance (SOC2/PCI), and Government (classified) organizations cannot send sensitive data to third-party APIs.

HumanMark runs entirely on YOUR infrastructure. Your data never leaves your network. Period.


⚑ Quick Start
#

Get running in under 30 seconds:

docker run -p 8080:8080 humanmark/humanmark
go install github.com/vinpatel/humanmark/cmd/api@latest
humanmark
git clone https://github.com/vinpatel/humanmark.git
cd humanmark && make run

Then detect AI content:

curl -X POST http://localhost:8080/verify \
  -H "Content-Type: application/json" \
  -d '{"text": "Your content here"}'

Response:

{
  "id": "hm_abc123",
  "verdict": "human",
  "confidence": 0.87,
  "signals": {
    "sentence_variance": 0.42,
    "vocabulary_richness": 0.78,
    "contraction_ratio": 0.15
  }
}

πŸ† Why HumanMark?
#

Feature Comparison
#

FeatureHumanMarkGPTZeroOriginality.aiTurnitin
Self-Hostedβœ…βŒβŒβŒ
Works Offlineβœ…βŒβŒβŒ
Open Sourceβœ… MIT❌❌❌
Zero Costβœ…βŒβŒβŒ
Multi-Modalβœ…βš οΈβš οΈβš οΈ
API Limits∞TieredPer-checkPer-seat

πŸ”¬ How It Works
#

HumanMark uses statistical forensicsβ€”no ML models, no GPU required, instant results.

SignalHumanAI
Sentence length varianceHighLow
Vocabulary richnessDiverse“Safe” words
Contractions“don’t”, “I’m”“do not”, “I am”
Punctuation variety!?;:β€”Mostly periods
AI phrasesRare“As an AI…”
SignalReal PhotoAI Image
EXIF metadataPresentMissing
Camera makeApple, CanonNone
Sensor noiseNaturalToo clean
CompressionConsistentIrregular
  • File header analysis
  • Encoder fingerprinting
  • AI tool markers (ElevenLabs, etc.)
  • Temporal consistency checks
  • Container metadata analysis
  • Frame-by-frame consistency
  • Generation tool signatures
  • Encoding profile anomalies

πŸ’Ό Use Cases
#

  1. Enterprise Compliance

    HIPAA/GDPR

    Healthcare, Finance, Legal

    Organizations that cannot send data to third-party APIs. HumanMark runs on-premiseβ€”patient records, legal documents, and financial data never leave your network.
  2. Education at Scale

    100K+ Students

    Universities & Districts

    Process millions of assignments without per-document fees. Self-hosted = unlimited usage at fixed infrastructure costs. Often 90%+ savings vs. commercial alternatives.
  3. Developer Integration

    REST API

    Content Platforms & Apps

    Integrate AI detection into your product: content moderation, hiring tools, CMS plugins, browser extensions. Full REST API with SDKs coming soon.
  4. Government & Defense

    Air-Gapped

    Classified Environments

    Deploy in air-gapped networks with zero internet connectivity. Essential for classified environments, secure facilities, and critical infrastructure.

πŸ“Š Performance
#

Benchmarked on AWS c5.xlarge (4 vCPU, 8GB RAM):

12,400 req/sec 8ms p50 latency 45MB memory 150ms cold start

πŸ—ΊοΈ Roadmap
#

  • Text detection (statistical analysis)
  • Image detection (EXIF + forensics)
  • Audio detection (metadata)
  • Video detection (container analysis)
  • Docker support
  • Prometheus metrics
  • πŸ”œ Browser extension
  • πŸ”œ VS Code extension
  • πŸ”œ WordPress plugin
  • πŸ”œ Python & JavaScript SDKs
  • πŸ”œ Admin dashboard

πŸ”— Links#

⭐ Star on GitHub πŸ’¬ Discussions

πŸ“œ License
#

MIT License β€” Use it however you want. Free forever. No premium tiers, no enterprise editions, no bait-and-switch.

Built with ❀️ by Vin Patel