Indian job market · fraud detection
A hybrid ML system detecting fraudulent job postings in the Indian market. TF-IDF text analysis combined with 15 hand-engineered fraud signals, deployed as a live API and Chrome extension.
Submit a posting to see the risk report
Legitimate posting
Suspicious posting
About the extension
The Chrome extension scrapes job postings on LinkedIn and runs them against the live API in real time. Results appear as an overlay on each listing. No copy-pasting required.
Falls back to local heuristics if the API is cold-starting
No data stored — requests are discarded after analysis
Works on "linkedin.com/jobs/view/" and "collections" pages
Each posting is vectorised with TF-IDF across 30,000 unigram and bigram features, then combined with 15 hand-crafted regex features targeting India-specific fraud patterns.
A sparse hstacked matrix feeds a balanced logistic regression model (C=1.0, class_weight='balanced'). Outputs a calibrated fraud probability between 0 and 1.
Each prediction returns the top contributing model features, matched regex signals with descriptions, a risk band, and a confidence score — every decision is auditable.
Candidates asked to pay ₹500–5,000 as registration fee or security deposit before the role is confirmed.
₹30,000–80,000/month for copy-paste work. Training kits purchased upfront; payouts never materialise.
Entire recruitment over messaging apps. No verifiable company presence, offers sent as screenshots.
Posts impersonating Infosys, TCS, Wipro with near-identical domains or logo abuse.
Promises placements abroad. Agents charge ₹50,000–2 lakh for visa processing that leads nowhere.
Targets fresh graduates with internships requiring a certification fee. Companies don't exist.
Matches "registration fee", "pay ₹", "security deposit" and 12 variants.
Contact restricted to WhatsApp number, often "WhatsApp: +91 …".
Salary implausibly high for the role type, or "earn X daily" patterns.
"100% placement guaranteed", "assured income" — language no real employer uses.
"Limited seats", "apply within 24 hours", "immediate joiners only".
References to Gulf, Dubai, Singapore with placement guarantees or visa assistance.
"Freshers welcome", "no qualification required", "anyone can apply".
Contact email uses gmail.com or yahoo.com instead of a company domain.
Generic names like "HR Solutions", "Placement Services Pvt Ltd".
Payment required for training certificates or "mandatory certifications".
Domain mimicking a known brand — infosysjobs.in, tcs-careers.co.
"Data entry" + "work from home" + salary claims. High-precision fraud combo.