OSINT Cheat Sheet
Open-source intelligence workflow: domains, emails, people, breaches, and infrastructure using Google dorks and free OSINT tools.
Overview
OSINT gathers intelligence from public sources—no packets sent to the target. It's the first phase of any engagement: map the attack surface (domains, subdomains, employees, leaked credentials, exposed services) before touching anything. This sheet covers Google dorking and the go-to free tools for each data type.
Authorized and lawful use only. Respect privacy laws, terms of service, and engagement scope. Passive recon is still subject to legal and ethical limits.
Google dorks
Pages indexed for a domain
site:example.comFind non-www subdomains
site:example.com -wwwExposed documents (metadata goldmine)
site:example.com filetype:pdfOpen directory listings
intitle:"index of" site:example.comLogin/admin panels
inurl:admin | inurl:login site:example.comLeaked spreadsheets with emails
intext:"@example.com" filetype:xlsxDomains & infrastructure
Registrar, dates, sometimes contacts
whois example.comDNS records
dig example.com ANY +noall +answerSubdomains from certificate transparency
curl -s 'https://crt.sh/?q=%25.example.com&output=json' | jq -r '.[].name_value' | sort -uEnumerate subdomains and probe which are live
subfinder -d example.com -silent | httpx -silentSearch engines for exposed assets
Internet-exposed services for a domain
shodan search hostname:example.comOpen ports/banners for an IP
shodan host 1.2.3.4Pivot on certificates, favicons, and banners
# Censys / FOFA web UIsEmails, people & breaches
Harvest emails, names, and hosts from many sources
theHarvester -d example.com -b allFind email formats and addresses for a domain
# Hunter.io / phonebook.czCheck emails against known breaches (leaked passwords)
# haveibeenpwned.com / DeHashedCheck which sites an email is registered on
holehe user@example.comFind a username across social networks
sherlock usernameFrameworks & metadata
Automated OSINT correlation across 200+ modules
spiderfoot -l 127.0.0.1:5001Modular OSINT framework with workspaces
recon-ngPull authors, software, and paths from file metadata
exiftool document.pdfHistorical pages, old endpoints, removed content
# Wayback Machine (web.archive.org)Tips
- Start passive: certificate transparency (crt.sh) and subfinder map the attack surface without touching the target.
- Document file metadata (exiftool) — it leaks usernames, internal paths, and software versions.
- Breach data reveals real password patterns for an org — invaluable for later spraying (within scope).
- Keep OPSEC in mind: some 'OSINT' actions (port scans, logins) are active and out of passive scope.