Reconnaissance on archive URLs

8 hours ago 5
BOOK THIS SPACE FOR AD
ARTICLE AD

Mayank Kumar Prajapati

Wayback machine is a digital archive of world wide web that allows users to access and view historical snapshots of web pages, enabling them to see how a website looked and functioned at various points in time. This is achieved by periodically crawling and storing copies of web pages.

1. waybackurls

Waybackurls is a golang based tool that can be used for crawling domains archived/indexed by wayback machine.

2. unfurl

This is another great tool to have that is used to perform filtration on endpoints extracted from wayback archieve using waybackurls tool.

waybackurls example.com > output.txt

You can also refer to the below link to have a quick look on browser UI of the archive URLs. Just replace evil.com with your target/domain.

https://web.archive.org/cdx/search/cdx?url=*.evil.com&fl=original&collapse=urlkey

(i). Enumerating subdomains

Subdomains enumeration is also possible from the URLs enumerated from waybackurls tool.

cat output.txt | unfurl --unique domains

(ii). Searching for tokens

cat output.txt | unfurl --unique values | grep -E '^ey.*\..*\..*'

(iii). Finding parameters

cat output.txt | unfurl --unique keys
cat output.txt | grep auto= | head -n 1

Thanks for reading this:) Hope it helps..

Read Entire Article