<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jock Today &#187; google dork</title>
	<atom:link href="http://www.jocktoday.com/tag/google-dork/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.jocktoday.com</link>
	<description>Latest security protection and open source development tools</description>
	<lastBuildDate>Mon, 05 Mar 2012 21:20:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>How To: Enumerate Directories and Files</title>
		<link>http://www.jocktoday.com/2010/02/12/enumerate-directories-and-files/</link>
		<comments>http://www.jocktoday.com/2010/02/12/enumerate-directories-and-files/#comments</comments>
		<pubDate>Fri, 12 Feb 2010 17:27:21 +0000</pubDate>
		<dc:creator>jpereira</dc:creator>
				<category><![CDATA[Information Disclosure]]></category>
		<category><![CDATA[Technique]]></category>
		<category><![CDATA[dirbuster]]></category>
		<category><![CDATA[directory]]></category>
		<category><![CDATA[directory enumeration]]></category>
		<category><![CDATA[enumeration]]></category>
		<category><![CDATA[file]]></category>
		<category><![CDATA[google dork]]></category>
		<category><![CDATA[intellitamper]]></category>
		<category><![CDATA[traversal]]></category>
		<category><![CDATA[web archive]]></category>
		<category><![CDATA[web site scanner]]></category>
		<category><![CDATA[webarchive]]></category>

		<guid isPermaLink="false">http://www.jocktoday.com/?p=154</guid>
		<description><![CDATA[How can a pen-tester identify directories and files that are hidden from the public? This is where it gets easy.]]></description>
			<content:encoded><![CDATA[<p>Information gathering is a key component to web application penetration testing. In fact it should be the first phase in any security assessment: cull as much information as possible about your target before you start any specific pen-testing.</p>
<p>One key area of information gather is understanding the directory structure of a server and enumerating each directories files. This is commonly called &#8220;directory enumeration&#8221; or &#8220;web site scanner&#8221; or sometimes &#8220;directory traversal&#8221;.</p>
<p>One easy way to do this is by viewing a web servers directory listing/index. Automatic directory listing/indexing is a web server function that lists all of the files within a requested directory if the normal base file (eg index.php) is not present. If a web server has automatic directory listing/indexing turned on then there is a high probability that its administrator[s] are not security conscious. Typically this is turned off.</p>
<p>So then, how can a pen-tester identify directories and files that are hidden from the public? This is where it gets easy. Typically web administrators run on the assumption that if directory listing is turned off and a particular directory is not indexed by search engines that the data will remain hidden. This is not the case.</p>
<p>The following tools are what I consider to be best-of-class for identifying directories and files. Some of the tools are actually methods that can be executed manually and should be used in conjunction with the tools. I have listed these in the order in which I recommend their use. And I recommend using all of them &#8211; as is almost always the case with security testing there is no one single golden tool &#8211; multiple tools must be used for each phase of your security testing. They will produce duplicate results but they will also return their own unique results. To be thorough use each tool described in this list in the order in which they are listed:</p>
<p><strong>1. Google Dork. </strong></p>
<p><strong>Description</strong>: Google has an almost magical way of finding directories, even those that are hidden. You will be surprised with some of the data that Google stores&#8230; you will be at a loss to explain how Google finds your web applications hidden directories.</p>
<p><strong>Usage</strong>: Use the simple Google Dork search string: site:yoursite.com and hit enter. The resulting links are specific files and directories that Google has indexed within your yoursite.com domain.</p>
<p>Open a blank spreadsheet and start recording the results. Remember, this is just the information gathering phase of your testing. Do not be tempted to dig in and start testing what you find or you&#8217;ll never complete this all too important phase.</p>
<p><strong>2. The Web Archive (web.archive.org)</strong></p>
<p><strong>Description</strong>: The Internet Archive is a  non-profit that was founded to build an Internet library. You can find snapshots of a web site stored monthly going back to (possibly) the month it was launched. What makes the Web Archive useful? Searching through it can help you find directories and files that were indexed in the past but hidden today.</p>
<p><strong>Usage</strong>: http://web.archive.org/web/*/http://yoursite.com/*</p>
<p>This will return a list of all directories and files that the Web Archive has stored.</p>
<p>Record each new directory/file that is currently active (click the links to ensure the files exist today) in your spreadsheet.</p>
<p><strong>3. The robots.txt file</strong></p>
<p><strong>Description</strong>: Web site administrators use the /robots.txt file to give instructions about their site to web robots such as Microsoft Bing and Google. It works like this: a robot wants to visit a web site URL, say http://www.yoursite.com/welcome.html. Before it does so, it first checks for the existance of /robots.txt to ensure that it should visit that page. In many cases this file will contains specific directories that web robots should not crawl. In essence this file is a double edged sword; it prevents indexing of directories that should remain hidden but it gives away the names of these directories.</p>
<p><strong>Usage</strong>: Browse to http://www.yoursite.com/robots.txt. Look for specific directories that are disallowed. The format will look like this:</p>
<p>User-agent: *<br />
Disallow: /cgi-bin/</p>
<p>In the example above the web site is telling the web robots to ignore the cgi-bin directory but potentially telling you that this directory exists. There is a caveat here &#8211; some web site administrators will use this file to create a sort of trap door. Knowing that only hackers will attempt to visit such directories they may put in a &#8216;fake&#8217; directory, capture your IP address when you visit it and, perhaps, ban you from the site.</p>
<p><strong>4. OWASP DirBuster</strong></p>
<p><strong>Description</strong>: This is, hands down, my current favorite file/directory enumeration tool. It is light-years beyond any other I have tested. Its powerful, highly customizable and blazinging flast. There is a warning here &#8211; it is possible to bring a web server to its knees with DirBuster by bumping up the number of concurrent threads; it supports up to 100.</p>
<p>DirBuster works in a number of different ways. It comes with many default dictionary files (small, medium, large and upper/lowercase) that can be used to perform &#8216;list based brute force&#8217;. However it can also be used in &#8216;pure brute force&#8217; mode where it scans for files and directories using a character set (a-z, A-Z, 0-9, %20-_) with settings for minimum and maximum length.</p>
<p><strong>Usage</strong>: Enter a target url, select the number of threads and choose the type of scan and click start. DirBuster even gives you an estimated number of hours or days left for each running scan.</p>
<p><strong>5. Intellitamper</strong></p>
<p><strong>Description</strong>: This tool has been kicking around for a while but has not been updated in years. An initial warning: versions of it mascarade around on the Internet that include malware/spyware so be sure you find yourself a legitimate copy. That being said Intellitamper should definitely be a part of your enumeration arsenal. Intellitamper uses a dictionary file (like DirBuster) to try to find hidden directories and files.</p>
<p>With some web servers it will return false positives and you will see an enormous list of directories that do not actually exist.</p>
<p>Intellitamper does not perform pure brute force, just list based brute force.</p>
<p><strong>Usage</strong>: Enter the URL and you are good to go.</p>
<p><strong>6. Wikto</strong></p>
<p><strong>Description</strong>: Wikto is a web server assessment tooland it contains a module called back-end miner which is used to enumerate directories and files. Typically this tool does not return a lot of directories but it will surprise you once in a while; it is definitely worth using on a regular basis.</p>
<p><strong>Usage</strong>: Enter a URL and you are off and running.</p>
<p><strong>7. Nikto</strong></p>
<p><strong>Description</strong>: Nikto is the unix environment cousin to Wikto. If you are running Windows you can use Nikto by installing Active State Perl.</p>
<p><strong>Usage</strong>: &#8220;perl nikto.pl -h http://yoursite.com&#8221;</p>
<p><strong>8. Tennable Nessus</strong></p>
<p><strong>Description</strong>: Tennable Nessus is commercial software and it includes a module for directory enumeration. It typically returns far fewer directories than IntelliTamper or DirBuster but it is still worth running this tool each time you want a comprehensive list of directories/files on web server.</p>
<p><strong>Usage</strong>: Launch, start the Nessus Server and enter the URL.</p>
<p><strong>9. Paros Proxy</strong></p>
<p><strong>Description</strong>: Paros Proxy is a web application security assessment tool. This tool includes a nice spider function that crawls through a web site and creates a pretty comprehensive list of files and directories. This is based on following links and cannot be considered a brute force tool. The tool itself has not been updated since November of 2004.</p>
<p>That being said Paros Proxy should definitely be in your enumeration toolbelt. It is effective and will surprise you with the things it can find.</p>
<p><strong>Usage</strong>: Launch Paros Proxy. Open IE/Firefox and configure a proxy toaccess the internet. By default the HTTP proxy should be 127.0.0.1 on port 81. Next, browse to the URL. When the URL appears in Paros Proxy select it and choose Analyze/Spider.</p>
<p><strong>10. http-dir-enum</strong></p>
<p><strong>Description</strong>: Last but not least is the perl script http-dir-enum. This program guesses directory names within a website using a wordlist of potential directory names. It supports Basic Authentication HTTP Keep-alive, proxies, cookies and can save the results in XML format.</p>
<p><strong>Usage</strong>: Install Active State Perl. Run this from the command line: &#8220;perl http-dir-enum.pl -f directory-names.txt http://yoursite.com&#8221;</p>
<p>The ten tools described above, when used in conjunction with one another, provide what I believe to be the most comprehensive method of enumerating directories and files.</p>
<p>Do you know of a tool that should have made this list? Let me know!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.jocktoday.com/2010/02/12/enumerate-directories-and-files/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

