Friday, 28 October 2016

Google queries for advanced search and for vulnerability testers



A web search engine is a software application which crawls the web to index it and provides the information based on the user search query. Some search engines go beyond that and also extract information from various open databases. Usually the search engines provide real-time results based upon the backend crawling and data analysis algorithm they use. The results of a search engine are usually represented in the form of URLs with an abstract.

Apart from usual web search engines, some search engines also index data from various forums, and other closed portals (require login). Some search engines also collect search results from various different search engines and provide it in a single interface.Google is one of the most widely used search engines and is the starting point for web exploration for most of us.

Google is now worth billions and has its own place within the Oxford English Dictionary as a verb, but it took two men with a big dream to turn a small idea into a reality that has made a significant contribution to how the world uses the internet. Larry Page and Sergey Brin were both PhD candidates when they met in 1996 at Stanford and came up with the concept for a search engine that they were going to name BackRub,Google is now worth billions and has its own place within the Oxford English Dictionary as a verb, but it took two men with a big dream to turn a small idea into a reality that has made a significant contribution to how the world uses the internet. Larry Page and Sergey Brin were both PhD candidates when they met in 1996 at Stanford and came up with the concept for a search engine that they were going to name BackRub, on the 14th September 1997 Google.com was officially registered as a domain name.


Google serves some 80 percent of all search queries on the Internet, making it by far the most popular search engine. Its popularity is due not only to excellent search effectiveness, but also extensive querying capabilities. Google makes it possible to reach not only the publicly available information resources, but also gives access to some of the most confidential information that should never have been revealed.


Here is a summary of the most important and most useful query operators along with their descriptions.The right query can yield some quite remarkable results.

site:


Restricts results to sites within the specified domain
site:papersboot.blogspot.com fox will find all sites containing the word fox, located within the *papersboot.blogspot.com domain

intitle


Restricts results to documents whose title contains the specified phrase
intitle:facebook fire will find all sites with the word fox in the title and fire in the text

inurl


Restricts results to sites whose URL contains the specified phrase
inurl:papersboot fire will find all sites containing the word fire in the text and fox in the URL

filetype


Restricts results to documents of the specified type
filetype:pdf fire will return PDFs containing the word fire, while filetype:xls fox will return Excel spreadsheets with the word fox

ext


Restricts results to documents of the specified type
ext:pdf fire will return PDFs containing the word fire, while ext:xls fox will return Excel spreadsheets with the word fox

numrange


Restricts results to documents containing a number from the specified range
numrange:1-100 fire will return sites containing a number from 1 to 100 and the word fire. The same result can be achieved with 1..100 fire

link


Restricts results to sites containing links to the specified location
link: papersboot.blogspot.com will return documents containing one or more links to papersboot.blogspot.com

anchor


Restricts results to sites containing links with the specified phrase in their descriptions
anchor: download will return documents with links whose description contains the word download (that's the actual link text, not the URL indicated by the link)

allintext


Restricts results to documents containing the specified phrase in the text, but not in the title, link descriptions or URLs
allintext:"papersboot" will return documents which contain the phrase papersboot in their text only

+


specifies that a phrase should occur frequently in results
+papersboot will order results by the number of occurrences of the word papersboot

-


specifies that a phrase must not occur in results
-fire will return documents that don't contain the word fire

" "


delimiters for entire search phrases (not single words)
"fire fox" will return documents containing the phrase fire fox

.


wildcard for a single character
fire.fox will return documents containing the phrases fire fox, fireAfox, fire1fox, fire-fox etc.

*


wildcard for a single word
fire * fox will return documents containing the phrases fire the fox, fire in fox, fire or fox etc.

|


logical OR
"fire fox" | firefox will return documents containing the phrase fire fox or the word firefox

info:


The info operator provides information what Google has on a specific domain. Links to different types of information are present in the results, such as cache, similar websites, etc.
This is not all, sometimes Google also shows relevant information related to global events as and when they happen; for example, Cricket World Cup.
The operators we discussed are certainly very useful for anyone who needs to find out some information on the web, but the InfoSec community has certainly taken it to next level. These simple and innocent operators we just discussed are widely used in the cyber security industry to find and demonstrate how without even touching the target system, critical and compromising information can be retrieved. This technique of using Google search engine operators to find such information is termed as "Google Hacking."


No comments:

Post a Comment