sherlock

Commit Graph

Author	SHA1	Message	Date
Siddharth Dushantha	cc8192f45c	removed option to present websites ordered by their Alexa.com global rank in popularity refer to #610 for reason of removal	4 years ago
Siddharth Dushantha	38c987e4ea	version bump 0.12.2 --> 0.12.3	4 years ago
Siddharth Dushantha	f3180b73a0	fixed typo bowser --> browser	4 years ago
Christopher K. Hoadley	29eb2ddf2d	Update site list and version.	4 years ago
Christopher K. Hoadley	7505467020	Update site list and version.	4 years ago
탁동혁	cdfb97f6e6	Use `is` operator when comparing singletons.	4 years ago
Danny Sepler	3fa85c1576	Clean out unused imports	4 years ago
Christopher K. Hoadley	8619a353e4	Add override to HTTP Status Detection so HEAD request is not used. Configure Instagram to use this option. In most cases when we are detecting by status code, it is not necessary to get the entire body: we can detect fine with just the HEAD response. However, Richard Getz discovered that some sites (e.g. Instagram) will not respond properly if Sherlock only requests the HEAD. Add a "request_head_only" attribute to the data.json so HTTP Status Detection can be configured either way. It is simpler to support this change in this fashion, as it does not require changes to the tests. With Richard Getz <richardgetziii@gmail.com>	4 years ago
Christopher K. Hoadley	0ba4980887	Update site list. Update version.	4 years ago
Christopher K. Hoadley	6caa5a4e35	Restructure all print output to use QueryNotifyPrint() object. Added start and finish methods to base QueryNotify() object in order to get the same type of output. The only thing not supported in this refactor is the exception text for an error status when we are in verbose mode. This is an area of future work: I think exception information like this would more properly be handled by the logging module.	4 years ago
Christopher K. Hoadley	ae2fd7a729	Add username and site information to Query Result object. This will allow whoever defines a Query Notify object to have all of the context required to do their notifications.	5 years ago
Christopher K. Hoadley	c07d3967aa	Create QueryNotify() base class for notifying the caller about the results.	5 years ago
Christopher K. Hoadley	c054712920	Move response time for query into QueryResult() object.	5 years ago
Christopher K. Hoadley	92879af8eb	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # README.md # sherlock/sherlock.py	5 years ago
Christopher K. Hoadley	32385a1d8d	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	e41b485771	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	1101af8132	Add @sdushantha suggestion in creating directory.	5 years ago
Christopher K. Hoadley	8f6938ecb1	Add option to not print out results. Configure tests to there is no print output. This simplifies looking at the error output when the tests fail.	5 years ago
Christopher K. Hoadley	2e195d4439	Move all writing of output files to occur after query takes place. Use with statement for results file, as that is more graceful on errors. Use try block for result directory creation: this has a smaller window for a race condition.	5 years ago
Christopher K. Hoadley	123e4d47e0	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	ba0a44e0ae	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # sherlock/resources/data.json	5 years ago
Christopher K. Hoadley	7f87f5fcc4	Add module to store information about the sites. This handles getting the information loaded from the JSON file. For now, use the new SitesInformation() object to calculate the original JSON dictionary: the rest of the code will be updated in the future.	5 years ago
Christopher K. Hoadley	647aea577c	Factor out all print statements from portion of code that determines the query results.	5 years ago
Christopher K. Hoadley	bbb44d7ef9	Add defensive check for unknown Error Type. If it does happen, an exception will be thrown, instead of using the previous site's results.	5 years ago
Christopher K. Hoadley	2a8f83924d	Remove some unneeded imports. Add minor comment.	5 years ago
Christopher K. Hoadley	2a1ab1c281	Add result module to hold results of site queries. The QueryResult() object contains an enumeration for the possible status about a given username on a site, and additional error information that might be handy. Rework all code to use this object instead of the "exists" key in the result dictionary that was used previously.	5 years ago
Christopher K. Hoadley	519ac34346	Extract all print statements from function that gets the response. Also, print out social network for error messages.	5 years ago
Christopher K. Hoadley	6114ca263d	Remove Proxy List Support While doing the restructuring, I am testing in more depth as I change the code. And, I am trying to grok how the proxy options work. Specifically, how the proxy list works. Or, does not work. There is code in the main function that randomly selects proxies from a list, but it does not actually use the result. This was noticed in #292. It looks like the only place where the proxy list is used is when there is a proxy error during get_response()...in that case a new random proxy is chosen. But, there is no care taken to ensure that we do not get the same proxy that just errored out. It seems like problematic proxies should be blacklisted if there is that type of failure. Moreover, there is a check earlier in the code that does not allow the proxy list and proxy command line option to be used simultaneously. So, I can see no way that the proxy list has any functionality: if you do define the proxy list, then there is no way to kick off the general request with a proxy. I also noticed that the recursive get_response() call does not pass its return tuples back up the call chain. The existing code would never get any good from the switchover to an alternate proxy (even if the other problems mentioned above were resolved). For now, I am removing the support. This feature may be looked at after the restructuring is done.	5 years ago
Christopher K. Hoadley	9e8e1a5aa4	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # sherlock/sherlock.py	5 years ago
Christopher K. Hoadley	0a455339c2	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	b92b931504	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # sherlock/sherlock.py	5 years ago
Christopher K. Hoadley	b0b7608549	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # sherlock/sherlock.py	5 years ago
Christopher K. Hoadley	f89fe03b34	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # sherlock/sherlock.py	5 years ago
Christopher K. Hoadley	a900d0f674	Merge remote-tracking branch 'origin/master' into restructure_take1 # Conflicts: # sherlock/sherlock.py	5 years ago
Christopher K. Hoadley	571f27f9a5	Clean up code that only reads the headers if we are doing HTTP Status detection. Remove special check for GitHub: everything works fine without it.	5 years ago
Christopher K. Hoadley	2268f8186c	Remove unneeded assignment of results before main Sherlock call.	5 years ago
Christopher K. Hoadley	232a5f7d07	Clarify meaning of "Total Websites" count. It is really "Total Websites Username Detected On".	5 years ago
Christopher K. Hoadley	ecf097e00d	Get rid of global variable amount. This is no longer used.	5 years ago
Christopher K. Hoadley	96ac2f15fd	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	d8c8cd8e7e	Leave the elapsed time for the request in seconds. Move format of time into ms to the presentation layer.	5 years ago
Christopher K. Hoadley	b02ef5e6e2	Use monotonic() time function. This method is guaranteed to not do flaky things during leap seconds or daylight savings times jumps.	5 years ago
Christopher K. Hoadley	d6d6b83b06	Add documentation to extension to FuturesSession() which we use to get timing information. Fix problem where timing hook would not be installed properly if the hooks for the request was already filled out with a tuple. I am not sure if that is even possible, but if it does happen, then I just convert the tuple to a list and go on from there.	5 years ago
Christopher K. Hoadley	578248bf8e	Do not create session and request objects if we are going to end up creating new one for Tor requests. This just wastes time.	5 years ago
Christopher K. Hoadley	8ef999fffd	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	1587df0860	Merge remote-tracking branch 'origin/master' into restructure_take1	5 years ago
Christopher K. Hoadley	d17268e737	Move content into sherlock sub-directory inside of repo. This directory will look very much like what the packaged version of Sherlock will look like when it is installed in the site-packages area. No real restructuring of the code has happened. This just gives a view of the directory structure.	5 years ago

46 Commits (cc8192f45c359e4e92b7343f94363910a3451549)