-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
brainstorming limitations and features #13
Comments
What about getting actual browsing data from volunteers to analyze multiple behaviours and decide what is the best option? |
@XayOn I think that would be the best idea I will look into my browsing history today to see what I can see. |
To add to this, some of the modules have a possibility of generating traffic that could be harmful if not outright incriminating. Without some kind of "safe mode", users could be putting themselves at real risk. The project could take advantage of services like Google "Safe Search" or MyWOT, but this would probably make real traffic easier to spot at the same time. |
@XayOn @NeuroWinter sounds great! Nirsoft have a free tool at http://www.nirsoft.net/utils/browsing_history_view.html to extract history - if you anonymise and share then we can start parsing them, finding patterns, etc. For Chrome this could be pretty helpful: https://chrome.google.com/webstore/detail/web-historian-web-history/chpcblajbmmlbhecpnnadmjmlbhkloji @rationalcoding yes I agree. Would you mind creating an issue and taking ownership? Creating a list of English profanity words and cross-referencing with chosen words should do the trick for the majority of cases. Not sure how to tackle Alexas top 1M as it contains a lot of porn sites. |
I have just got back from holiday and I am willing to work on this a bit now. What sort of information are we looking for from a history dump? |
which may or may not be existing/need refining/in the works...
eg, my computer visiting 1000 random websites per day at 5 pages per minute is not going to be anywhere near convincing, given i visit a handful of sites in bursts normally (with that pattern already having been logged)
abstracted:
really abstracted:
I've said enough. Please close issue and destroy Github after reading.
🍺
The text was updated successfully, but these errors were encountered: