brainstorming limitations and features #13

whilei · 2017-06-19T03:50:25Z

which may or may not be existing/need refining/in the works...

randomized (but not TOO randomized) intervals... as below, general pattern mimicry would be ideal; exactly randomly between 1-10 seconds is not; humans are not just gravel, also rocks and boulders.
customizeable word lists
- as crazy as it sounds, a chrome plugin to record actual searches and thereby use real starting data for mimicry might be effective (again, obfuscation vs privation)
variety of request types, ie POST, PATCH, DELETE... more tricky, but filtering vs GETS would be the first thing I'd do looking for real human logs
controlled variety of 'quest' depth. google+1click and then google something completely unrelated+1click is not convincing.

eg, my computer visiting 1000 random websites per day at 5 pages per minute is not going to be anywhere near convincing, given i visit a handful of sites in bursts normally (with that pattern already having been logged)

abstracted:

usage patterns that are not static randomness, but sporadic and clumpy, reasonably nonlinear
mimicry of actual/personalizeable trends in content

really abstracted:

better to make a handful of knitting needles than a busload of thumbtacks

I've said enough. Please close issue and destroy Github after reading.
🍺

XayOn · 2017-06-26T00:00:45Z

What about getting actual browsing data from volunteers to analyze multiple behaviours and decide what is the best option?

NeuroWinter · 2017-07-02T22:37:34Z

@XayOn I think that would be the best idea I will look into my browsing history today to see what I can see.

t-mullen · 2017-07-05T04:23:56Z

To add to this, some of the modules have a possibility of generating traffic that could be harmful if not outright incriminating. Without some kind of "safe mode", users could be putting themselves at real risk.

The project could take advantage of services like Google "Safe Search" or MyWOT, but this would probably make real traffic easier to spot at the same time.

eth0izzle · 2017-07-06T10:34:52Z

@XayOn @NeuroWinter sounds great! Nirsoft have a free tool at http://www.nirsoft.net/utils/browsing_history_view.html to extract history - if you anonymise and share then we can start parsing them, finding patterns, etc. For Chrome this could be pretty helpful: https://chrome.google.com/webstore/detail/web-historian-web-history/chpcblajbmmlbhecpnnadmjmlbhkloji

@rationalcoding yes I agree. Would you mind creating an issue and taking ownership? Creating a list of English profanity words and cross-referencing with chosen words should do the trick for the majority of cases. Not sure how to tackle Alexas top 1M as it contains a lot of porn sites.

NeuroWinter · 2017-07-30T20:27:13Z

I have just got back from holiday and I am willing to work on this a bit now.

What sort of information are we looking for from a history dump?

t-mullen mentioned this issue Jul 6, 2017

Filter unwanted/inappropriate websites #16

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

brainstorming limitations and features #13

brainstorming limitations and features #13

whilei commented Jun 19, 2017 •

edited

Loading

XayOn commented Jun 26, 2017

NeuroWinter commented Jul 2, 2017

t-mullen commented Jul 5, 2017 •

edited

Loading

eth0izzle commented Jul 6, 2017 •

edited

Loading

NeuroWinter commented Jul 30, 2017

brainstorming limitations and features #13

brainstorming limitations and features #13

Comments

whilei commented Jun 19, 2017 • edited Loading

XayOn commented Jun 26, 2017

NeuroWinter commented Jul 2, 2017

t-mullen commented Jul 5, 2017 • edited Loading

eth0izzle commented Jul 6, 2017 • edited Loading

NeuroWinter commented Jul 30, 2017

whilei commented Jun 19, 2017 •

edited

Loading

t-mullen commented Jul 5, 2017 •

edited

Loading

eth0izzle commented Jul 6, 2017 •

edited

Loading