Go through notes #1

MichaelCurrin · 2020-06-02T18:53:14Z

From plan.txt Dec 2018

Aim:
    Recover URLs which I want to bookmark
        Requires manually reading the page
            Some can be ignored if too much repetition on area or don't need
        Could add to bookmarks to avoid duplication and use some manual sorting into folder
    Make them easy to find
    Read them
    Generate once off report as CSV
        Need domains and pages together
        But also group by visited in periods - column for page to filter by
        Count instead of actual dates

Using frozen dump to recover tab from past year. Afterwards things are sent to bookmarks.


Parsing
    urlparse('')
    => ParseResult(scheme='', netloc='', path='', params='', query='', fragment='')

    from urllib.parse import ParseResult*
    x = ParseResult(*('scheme', 'netloc', 'path', 'params', 'query', 'fragment'))
    x.geturl()
    'scheme://netloc/path;params?query#fragment'



Unicode
    Errors were just in VC maybe? PyCharm is fine.

    TODO: Find out what encoding is used to make use of unicode characters which appear in URLs (such as equals sign) and possibly emojis or at least show emojis as ASCII.
    Some titles contain emojis. Normal unicode characters can only be parsed after emojis are replaced.
    Some URLs are broken
    Check for which sites
    URL can be found using title and domain search.

    https://stackoverflow.com/questions/33485255/python-decoding-a-string-that-consists-of-both-unicode-code-points-and-unicode
    Input either
        codecs.decode('\\u002d', 'unicode_escape')
        '\u002d'
    Gives
        '-'

Categories from transitions
{
    'LINK': 11626,
    'TYPED': 731,
    'AUTO_BOOKMARK': 127
    'RELOAD': 2313, Work keeping just in case.

    'GENERATED': 588,  Generated - google searches
    'AUTO_TOPLEVEL': 579, Chrome native - ignore
    'FORM_SUBMIT': 528,
}


Firebase could be a backend to access from anywhere but still need frontend to be setup on work and home laptop and reachable by cellphone if using that.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Go through notes #1

Go through notes #1

MichaelCurrin commented Jun 2, 2020

Go through notes #1

Go through notes #1

Comments

MichaelCurrin commented Jun 2, 2020