ThalosES/vinted-db-feeder

Vinted data scraper

A series of Python data scrapers that extract static information from the Vinted source code, intended as a complement to a Vinted API wrapper.

Install & Run

Requirements

  • Pip
  • Python

  1. Create and activate a virtual environment

    python3 -m venv pyvenv/
    source pyvenv/bin/activate
  2. Install the dependencies listed in requirements.txt

     pip3 install -r requirements.txt
  3. Run main.py

Extracted data categories

| Element             | Fields Returned                                        |
|---------------------|--------------------------------------------------------|
| Brands              | Names, Ids                                             |
| Materials           | Id, Name                                               |
| Colors              | Id, Color, Hex Code                                    |
| Sizes               | Id, Title, Size_Type, Category_id                      |
| Categories          | Id, Title, Code, Parent Id, URL, URL EN, Item Count    |
| Categories Children | Category Id, Child Id                                  |
| Countries           | Id, French_name, local_name, ISO_code, flag_emoji      |
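
As a rough illustration of how one of these elements could be represented and serialized, here is a minimal sketch for Colors. The `Color` class, the snake_case field names, the `write_rows` helper and the sample row are all assumptions made for this example; the project's actual types and output headers may differ.

```python
import csv
import io
from dataclasses import asdict, dataclass, fields


@dataclass
class Color:
    # Field names mirror "Id, Color, Hex Code" from the table above
    # (hypothetical snake_case spelling).
    id: int
    color: str
    hex_code: str


def write_rows(rows, out):
    """Write Color rows as CSV with a header line."""
    writer = csv.DictWriter(out, fieldnames=[f.name for f in fields(Color)])
    writer.writeheader()
    for row in rows:
        writer.writerow(asdict(row))


# Made-up sample data, written to an in-memory buffer instead of a file.
buffer = io.StringIO()
write_rows([Color(1, "Black", "000000")], buffer)
```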

Materials and sizes

  • Available languages: 🇪🇸 🇫🇷 🇺🇸
    • More languages can be added by placing the corresponding HTML file in data/raw/materials or data/raw/sizes
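
One way to see which languages are currently available is to list the HTML files in those folders. The sketch below assumes one file per language, named after its language code (for example `es.html`); that naming scheme and the `available_languages` helper are hypothetical and not taken from the repository.

```python
from pathlib import Path


def available_languages(raw_dir):
    """Return the sorted file stems of every .html file in raw_dir."""
    return sorted(path.stem for path in Path(raw_dir).glob("*.html"))


if __name__ == "__main__":
    # Paths taken from the README; run from the repository root.
    for folder in ("data/raw/materials", "data/raw/sizes"):
        print(folder, available_languages(folder))
```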

Categories - Catalogs

  • Debug mode: Builds the full decision tree

  • Exec mode: Returns two CSV files:

    • categories.csv: Table of all the available categories and their attributes

    • categories_children.csv: Table that models the relationship between each category and its list of children
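
The two files can be combined to walk the category tree. The following is a minimal sketch that assumes categories_children.csv has a header row with columns named `category_id` and `child_id`; the real column names may differ, and the helper functions and sample rows are made up for illustration.

```python
import csv
import io
from collections import defaultdict


def load_children(children_file):
    """Build {category_id: [child_id, ...]} from an open CSV file."""
    children = defaultdict(list)
    for row in csv.DictReader(children_file):
        children[int(row["category_id"])].append(int(row["child_id"]))
    return dict(children)


def descendants(children, root):
    """Return every category id below root, in depth-first order."""
    found = []
    stack = list(reversed(children.get(root, [])))
    while stack:
        node = stack.pop()
        found.append(node)
        # Push this node's children so they are visited before its siblings.
        stack.extend(reversed(children.get(node, [])))
    return found


# Made-up rows standing in for categories_children.csv.
sample = io.StringIO("category_id,child_id\n1,2\n1,3\n2,4\n")
tree = load_children(sample)
```

With the real file, `load_children(open("categories_children.csv", newline=""))` would take the place of the in-memory sample, provided the header matches the assumed column names.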

Performance

  • Without brands search:

    real    0m1,941s
    user    0m1,225s
    sys     0m0,037s
  • Brands validation process:

    real    14m14,211s
    user    0m19,753s
    sys     0m1,229s

Authors

Álvaro Cabo

Pepe Márquez
