ScrapIn

This module is a selenium wrapper tailored to searching people or jobs on linkedin given a set of criteria.

WARNING: While supposedly legal to scrape linkedin (see this legal proceeding), I do not take any responsibility for any retaliation Linkedin might enact against the user-account associated with the use of this package.

System

As of: 2019-09-20

CPython 3.7.3

numpy 1.16.4
pandas 0.24.2
selenium 3.141.0
bs4 4.7.1

compiler : GCC 7.3.0
system : Linux
release : 5.0.0-29-generic
machine : x86_64
processor : x86_64
CPU cores : 12
interpreter: 64bit

Setup

To get started, run the following commands:

pip install --upgrade pip
git clone [email protected]:maximemerabet/scrapIn.git
cd scrapIn
python3 -m venv .env
source .env/bin/activate
pip install -r requirements.txt

You will also need to download the chromedriver corresponding to your OS and chrome version.
These can be found here. Once downloaded, please move the executable to scrapIn/. and run the following command: chmod +x chromedriver

(Please note that you can use selenium's recommended driver location but will need to overwrite the chromedriver path in config.py or overwrite the parameter in scraper.Authenticator.login)

How to use:

The following is an example code for a people-specific search (notebook demo can be found here)

# Append to python path and import
from src.scraper import Authenticator, Scraper

# Credential prompt
auth = Authenticator() 

# Instantiate scraper object
scraper = Scraper(authenticator=auth)

# Execute search (returns DataFrame)
people_search = scraper.search_people(job_title='Executive Chef',
                                      location=['London, 'Manchester'],
                                      industry='Hospitality')

TODO

Add deep-search functionality (Given a list of profile URL, retrieve more detailed information)
Write up test_config to test for the presence of required elements on the linkedin web-page
(Optional) Add functionality to execute other types of search (i.e. jobs, etc.)

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
examples		examples
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ScrapIn

System

Setup

How to use:

TODO

About

Releases

Packages

Languages

mm5631/scrapIn

Folders and files

Latest commit

History

Repository files navigation

ScrapIn

System

Setup

How to use:

TODO

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages