Update spidersel.py
adjust splitting to allow German umlauts
l4rm4nd authored Sep 13, 2023
1 parent 09c1f94 commit 16b9ee0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion spidersel.py
@@ -43,7 +43,7 @@ def is_email(input_string):

 def filter_keywords(keywords):
     filtered_keywords = []
-    split_pattern = r'[^a-zA-Z0-9]+'
+    split_pattern = r'[^a-zA-Z0-9äüö]+'
     email_pattern = re.compile(r'([A-Za-z0-9]+[.-_])*[A-Za-z0-9]+@[A-Za-z0-9-]+(\.[A-Z|a-z]{2,})+')

     for keyword in keywords:
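
The effect of the changed character class can be illustrated with a minimal sketch. It assumes the pattern is applied with `re.split`, which the hunk does not show; `split_keyword` and the two constant names are hypothetical helpers introduced only for this illustration and are not part of `spidersel.py`.

```python
import re

# The two patterns from the diff: before and after the commit.
OLD_SPLIT_PATTERN = r'[^a-zA-Z0-9]+'
NEW_SPLIT_PATTERN = r'[^a-zA-Z0-9äüö]+'


def split_keyword(keyword, pattern):
    """Hypothetical helper: split on runs of disallowed characters.

    Assumes the real code uses re.split with split_pattern; empty
    pieces (e.g. from a leading separator) are dropped.
    """
    return [part for part in re.split(pattern, keyword) if part]


# Before the change, every umlaut acts as a separator and breaks the word.
print(split_keyword("müller-lüdenscheidt", OLD_SPLIT_PATTERN))

# After the change, lowercase umlauts survive; only the hyphen splits.
print(split_keyword("müller-lüdenscheidt", NEW_SPLIT_PATTERN))

# Uppercase umlauts and 'ß' are not in the new class, so they still split.
print(split_keyword("Österreich", NEW_SPLIT_PATTERN))
print(split_keyword("straße", NEW_SPLIT_PATTERN))
```

Under these assumptions, the new class keeps only lowercase `ä`, `ü`, and `ö` in precomposed form. Keywords containing `Ä`, `Ö`, `Ü`, or `ß` would still be broken apart, so extending the class to those characters may be worth considering if such input is expected.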
