-
@mlkorra Do you have the field descriptions as well? I couldn't find them in the Twint docs.
-
I wanted to clarify the output for some fields (like mentions, urls, etc.).
-
Based on discussion with the CIS folks, we've decided to scrape all data fields other than cashtags for descriptive research.
-
The following are the fields of each tweet fetched by the Twitter scraper (twint):
{"id":,
"conversation_id":,
"created_at":,
"date":,
"time":,
"timezone":,
"user_id":,
"username":,
"name":,
"place":,
"tweet":,
"language":,
"mentions":,
"urls":,
"photos":,
"replies_count":,
"retweets_count":,
"likes_count":,
"hashtags":,
"cashtags":,
"link":,
"retweet":,
"quote_url":,
"video":,
"thumbnail":,
"near":,
"geo":,
"source":,
"user_rt_id":,
"user_rt":,
"retweet_id":,
"reply_to":,
"retweet_date":,
"translate":,
"trans_src":,
"trans_dest":}
Note: Fields in bold are currently stored in the database.
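As a rough illustration of working with records of this shape, here is a minimal Python sketch that parses one scraped tweet and leaves out selected fields before storage. It assumes the scraper output is available as one JSON object per line. The helper name `select_fields`, the sample values, and the choice to exclude only `cashtags` (taken from the discussion in this thread) are illustrative assumptions, not part of twint.

```python
import json

# Fields to leave out of the stored record.
# Assumption: only "cashtags" is excluded, per the discussion in this thread.
EXCLUDED_FIELDS = {"cashtags"}


def select_fields(json_line, excluded=EXCLUDED_FIELDS):
    """Parse one JSON line and return the record without the excluded keys."""
    record = json.loads(json_line)
    return {key: value for key, value in record.items() if key not in excluded}


# Made-up sample line containing only a few of the fields listed above.
sample = (
    '{"id": 1, "username": "example_user", "tweet": "hello", '
    '"hashtags": [], "cashtags": []}'
)
selected = select_fields(sample)
print(sorted(selected))
```

Keeping the exclusion list in one place makes it easy to revisit if the set of stored fields changes.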
Additional fields in the database:
"timestamp_of_scraping",
"type" -> whether the tweet was fetched using a keyword, hashtag, or user handle,
"search" -> search term used to scrape,
"content_type" -> whether the tweet contains text, image, GIF, or video,
"s3_url" -> URL of the media uploaded to the S3 bucket
Please go through the data fields and mention/discuss whether any of the fields are helpful for the task.
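To make the additional database fields concrete, here is a minimal Python sketch of how they might be attached to a scraped tweet. The function name `annotate`, the exact string values for `type` and `content_type`, the ISO 8601 UTC timestamp format, and the sample S3 URL are all assumptions for illustration; the actual pipeline may differ.

```python
from datetime import datetime, timezone

# Assumed allowed values, based on the field descriptions in this post.
SEARCH_TYPES = {"keyword", "hashtag", "userhandle"}
CONTENT_TYPES = {"text", "image", "gif", "video"}


def annotate(tweet, search_type, search_term, content_type, s3_url=None):
    """Return a copy of the tweet dict with the additional database fields."""
    if search_type not in SEARCH_TYPES:
        raise ValueError(f"unknown search type: {search_type!r}")
    if content_type not in CONTENT_TYPES:
        raise ValueError(f"unknown content type: {content_type!r}")

    record = dict(tweet)  # copy, so the scraped dict is left untouched
    record.update(
        {
            # Assumption: scrape time is stored as an ISO 8601 UTC string.
            "timestamp_of_scraping": datetime.now(timezone.utc).isoformat(),
            "type": search_type,
            "search": search_term,
            "content_type": content_type,
            "s3_url": s3_url,  # None when no media was uploaded
        }
    )
    return record


# Made-up example values.
stored = annotate(
    {"id": 1, "tweet": "hello"},
    search_type="hashtag",
    search_term="#example",
    content_type="image",
    s3_url="s3://example-bucket/media/1.jpg",
)
print(stored["type"], stored["content_type"])
```

Validating `type` and `content_type` up front keeps unexpected values out of the database, which helps when filtering by these fields later.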