You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[](https://pypi.org/project/waybacktweets)[](https://doi.org/10.5281/zenodo.12528447)[](https://waybacktweets.streamlit.app)[](https://colab.research.google.com/drive/1zRqi6uTMiGi5z8GQ-PC0tbpCJWULCqMO?usp=sharing)
4
4
5
-
Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML (for easy viewing of the tweets using the `iframe` tag), CSV, and JSON formats.
5
+
6
+
Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML, for easy viewing of the tweets using the iframe tags, CSV, and JSON formats.
6
7
7
8
## Installation
8
9
@@ -57,7 +58,7 @@ if archived_tweets:
57
58
## Acknowledgements
58
59
59
60
- Tristan Lee (Bellingcat's Data Scientist) for the idea of the application.
60
-
- Jessica Smith (Snowflake's Marketing Specialist) and Streamlit/Snowflake teams for the additional server resources on Streamlit Cloud.
61
+
- Jessica Smith (Snowflake's Community Growth Specialist) and Streamlit/Snowflake team for the additional server resources on Streamlit Cloud.
61
62
- OSINT Community for recommending the application.
Copy file name to clipboardExpand all lines: app/app.py
+11-17Lines changed: 11 additions & 17 deletions
Original file line number
Diff line number
Diff line change
@@ -34,7 +34,7 @@
34
34
layout="centered",
35
35
menu_items={
36
36
"About": f"""
37
-
[](https://github.com/claromes/waybacktweets/releases) [](https://github.com/claromes/waybacktweets/blob/main/LICENSE.md) [](https://github.com/claromes/waybacktweets)
The application is a prototype hosted on Streamlit Cloud, serving as an alternative to the command line tool.
40
40
@@ -168,16 +168,12 @@ def scroll_page():
168
168
169
169
# ------ User Interface Settings ------ #
170
170
171
-
st.info(
172
-
"🥳 [**Pre-release 1.0x: Python module, CLI, and new Streamlit app**](https://github.com/claromes/waybacktweets/releases)"# noqa: E501
173
-
)
174
-
175
171
st.image(TITLE, use_column_width="never")
176
172
st.caption(
177
-
"[](https://github.com/claromes/waybacktweets/releases) [](https://github.com/claromes/waybacktweets)"# noqa: E501
173
+
"[](https://github.com/claromes/waybacktweets/releases) [](https://github.com/sponsors/claromes)"# noqa: E501
178
174
)
179
175
st.write(
180
-
"Retrieves archived tweets CDX data in HTML (for easy viewing of the tweets using the `iframe` tag), CSV, and JSON formats."# noqa: E501
176
+
"Retrieves archived tweets CDX data in HTML (for easy viewing of the tweets using the iframe tag), CSV, and JSON formats."# noqa: E501
181
177
)
182
178
183
179
st.write(
@@ -291,15 +287,15 @@ def scroll_page():
291
287
292
288
# -- Rendering -- #
293
289
294
-
ifcsv_dataandjson_dataandhtml_content:
295
-
st.session_state.count=len(df)
296
-
st.write(f"**{st.session_state.count} URLs have been captured**")
290
+
st.session_state.count=len(df)
291
+
st.write(f"**{st.session_state.count} URLs have been captured**")
297
292
298
-
# -- HTML -- #
293
+
tab1, tab2, tab3=st.tabs(["HTML", "CSV", "JSON"])
299
294
300
-
st.header("HTML", divider="gray", anchor=False)
295
+
# -- HTML -- #
296
+
withtab1:
301
297
st.write(
302
-
f"Visualize tweets more efficiently through `iframes`. Download the @{st.session_state.current_username}'s archived tweets in HTML."# noqa: E501
298
+
f"Visualize tweets more efficiently through iframe tags. Download the @{st.session_state.current_username}'s archived tweets in HTML."# noqa: E501
303
299
)
304
300
305
301
col5, col6=st.columns([1, 18])
@@ -317,8 +313,7 @@ def scroll_page():
317
313
)
318
314
319
315
# -- CSV -- #
320
-
321
-
st.header("CSV", divider="gray", anchor=False)
316
+
withtab2:
322
317
st.write(
323
318
"Check the data returned in the dataframe below and download the file."
324
319
)
@@ -340,8 +335,7 @@ def scroll_page():
340
335
st.dataframe(df, use_container_width=True)
341
336
342
337
# -- JSON -- #
343
-
344
-
st.header("JSON", divider="gray", anchor=False)
338
+
withtab3:
345
339
st.write(
346
340
"Check the data returned in JSON format below and download the file."
Copy file name to clipboardExpand all lines: docs/index.rst
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,10 +9,11 @@ Wayback Tweets
9
9
10
10
Pre-release: |release|
11
11
12
-
Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see :ref:`field_options`), and saves the data in HTML (for easy viewing of the tweets using the ``iframe`` tag), CSV, and JSON formats.
12
+
Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see :ref:`field_options`), and saves the data in HTML, for easy viewing of the tweets using the iframe tags, CSV, and JSON formats.
Copy file name to clipboardExpand all lines: legacy_app/legacy_app.py
+2-6Lines changed: 2 additions & 6 deletions
Original file line number
Diff line number
Diff line change
@@ -14,11 +14,7 @@
14
14
layout="centered",
15
15
menu_items={
16
16
"About": """
17
-
## 🏛️ Wayback Tweets
18
-
19
-
Tool that displays, via Wayback CDX Server API, multiple archived tweets on Wayback Machine to avoid opening each link manually. Users can apply filters based on specific years and view tweets that do not have the original URL available.
20
-
21
-
This tool is a prototype, please feel free to send your [feedbacks](https://github.com/claromes/waybacktweets/issues). Created by [@claromes](https://claromes.com).
17
+
This is the legacy application of [Wayback Tweets](https://waybacktweets.streamlit.app/).
0 commit comments