-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data Visualization Tools (2022) #10
Comments
Bishal Goswami Introduction:
Getting started:
Features:
What are map tiles?
Exploration using Leaflet:Simple exercise: Marking the park locations in Sanjaynagar, BengaluruSublime Text download link: https://www.sublimetext.com/3 MapTiler Cloud is one of the Leaflet-providers. HTML link: leaflet_assignment1.html Possibilities with Leaflet: Few examples:Pros:
Cons:
References:
|
Alivia Chaudhuri What is Datawrapper? Quick overview You can do 3 types of visualizations using Datawrapper:
How do you get data into it?
How do you get data out of it? Privacy concern- Once you upload your data, you are placing it on Datawrapper's free hosting service so your data will be accessible to the internet. If you want to use this tool and have sensitive information, then to keep your data safe, it is advisable to install a paid version. Datawrapper River - We can view previously made charts and adopt them for our own projects (The River). The River is a good discovery tool for people who may have done similar visualizations. Features at a glance
Here are some of the examples we saw in class: https://datawrapper.dwcdn.net/yx7mD/2/ Student Discount Pros
Cons
Resources that helped me: |
Mapbox StudioShweta Ratanpura Highlights
How it worksMapbox Studio provides tools for converting your data to tilesets and creating styles. It also includes a dataset editor, which you can use to create and edit data before converting to a tileset. Once you create a style, you can display your map using Mapbox GL JS or through Mobile SDKs 3 main tabs of the Mapbox Studio
Steps to create your own Data Viz1. Upload data as a Dataset2. Draw & edit data in the Dataset tab3. Create a Tileset by exporting the Dataset4. Choose a template style in the Styles tab5. Modify style properties in the Styles editor6. Add your layer containing your Tileset in the Styles editor7. Style your layer in the Styles editor8. Publish style in the Styles editorPossibilities using Mapbox StudioFuther examplesPros of using Mapbox Studio
Cons of using Mapbox Studio
|
Tejaswini Pundge | 216330012 | Interactive Data Visualization | Assignment 3 Introduction RAWGraphs is an open source data visualization framework built with the goal of making the visual representation of complex data easy for everyone. RAWgraphs facilitate you to upload/paste data and customize it to create nice graphs. Using RAWgraphs is very easy. Below are the few steps to follow : Following are the input options for your data set.
Following are the type of graphs you can choose from :
Following are the Mapping parameters :
Following are the customization parameters :
Output: Some important points: |
Apoorv Anurag Tool: CircosCircos is a software package for visualizing data and information. It visualizes data in a circular layout, making Circos ideal for exploring relationships between objects or positions. Circos is ideal for creating publication-quality infographics and illustrations with a high data-to-ink ratio, richly layered data, and pleasant symmetries. You have fine control of each element in the figure to tailor its focus points and detail to your audience. It was developed by Martin Krzywinski, a genetic scientist and a data viz. enthusiast, in the mid-2000s. He originally conceived it for visualizing genomic data such as alignments and structural variation. He was successful in his attempt as, till now, Circos has been used in over 500 scientific research papers and many of the top science journals. Types of Circos-Using CircosThere are two ways of using Circos-
Method 1You must install Strawberry Perl on your computer from this link on the website. It is compatible with Windows, Mac OS X, Linux, and UNIX. I found a helpful youtube tutorial that can help you for installing the tool. Link. The installation also requires a terminal like Command prompt and Visual Studio Code (or VS Code) to make the visualization. Pain Points:
Method 2 - Circos Table ViewerThis online tool is provided on the same website, which makes creating Circos visualizations easier. We just have to create a Tabular Separated Value (TSV) file of our data. And hit 'Visualize Table'! After a few seconds, another webpage loads, which creates the circos chart. Example: I wrote tabular separated values inside notepad and saved it as a txt file. Then I uploaded it to the website. I got the following visualization. I can go into the settings tab from here to edit some attributes. It has a lot of simple customization options available, which are enough for a novice user. The bad part is you will have to upload the file again and wait for the page to load every time. The settings page looks something like this- After visualizing the chart, you can download it as png or SVG from the option provided just above the generated chart on the website. The final output that we got - Negatives of this tool:
Positives: P.S.- This was just a simple data sample I took for demo purposes and easy understanding. One can make visualization with much more complex data also. Reference Links:
Presentation SlidesPress 'F' to make it fullscreen :) |
ObservableName: Abhinav Bansal Tool: Observable About Observable: It is a Data Visualization Tool that is completely web-based. You do not need an IDE to work on this tool, nor do you need to install any external software. It works through several libraries of Javascript (Vega-Lite, D3, P5 etc.). Overall, a very powerful tool to customize the visualizations. Beginners can also explore the community to find suitable visualizations; find and change its data set and create their own Visualizations. Some Basics:
Syntax: With every language, it is important to understand the syntax. This is also the case with Observable. One needs to understand the encoding format, and the basic syntax before they can start editing/ writing their codes for visualizations.
Getting StartedWhile getting started, the user needs to understand 'Cells', Input Language, and the basic functions available in the interface. Cell: A strip which runs independently in the overall workflow. It has two fields- input field and the output field. Always define call with a unique variable (eg. A, B, myCell, myCell2 etc.) Pin: Pin is used to 'pin' the input field of a cell. The input field contains the code which one writes. The output field shows the out of the code which has been input. Run Button: Clicking on the run button (far right of a strip), will run the code that has been input. Any error is highlighted. Input Options: There are 3 types of input options- Javascript, Markdown and HTML. Features: These are located on the right side of the screen. These include Attaching Files, Viewing History, Sharing, Forking, Search, Publish Example 1: Understanding the BasicsHere, variables a,b and total are defined and given a numeric value. These are written in separate cells altogether. Example 2: AdditionHere, a function called Addition has been defined. Syntax to note:
Example 3: Using Vega-Lite Library (link)
This is how it looks after importing api and data-sets Verifying the Data-Set using printTable(data.slice(0,rows)); This command is used to verify whether the Data Set you added is loaded. Further, the attributes are also checked using this function. Example 4: Modifying Data using Curated Data SetsHere, we need to understand the syntax effectively. Every syntax follows the same pattern: Example 5: Interactive Scatter PlotHere, we use the feature called tooltip Example 6: Advanced VisualizationsOn Exploring the community, very high level visualizations can be found. Although, one can also code these complex visualizations from scratch, one needs to be very efficient with coding to be able to do so. Hence, an easy way is to find these in the community, fork them, and edit their data. (After understanding the kind of data it wants!). Below are the links to some visualizations I created by exploring the community and slightly modifying the datasets. Text Analysis: Sankey Diagram: Original Overall: Pros
Overall: Cons
Some Useful Links (Note: These were added and working as on Aug 13, 2022) |
Rubayat Ahmed Tool chosen : p5.js Introduction : p5.js is a JavaScript library for creative coding. The term creative coding refers to graphical approach of coding, where a beginner can learn programming with graphical output and shape their ideas on their own. It is a powerful tool for artists, designers and programmers with a gentle learning curve that’s easily achievable. The only limit to what one can create with p5.js is your imagination. It is an open-source library developed in 2014 and derived from Processing – a flexible software sketchbook specifically aimed at teaching the novice how to use code to create visual artwork. Till that point it has grown as a medium of expressing creativity through programming. Key usage of p5.js : The capability and range of functions that p5.js provides is huge, largely they can be grouped as...
Some Examples are :
Prerequisite : Being a completely coding based tool, a good understanding of algorithm/programing with some experience of HTML/CSS/JavaScript will be needed. JavaScript is preferred as the library is build on top of it. Code Editor : To create real websites using p5.js one can use any standard code editor like VS Code, Sublime text, Notepad ++ etc. But for learning and exploration purpose there is an online code editor available on the official website, where one can write the p5.js JavaScript code and run it in one go to see the instant graphical output. This can't be exported anywhere, but can be copied to your project file easily. The online code editor as shown in below has two main panels, left panel is where we write the JS codes, and on the right hand side we see the output in a canvas. Menu bar at the top gives you option to save and open new files. To run a code one need to click on the play button, but auto-refresh could be enabled so that you don't need to refresh each time you edit a piece of code. Syntax : The syntax to write a p5.js code is same as writing any JavaScript code. Where functions are the main component that executes code written inside them.
In the case of p5.js, there are two main predefined functions that are sufficient to write most of the programs. Which are :
SO WHAT CAN YOU DO WITH P5.JS? HOW? Let's write a simple code to draw a circle in the center of the canvas. There are three parameters inside the circle function which decides the location and size of the circle in the canvas. Here in this example the x and y coordinates of the center of the circle are equal to half of height and weight of the canvas, basically means the center location of the canvas. The third parameter is of radius which is given 200 in this example. Thus one can draw ant graphical objects in the canvas using its properties. BIG DATA VISUALIZATION? p5.js is not suitable for big data visualization, as it is purely code based. One need to write code from scratch to do anything in p5.js, on the other hand there are more efficient tools like D3(https://d3js.org/) javascript library to do the job. Still because of the flexibility of p5.js in terms of creating whatever you want gives you freedom to create some unique data visualization. PROS
DRAWBACKS
Thanks! |
Mithun Murali Tool: Tableau Public Tableau Public is a free platform to explore, create, and publicly share data visualizations. Tableau Public is a free edition of the Desktop product. With this edition, data can only be published to the Tableau public website and does not allow work to be saved or exported locally. It connects to various data sources for combining disparate data sources without coding. It provides tools for discovering patterns and insights, data calculations, forecasts, statistical summaries, and visual storytelling. Tableau Server allows Tableau Desktop users to publish dashboards to a central server to be shared across their organizations. The product is designed to facilitate collaboration across the organization. It can be deployed on a server in the data center, or it can be deployed on a public cloud. The most common users of Tableau Public are Enterprises, freelancers, and students. Tableau offers different types of products for different industries: Installation option for Tableau Public: The landing screen of Tableau Public: This is a simple data viz created from a spreadsheet data: Here is a sample dashboard created in Tableau Public. This dashboard is in its raw form; no visual customization has yet been done. (Please go through the presentation for more details) |
Voyant toolsAnkit Anand M.Des Tool: Voyant Tools What is Voyant Tools?Voyant Tools is an open-source, web-based tool used for the analysis of textual data. It is primarily used for statistical analysis, and data mining of scholarly texts to gain a deeper understanding through pattern recognition, word frequency, frequency distribution plots, and analyzing words in context. How to do text analysis with Voyant Tools?This chart shows some of the American president's vocabulary by Grade Level. Donald trump scores the lowest with a 4.6 grade. I wanted to verify if this was indeed the case by analyzing Trump’s entire tweet archive. I downloaded all the tweets sent from Trump’s account from 2009 to 2021 and converted them into .txt format. On the homepage, you will see a textbox that prompts users to type, open, or upload text files. After the tweets are uploaded, Voyant tools show this dashboard with the default visualizations. These visualizations can be further customized to filter the results (ex - to exclude certain words from the analysis) This is a Terms counter which gives an absolute count of the no of times a word has been used in a document. Here we can see that the most used words for Trump are often overused words like “great”, “president”, “thank”, and “just”. The second most used by trump is “Trump” which is not surprising. This Cirrus tool shows the same data in a fun visualization. Visualization typesVoyant tools offer an extensive range of visualization options. Some of them are shown below: Cirrus - It is a word cloud that visualizes the top-frequency words of a corpus or document. Terms counter - It gives an absolute count of the no of times a word has been used in a document. Terms berry - The TermsBerry tool provides a way of exploring high-frequency terms and their collocates (words that occur in proximity). Trends - Trends show a line graph depicting the distribution of a word’s occurrence across a corpus or document. Knots - Knots is a creative visualization that represents terms in a single document as a series of twisted lines. ExamplesVoyant tools maintain a list of open-source analyses done by researchers from various fields. These examples are great when looking for inspiration on what kinds of analysis are possible. Pros
Cons
Resourceshttps://library.villanova.edu/application/files/9815/8739/5480/VoyantToolsParadiseLostTutorial.pdf |
TangleZuha Asif P Data Visualisation Tool: Tangle What is Tangle?
How to get started with tangle?
Example of what a reactive document In the above example, if you click and drag 3 cookies, you can see the calories also changing accordingly. For this you simple reaction tangle gives you a JavaScript library which includes different classes with sub functions which can be utilised to build interesting reactions. Here the scroll is defined by a class named "TKAdustableNumber" and the value of calories are stored by a variable named "calories". HTML attributes includes:
Advantages of Tangle
To Being with you must Download • The Tangle.js lightweight library with no dependencies): |
Tool :Timeline JS The tool uses a spreadsheet template from Google Sheets to create a media rich timeline that can be used to show
It was created by Northwestern University Knight Lab, a community of designers, developers, students, and educators working on experiments designed to push journalism into new spaces. Few of their other creations:
This tool helps us in making an engaging and interactive timeline for the internet. It can be viewed by Embedded into websites or shared online This is how the timeline will look: You can zoom in and zoom out of the timeline You can hover over the various points in the timeline and you’ll be directed there Requirements to create timeline in Timeline JS: Link to the website - https://timeline.knightlab.com/ 1.Go to their website and click “make a timeline” 2.It will take you down the website to this part where you will have to click “Get the spreadsheet template” and a google sheet template will open 3.This is the template where you’ll have to fill the data of your timeline. One will have to the An example of the spread sheet with filled details 4.Once you fill all the details, you’ll have to publish the document to web. File < Share < publish to web < publish. Copy the URL in your browser’s bar. You’ll use this in the next step. 5.After you copy, go to the website and Paste the URL here 6.Once you paste you can either copy the link and share One can use some of the HTML codes to enhance the timeline Here are few good examples of timelines created Limitations |
Tool : Plotly What is Plotly? Which coding softwares can I use Plotly with? Link to all Plotly integrations Overview of Plotly with Python So the steps would be as follows -
Why should I use Plotly with Python?
Here is how you can get started with using Plotly Python library. What kind of visualizations can I create using Plotly's Python library? Now, the exciting thing is that - clicking on any of these visualization types takes you to a page with sample codes and the resulting visualizations. Example, clicking on 3D Scatter plot from the above list takes you to a page that looks like this - My short tutorial on this subject :
Here are some resources that can help you get started with Python - Datacamp, w3schools, Codeacademy.
Cufflinks is also a python library that connects plotly with pandas so that we can create charts directly on data frames. It basically acts as a plugin. Pandas is a Python library used for data manipulation and analysis. In particular, it offers data structures and operations for manipulating numerical tables and time series. NumPy is a Python library, adding support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
For the first exploration, I am using a publicly available dataset that essentially contains information about different countries such as total population, GDP per capita and Life expectancy. It is interesting to see how these parameters vary from country to country, and whether population has any effect on life expectancy and GDP per capita. The first graph shows the relation between life expectancy and GDP per capita through a scatter plot. The dot sizes are reflective of the country's population. It seems that higher the GDP per capita, higher the life expectancy. Checking if there is any similar trend that can be noticed when we consider population and life expectancy. It seems there is not much correlation between the two. Next, drawing a trend line to check how life expectancy increases when GDP per capita increases. The graph seems to be comparable to a Logarithmic graph. Changing the properties of the graph, such as line opacity, background color, line color etc. All these different elements of any visualization can be customized by making slight modifications to the code. Adding one graph on top of another : Having fun by playing around with 3D visualizations created using the famous Iris dataset. Apart from these, the visualizations are themselves interactive. Users can pan around the visualization, zoom, select a particular section using lasso tool, rotate the graphs and also download the graphs as PNG. I have uploaded my code as a notebook file (.ipynb), as well as a Python file (.py) in this link. Please feel free to go through the code if required. Finally, ending with some Cons of Plotly
So that's all !! Thanks !! Amit Kumar Ram |
Tool : Open Refine PPT link: https://docs.google.com/presentation/d/1rWaccRKhK4iT-1X6fyYnW6U-LK089m2SsMkDzXMnuuw/edit?usp=sharing What is Open Refine?
Shivangi Negi |
Name: Indubhusan Roy What is Google Data Studio? https://datastudio.google.com/ Google Data Studio is an online tool for converting data into customizable informative reports and dashboards introduced by Google on March 15, 2016 as part of the enterprise Google Analytics 360 suite. This is a tool which is used by many companies for business analytics. Apart from functioning as a data visualization tool 'GDS' also functions with the help of connectors which provide live data which tracks user behavior. This mechanism is very useful for businesses to keep a track of their performance What Google data studio basically does is. Connectors are an important aspect of Google data studio Google like to advertise it in this way Google provides simple tutorials for users to make use of Data Studio As mentioned earlier the primary user group is niche The following slides demonstrates the interface. Various Sample reports are available for users to edit and make it their own Data Studio provides various visual styles that can be used. Some of the samples provided can also be used to add personal data Now Taking an example of a sales data set This data set can be used to set several variables like regions of sale or date ranges or even sales of particular products over a period of time Changing the variables leading to different reports which can be used to check performance across variables Pros and Cons Thank You. |
Annapurna Garimella |
Name: Sanika Deshpande Download Gephi What is Gephi?
Features Explored
Pros
Cons
|
In this assignment each of you will select a dataviz tool, explore it as thoroughly as you can (download, install, try out, and use it to create something), and do a demonstration/overview of the tool to the rest of the class (15-20 minutes).
The objective is to introduce the tool to the class, and highlight its possibilities & limitations so the audience can make a well informed choice of available tools. We will do 4 tools per day starting from Monday onwards. Abhinav will coordinate & the assign the tools. This is a credited assignment.
In your documentation, include links to the resources you used (if any) in your presentations, capturing your personal insights about the tool and related resources.
There are more tools here and here. If you want to pick a tool that is not listed above, discuss with me.
For how to document your work, take a look at what the previous batches did with this assignment (2019, 2020, 2021).
The text was updated successfully, but these errors were encountered: