Data analysis part 2 #173

anusha-ramdarshan · 2021-04-22T16:23:58Z

No description provided.

qwandor · 2021-04-22T17:12:49Z

notebooks/README.md

+
+## Looking at the data
+
+The data will be split into different csv files, split by different data types.According to your setup, there will be up to 5 files:


You're missing a space after the fullstop.

qwandor · 2021-04-22T17:14:38Z

notebooks/README.md

+- homie_enum
+- homie_color: contains rgb values for the smart lights
+- homie_float: contains all metrics stored as floats (temperature)
+- homie_integer: contains all metrics stored as integers (humidity %, battery level %)


string is also possible.

qwandor · 2021-04-22T17:15:13Z

notebooks/README.md

+- homie_float: contains all metrics stored as floats (temperature)
+- homie_integer: contains all metrics stored as integers (humidity %, battery level %)
+
+Here, we want to focus on the csvs containing floats and integers, as they contain the temperature/humdity data. Useful columns:


qwandor · 2021-04-22T17:15:46Z

notebooks/README.md

+
+Here, we want to focus on the csvs containing floats and integers, as they contain the temperature/humdity data. Useful columns:
+- time: since epoch (unix epoch 1970). pandas handles this for us.
+device_id


I guess this should also be a list entry.

qwandor · 2021-04-22T17:16:45Z

notebooks/README.md

+- node_type: =="Mijia sensor" to select only the temperature/humidity sensor data
+- node_name: nickname for the sensor (e.g., "living room")
+
+There are between 4 and 10 data points per sensor per minute, depending on how often a sensor gets polled (~ 10K data points in a 24h period for a given sensor)


Depending on the min_update_period_seconds in mijia-homie.toml, really.

alsuren · 2021-04-23T10:39:49Z

notebooks/data_exploration.ipynb

@@ -7,7 +7,9 @@
   "outputs": [],
   "source": [
    "import pandas as pd \n",
-    "import plotly.express as px\n"
+    "import plotly.express as px\n",
+    "from sklearn.preprocessing import StandardScaler\n",


hrm. I'm getting an error here. Trying to debug now.

ModuleNotFoundError Traceback (most recent call last) <ipython-input-2-186f7a1512d6> in <module> 1 import pandas as pd 2 import plotly.express as px ----> 3 from sklearn.preprocessing import StandardScaler 4 from sklearn.decomposition import PCA ModuleNotFoundError: No module named 'sklearn'

alsuren · 2021-04-23T10:50:05Z

notebooks/pyproject.toml

@@ -10,6 +10,7 @@ ipykernel = "^5.5.3"
 pandas = "^1.2.4"
 plotly = "^4.14.3"
 nbstripout = "^0.3.9"
+sklearn = "^0.0"


https://pypi.org/project/sklearn/ says to use scikit-learn instead.

vscode also decided that it wanted to install notebook when I tried things out on a fresh virtualenv, but I can make a patch for that as a separate PR.

anusha-ramdarshan added 5 commits April 20, 2021 15:34

adding sklearn

8f37e99

adding to the readme

e32b45e

trying out a PCA to differenciate rooms based on Temp and humidity

f431492

adding plots to compare different days of the week

c51bb38

clearing up tickvals and ticktext, removing dead code

dfe2238

qwandor reviewed Apr 22, 2021

View reviewed changes

alsuren reviewed Apr 23, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Data analysis part 2 #173

Data analysis part 2 #173

anusha-ramdarshan commented Apr 22, 2021

qwandor Apr 22, 2021

qwandor Apr 22, 2021

qwandor Apr 22, 2021

qwandor Apr 22, 2021

qwandor Apr 22, 2021

alsuren Apr 23, 2021

alsuren Apr 23, 2021


		## Looking at the data

		The data will be split into different csv files, split by different data types.According to your setup, there will be up to 5 files:

Data analysis part 2 #173

Are you sure you want to change the base?

Data analysis part 2 #173

Conversation

anusha-ramdarshan commented Apr 22, 2021

qwandor Apr 22, 2021

Choose a reason for hiding this comment

qwandor Apr 22, 2021

Choose a reason for hiding this comment

qwandor Apr 22, 2021

Choose a reason for hiding this comment

qwandor Apr 22, 2021

Choose a reason for hiding this comment

qwandor Apr 22, 2021

Choose a reason for hiding this comment

alsuren Apr 23, 2021

Choose a reason for hiding this comment

alsuren Apr 23, 2021

Choose a reason for hiding this comment