New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Fix wahoo speed and import more meta data #81

Closed

smartsammler wants to merge 4 commits into martin-ueding:master from smartsammler:fix-wahoo-speed

smartsammler commented Jan 7, 2024

Convert speed from m/s to km/h

Wahoo Element Road saves its speed in m/s instead of km/h, so to get proper values the conversion has to be done explicitly. We use a simple heuristic to check if we do need to convert the units.

Use name and kind from metadata

Extract further meta data like the sport from Wahoo FIT files.
Use that data for the filename and to fill missing data.

smartsammler added 4 commits

January 7, 2024 23:07


          Add macos files to gitignore and venv

34bdd2a


          Convert speed from m/s to km/h

ba962d4

Wahoo Element Road saves its speed in m/s instead of km/h, so to get proper values the conversion has to be done explicitly. We use a simple heuristic to check if we do need to convert the units.


          Use name and kind from metadata

8a4a9f2

Extract further meta data like the sport from Wahoo FIT files.

Use that data for the filename and to fill missing data.


          Avoid failing indexing if there are typical meta data files on macos

bbc255a

smartsammler mentioned this pull request

Incorrect values for speed and height for imported *.fit files #76

Closed

Owner

martin-ueding commented Jan 10, 2024

Thank's for the PR, I'm not sure when I can review it, but I hope I will get to it soon.

martin-ueding reviewed

View reviewed changes

Owner

martin-ueding left a comment

Thank you for all the suggested changes. I've added a bunch of comments. I will take your ideas and come up with an architecture that is a bit more general and allows growing the metadata extraction concept to the other file formats as well.

.gitignore

@@ @@ -3,3 +3,6 @@ __pycache__/ @@
               dist
               site
               data
+              .venv
+              .DS_Store

Owner

martin-ueding Jan 13, 2024

I've added those two files to the ignore on master.

.gitignore

@@ @@ -3,3 +3,6 @@ __pycache__/ @@
               dist
               site
               data
+              .venv
+              .DS_Store
+              .gitignore

Owner

martin-ueding Jan 13, 2024

Is there a particular reason you include the gitignore in itself?

Author

smartsammler Jan 16, 2024

No, just a habit, so that I don't have to take care of strange OS and editors of contributors, like you have to right now :)

geo_activity_playground/core/activities.py

@@ @@ -89,6 +90,19 @@ def get_time_series(self, id: int) -> pd.DataFrame: @@
                                   * 3.6
                               )
                               changed = True
+                          else:

Owner

martin-ueding Jan 13, 2024

This is a nice idea, but in #82 there came the suggestion to use the units field from the FIT files. I think that makes more sense to use the actual data instead of a heuristic.

Author

smartsammler Jan 16, 2024

Yes, I do agree. Shall I rebase on the updated master which includes PR #82 ?

Owner

martin-ueding Jan 16, 2024

I don't think that rebasing makes sense. I've already manually cherry-picked some of your ideas into the master branch. If you want to want to make further changes I'd suggest that you base them on the latest master. And perhaps let me know what you plan, then I can perhaps already implement it based on your idea.

geo_activity_playground/core/activity_parsers.py

@@ @@ -20,7 +20,9 @@ class ActivityParseError(BaseException): @@
               def read_activity(path: pathlib.Path) -> pd.DataFrame:
                   suffixes = path.suffixes
-                  if suffixes[-1] == ".gz":
+                  if not suffixes:  # Skip files without extensions like .DS_Store files on macos.

Owner

martin-ueding Jan 13, 2024

That's a good idea, I've added a skip.

geo_activity_playground/core/activity_parsers.py

                   elif file_type in [".kml", ".kmz"]:
                       df = read_kml_activity(path, opener)
                   elif file_type == ".csv":  # Simra csv export
                       df = read_simra_activity(path)
                   else:
-                      raise ActivityParseError(f"Unsupported file format: {file_type}")
+                      raise ActivityParseError(f"Unsupported file format: {file_type} of file: {path}")

Owner

martin-ueding Jan 13, 2024

The path will be added in the import_from_directory function, so we don't need to repeat it here.

geo_activity_playground/core/activity_parsers.py

                                           row["altitude"] = fields["altitude"]
                                       if "enhanced_altitude" in fields:
                                           row["altitude"] = fields["enhanced_altitude"]
+                                      if "grade" in fields:

Owner

martin-ueding Jan 13, 2024

Those three additional fields are now added to the time series. There is no analysis on them yet, though.

geo_activity_playground/core/activity_parsers.py

+                  df = pd.DataFrame(rows)
+                  if metadata:
+                      for key, value in metadata.items():
+                          setattr(df, key, value)

Owner

martin-ueding Jan 13, 2024

I see what you want to do, but I am not too happy with adding this to the data frame object. I will think about a more general way to extract metadata from the files as there are also some GPX files with metadata.

geo_activity_playground/core/activity_parsers.py

-                  return pd.DataFrame(rows)
+                                  elif "wkt_name" in fields and "sport" in fields and "sub_sport" in fields:
+                                      metadata["wkt_name"] = fields["wkt_name"]

Owner

martin-ueding Jan 13, 2024

What is “wkt name”? Can you give examples for the values that it takes?

Owner

martin-ueding Jan 13, 2024

Ah, I've found some documentation for this:

Owner

martin-ueding Jan 13, 2024

Apparently the contents are just strings. I've also added that to master now.

geo_activity_playground/core/activity_parsers.py

-                  return pd.DataFrame(rows)
+                                  elif "wkt_name" in fields and "sport" in fields and "sub_sport" in fields:
+                                      metadata["wkt_name"] = fields["wkt_name"]
+                                      metadata["sport"] = (fields["sport"], fields["sub_sport"])

Owner

martin-ueding Jan 13, 2024

How does sport and sub sport differ from the activity kind (ride, walk, hike)? Can you give examples?

Owner

martin-ueding Jan 13, 2024

Ah, so it is just something “cycling” and “generic”. I've added that into the kind field.

geo_activity_playground/importers/directory.py

                       row = {
                           "id": activity_id,
                           "commute": commute,
                           "distance": distance,
-                          "name": path.stem,
+                          "filename": path.stem,

Owner

martin-ueding Jan 13, 2024

Adding the filename to the metadata seems very sensible!

martin-ueding added a commit that referenced this pull request


          GH-81: Skip .DS_Store in the activity directory

a1501a0

martin-ueding added a commit that referenced this pull request


          GH-81: Extract more FIT time series data

a17a2a1

martin-ueding closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet