-
-
Notifications
You must be signed in to change notification settings - Fork 591
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Syncing #35
Comments
@calmh might know a thing or two about using Syncthing in an application-specific context like this. I haven't seen it done before so we might want to check with him before we start. I've taken a look at the arguments to Syncthing and found |
I've started prototyping something small here: https://github.com/ActivityWatch/aw-syncthing/ Could be made to work both with standalone Syncthing and bundled Syncthing, but standalone would probably be preferred due to the dependency on the Python package What it does:
|
From a Syncthing point of view there's no real difference to it - you just start it up, point -home towards somewhere suitable, and configure it appropriately. You'll need to do the exchange of device IDs and so on somehow. As for the Python package, I see that it mentions 0.14.24 specifically but probably only because that was the latest when the README was written. All of 0.14.x speak the same API so there is no difference (and 0.12, 0.13 as well for the absolute most part).
|
@calmh: Awesome! I'll let you know when we have a working release. |
I've started using Standard Notes recently (finally getting off Evernote) and have been impressed by the architecture. They have designed a neat data format/server called Standard File that defines how data should be encrypted and stored both client-side and server-side. Definitely something to check out. Edit: It's interesting, but I'd rather have it distributed than just decentralized. |
I've been thinking about this a bit more. My current idea is to simply configure a folder as a synced databases-folder. Basically This folder could then be synced with Syncthing, Dropbox, or Gdrive (we should probably explicitly recommend Syncthing). The synced database files would not be allowed to be modified from another host then the one who owns them, since such changes could cause syncing conflicts. Potential problems:
|
Reddittracker turned up this today. Makes it pretty clear that sync is a vital feature for most users.
|
Would be nice if this is implemented that it doesn't add to the system requirements to run the program. So for people who don't need the functionality or would rather just set up a cron job to copy it to a remote server manually can still disable the feature. |
@hippylover Noted! Thanks for the feedback. |
I googled activitywatch + backup. Trying to locate where the data is stored. Would be really nice to be able to 'set' where the data is stored. Backup solution I use--is to put important stuff I'm working on under Dropbox or MEGA. I'm under linux...and I actually add a home directory, so that I ...guess it makes me more 'aware' that it's Dropbox data. I just read through the above comments--supposedly MEGA is end to end encrypted. I started using because more free data.. but it has the bonus of not having to mess with an encryption solution if you want to store it encrypted. |
your sync look like auto-backup to me (or i've miss understoud) how do you merge activity from multiples devices ? if i was in charge, i probably use git as sync/merge tool if the data are stored in plain text files. |
@1000i100 The difference between sync and auto-backup would be that in auto-backup there's a definition of a producer and a consumer while in sync it doesn't, and by that definition we might actually refer to auto-backup yes. Merging activity from multiple devices is not an issue as long as the one device you are requesting data from has the data for all the devices you want to view. Each kind of data is separated by activity type per each host which we call buckets. Plaintext is simply not scalable and therefore git is out of the question. If we have 500MB of data and convert it back and forth between a database and plaintext file it would be incredibly slow. |
Started working on something small as an experiment: ActivityWatch/aw-server#50 |
raises hand Just wondering - isn't the storage a database? syncthing doesn't handle database syncing. |
@madumlao I don't get that either, syncthing syncs file by file and it is near impossible to do a diff of a binary sqlite fine. The database can easily grow past 100MB and it's not viable to sync such a large file frequently. |
@madumlao Correct, but the database is stored in a file, which can be synced. @johan-bjareholt Syncthing is smart enough to not sync the entire file if only parts of it have changed, see: https://forum.syncthing.net/t/noob-question-incremental-sync/1030/17 |
@ErikBjare Oh nice. Googled a bit on the sqlite database files and they seem to be paged so that should be fine then. I just assumed that it was as bad as git when comparing binaries but apparently they have solved that issue. |
Would syncing with syncthing also mean that we will have multiple database files? In that case we might need a lot of refactoring. |
@ErikBjare I'm not convinced that an SQLite db will survive syncthing. At best case you'll lose transactions done on one side, at worse case you'll have a mispaired hot journal which will corrupt the whole db. Effectively, if an aw-server process is running on two machines there's going to be contention. https://www.sqlite.org/howtocorrupt.html The only way that syncthing, rsync, or similar process is going to be "safe" is if each transaction is a separate file, but I guarantee that that's going to be bad. You really need to implement some kind of peer to peer syncing db, such as for example, a multi-master LDAP. |
@johan-bjareholt Yes, each instance would write to its own file in the synced folder(s) (there are some benefits to having one Syncthing-folder per instance, as Syncthing can enforce "master copies" preventing accidental deletion/corruption on other machines). An instance would therefore have read-only access to database files from remote machines. I don't think this requires any major refactoring. @madumlao I am aware, I'm not proposing we sync a single sqlite database file. I thought I had mentioned it in the issue before, but I realize now that I hadn't. Hopefully this should clear things up: I'm not proposing two-way sync in the sense that you can edit remote buckets, only read them (and create copies, which you could in turn modify). |
I see. A full-on p2p system would be very much appreciated. I have a case where I have multiple laptops / devices that all move around. Unless I set up a single server and configured all clients (including firefox extensions etc) to talk to that server, my activity watchers will all have gaps in activitytracking, defeating the purpose of review. Ideally a user who has multiple devices can transfer in between devices with little setup, and the tracking will follow them throughout. Maybe the laziest / easiest way to do this without major rearchitecting is to use periodic "sync checkpoints", which would basically:
Could be implemented as a separate watcher-like process. (My assumption is that tracking events are largely just additive transactions, there is little editing done) By the way, I have no idea where the sqlite database is saved. Any pointers? |
@madumlao That's almost the exact design I had in mind for the MVP, nice to see we arrived at the solution independently! We use appdirs to manage files like the database, caches, and logs. So check |
Just to be sure, there is currently no across-device syncing available yet, right? If so, once syncing available I'd gladly switch from RescueTime. I constantly switch across different computers. |
@x-ji No it's sadly not available yet. |
I'm looking for more people to try out the WIP sync implementation. Try to get it working, find some bugs, fix some bugs, etc. If you feel up to the task of writing a little bit of Rust and helping out with an important feature, please give it a shot! |
Is there a timeline for the feature? Understandably, not looking for any hard commitments, just wondering how things are looking given the current state. |
First off — I really appreciate the work you guys put into this. But I would also like to know if the syncing feature will be coming soon? I would def be willing to beta test if needed. Unfortunately, I don't know Rust, so I'm unable to help with the coding. |
+1 to this. Although I do know Rust and would love to help. Is this feature even being pursued still? |
Distributed sync would be a great feature. But is that not easier to host a dockerized version of ActivityWatch that will use MariaDB? It will allow easy backup (MariaDB is easy to backup, can be replicated if needed etc...). A lot of softwares/services work in this way |
How about syncing to one's social account like Google? |
I just updated the aw-sync README with proper usage instructions, for those bold enough to try it out. Still probably rough around the edges, but I've had it working for a while now. Please give it a try, report issues, and submit PRs! @OGoodness No timelines. I work on it when I find the spare time. It's ready when it's ready :) Shoutout to @nathanmerrill for his awesome PR improving error handling and other stuff in the syncing code: ActivityWatch/aw-server-rust#437 Thank you all for your kind expressions of impatience and overall understanding. I hope I get to give you all what you are waiting for soon :) |
I would love to test this feature, can you please compile it for windows? |
Is this coming to Android too? My main use case for syncing would be pulling AW data from my phone, both to back it up and to hopefully visualise it on desktop rather than fumbling around with the UI on mobile. As far as I can tell, the beta builds don't include Android, as that's maintained separately but only released occasionally? E: Looks like this has been requested already: ActivityWatch/aw-android#107 |
It would be good to be able to self-host and store all data in our own server (centralized) as encyrpted |
…next WIP: transform-next
veilid is p2p, secure, mobile-first and has python bindings here: https://gitlab.com/veilid/veilid/-/tree/main/veilid-python?ref_type=heads |
file upload example (very old) : http://vdrop.link |
The latest version v0.13.1 that was released last week now ships with aw-sync bundled! 🎉 It is not enabled by default, so you need to manually start it in the trayicon menu, or modify the I hope this gets more people to try it out and give their feedback. It's not guaranteed to be stable, but it works. I'm still working out how to bring this to Android. |
here is my timeline - it works wonderfully for me, syncing with syncthing. I think it took a day to appear in the web ui (I have no data on this), but because the data can’t conflict it just works for now. the ui could be polished (eg. visually grouping the devices in timeline mode and much more) but that is a topic on its own. Thanks for making this work!! |
Not sure if the same folks are watching all the AW repositories, but I've logged a sync-related issue ActivityWatch/aw-qt#105 where sync (along with the input watcher, another new module?) doesn't load at startup, unlike the other modules. |
any progress or an official release day coming soon? i would also be happy with some sync option via dropbox or google drive, they have my data anyway as i am using windows and android, but i wanna hav it too. |
I think 0.13.1 includes it? |
well its not quite user friendly as there are no options or hints in the web ui and i dont think sync between android and windows is aldry functioning, is it? |
Thank you for implementing this sync! Started to test aw-sync Also, when selecting the model from the icon in the tray menu Its seems to run only with However, where running from the command line Another thing, when running the |
Why https://litestream.io/ was never considered? It's good solution to sync sqlite database and cheap as well. |
Dolt is another excellent open source decentralized replacement for MySQL, sqlite etc. Basically git for data. But it seems |
Vote on this issue on the forum!
There are two usage issues with ActivityWatch at the moment to which syncing is a solution:
I know of two interesting solutions to this problem:
Want to back this issue? Post a bounty on it! We accept bounties via Bountysource.
The text was updated successfully, but these errors were encountered: