Consultation: DADI's business use cases and deployment status #120
Comments
Does DADI have a user discussion group? Are there any business scenarios that DADI is not suited for? Could you share the current status of DADI's production adoption at Alibaba?
Thanks for answering.
I see... Our paper describes some of the performance behavior we observed in our production environment. In addition, DADI is widely used for machine learning and WebIDE workloads in Alibaba Cloud, but I think you should run your own tests in your environment. As for question 3, we currently have no other optimization plans... You can contact me by email ([email protected]) if you need more information or help adopting DADI in your company :D
@bengbeng-pp Currently in Alibaba Cloud, only Function Compute uses trace prefetching, because recording a trace is relatively easy for them. Some businesses are reluctant to do that. I think what you need is cache + P2P distribution, and DADI has an open-source implementation of each. By setting up a large SSD cluster, you can distribute and cache every hot piece of data across the network, which effectively forms a powerful network filesystem :-)
Thank you very much for your patience.
Other:
Yes, the figure compares the cold startup time of a tgz image with that of overlaybd.
By 'the IO pattern' I mean that most applications read only a small fraction of the image data (~6.4%, FAST '16).
Actually, online business was the first scenario where DADI was deployed. The machine learning and WebIDE workloads I mentioned always use larger images than others (~10 GB+).
Overlaybd records the image I/O trace without network access. In my experience, trace prefetching should be helpful to you.
We use LRU to automatically evict unused cache data, so the cache never exceeds its configured capacity. To check the cache's disk usage, run 'du -sh' on the cache-dir.
There is a very rudimentary open-source implementation of our P2P... but I don't think you need it. https://github.com/data-accelerator/dadi-p2proxy Anyway, as I said before, I can only share conclusions from my own experience; you should run your own tests. :-D
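For readers unfamiliar with how an LRU policy keeps a cache under a fixed size, here is a minimal Python sketch. It is only an illustration of the general technique, not DADI's actual cache code: the class name `ByteBoundedLRU`, its methods, and the in-memory storage are all hypothetical.

```python
from collections import OrderedDict


class ByteBoundedLRU:
    """Toy LRU cache (hypothetical, not DADI's code) capped by total bytes."""

    def __init__(self, capacity_bytes):
        self.capacity_bytes = capacity_bytes
        self.used_bytes = 0
        self._blocks = OrderedDict()  # oldest entry first, newest last

    def get(self, key):
        data = self._blocks.get(key)
        if data is not None:
            # A hit makes this block the most recently used one.
            self._blocks.move_to_end(key)
        return data

    def put(self, key, data):
        if len(data) > self.capacity_bytes:
            return  # a block larger than the whole cache is never stored
        old = self._blocks.pop(key, None)
        if old is not None:
            self.used_bytes -= len(old)
        # Evict least recently used blocks until the new one fits.
        while self.used_bytes + len(data) > self.capacity_bytes:
            _, evicted = self._blocks.popitem(last=False)
            self.used_bytes -= len(evicted)
        self._blocks[key] = data
        self.used_bytes += len(data)
```

Because eviction happens before every insert, `used_bytes` can never exceed `capacity_bytes`, which is the property described above.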
You said overlaybd records the image I/O trace without network access. Will this cause the application to get stuck when it needs network operations, making it impossible to obtain a complete record of its I/O operations?
Yes... the prefetch trace depends on the application environment.
I understand, thank you very much for the answer.
Hello, is there any documentation on how to configure the cache and P2P? When I pull an obd-format image from the registry, I can't see anything in /opt/overlaybd/registry_cache.
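To make the record-then-replay idea behind trace prefetching concrete, here is a small Python sketch. It assumes a trace is just an ordered list of (offset, length) reads; that format, the `TraceRecorder` class, and the `replay_trace` function are hypothetical and do not reflect overlaybd's real trace files or API.

```python
class TraceRecorder:
    """Wraps a read function and logs every (offset, length) request."""

    def __init__(self, backend):
        self.backend = backend  # callable: (offset, length) -> bytes
        self.trace = []

    def read(self, offset, length):
        self.trace.append((offset, length))
        return self.backend(offset, length)


def replay_trace(trace, backend, cache):
    """Warm `cache` by re-issuing the recorded reads; returns the fetch count."""
    fetched = 0
    for offset, length in trace:
        if (offset, length) not in cache:
            cache[(offset, length)] = backend(offset, length)
            fetched += 1
    return fetched
```

Only the reads the application actually issued during recording end up in the trace, so anything it would have read after a blocked network call is missing from the replay.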