load file over network vs copyfile
7 views (last 30 days)
Show older comments
Hi,
So this is not a complex coding problem, more a basic matlab understanding issue.
I have access to huge amounts of data (in the range of 100s of terabytes) in a network folder. When I read this data (using load) on the actual network path i get a speed in the range of 10-30mbit/s (based on information in the task manager). This is quite slow and will not work.
When I instead copy the folders using "copyfile" from the network location to C:\ I get a speed in the range of 150-200 mbit/s.
Then I can continue reading the variables I'm interested in.
What I can't understand is why there is such a big difference in speed using load on the network location and copying the file to C:\ could anyone here please enlighten me? :)
0 Comments
Accepted Answer
Walter Roberson
on 30 Apr 2021
load() needs to seek() in the file, and potentially read sections of the same file multiple times.
When seeking around is done, the file cannot logically be cached because on a network drive it must be assumed that another process might be writing to the file. Though that is a detail that depends on the exact file system, as on some network file systems, the file could potentially be locked while it is open, even over the network.
3 Comments
More Answers (0)
See Also
Categories
Find more on Deep Learning Toolbox in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!