gpu(::DataLoader), take III #2245

Conversation
Once the build has completed, you can preview any updated documentation at this URL: https://fluxml.ai/Flux.jl/previews/PR2245/ in ~20 minutes
@@ -171,7 +163,7 @@ In order to train the model using the GPU both model and the training data have
```
Here `(xtrain, ytrain) |> gpu` applies [`gpu`](@ref) to both arrays -- it recurses into not just tuples, as here, but also whole Flux models.

-### Saving GPU-Trained Models
+## Saving GPU-Trained Models
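A minimal sketch of the recursion described above (hedged: assumes Flux is loaded; without a functional GPU backend, `gpu` is simply the identity, so the same code runs unchanged on the CPU):

```julia
using Flux

# Hypothetical small dataset; not part of the diff.
xtrain = rand(Float32, 4, 10)
ytrain = rand(Float32, 2, 10)

# `gpu` recurses into the tuple and moves both arrays.
xg, yg = (xtrain, ytrain) |> gpu

# It also recurses into whole models, moving every parameter array.
model = Chain(Dense(4 => 2)) |> gpu
```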
This change is just me reducing the number of levels of heading from 3 to 2. The file is a bit of a mess, but there's no need for a deep hierarchy.
for (x_cpu, y_cpu) in train_loader
    x = gpu(x_cpu)
    y = gpu(y_cpu)
    grads = gradient(m -> loss(m, x, y), model)
I've changed this example to use explicit gradient, and to be less verbose.
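For context, the per-batch transfer pattern from the hunk above, filled out into a complete loop (a hedged sketch: `loss`, `train_loader`, and the `Adam` optimiser setup are assumptions for illustration, not part of the diff):

```julia
using Flux

opt_state = Flux.setup(Adam(), model)  # assumed optimiser setup

for (x_cpu, y_cpu) in train_loader
    x = gpu(x_cpu)   # move each batch to the GPU as it is needed
    y = gpu(y_cpu)
    # Explicit-gradient style: differentiate w.r.t. the model itself.
    grads = gradient(m -> loss(m, x, y), model)
    Flux.update!(opt_state, model, grads[1])
end
```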
@@ -122,61 +122,48 @@ julia> x |> cpu
0.7766742
```

```@docs
cpu
gpu
These docstrings moved from a "guide" section to a "reference" section.
""" | ||
function gpu(d::MLUtils.DataLoader) | ||
MLUtils.DataLoader(MLUtils.mapobs(gpu, d.data), | ||
d.batchsize, |
Instead of writing this out here, we could move it upstream: JuliaML/MLUtils.jl#153
    end
end
```
This is equivalent to `DataLoader(MLUtils.mapobs(gpu, (X, Y)); keywords...)`.
Something similar can also be done with [`CUDA.CuIterator`](https://cuda.juliagpu.org/stable/usage/memory/#Batching-iterator): `gpu_train_loader = CUDA.CuIterator(train_loader)`. However, this only works with a limited number of data types: `first(train_loader)` should be a tuple (or `NamedTuple`) of arrays.
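A sketch contrasting the two approaches (hedged: assumes CUDA.jl and a working GPU; `X` and `Y` are hypothetical arrays, not from the PR):

```julia
using Flux, MLUtils, CUDA

X = rand(Float32, 4, 100)
Y = rand(Float32, 2, 100)
train_loader = DataLoader((X, Y); batchsize = 10)

# 1. The method from this PR: wraps the data with `mapobs(gpu, ...)`,
#    so each observation is moved to the GPU lazily, on access.
gpu_loader = gpu(train_loader)

# 2. CuIterator: requires each batch to be a (Named)Tuple of arrays,
#    but frees the previous batch's GPU memory as it advances.
cu_loader = CUDA.CuIterator(train_loader)
```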
Here we could hint at using `mapobs` to transform the dataset into something `CuIterator`-compatible. Could be a short example like

```julia
train_loader = mapobs(preprocess_transform, train_loader)
gpu_train_loader = CUDA.CuIterator(train_loader)
```

Also, mention when `CuIterator` should be preferred over `gpu`?
Is it preferred? This PR takes the line that it's not... it's picky about what types it accepts, and `finalize` doesn't matter. So it's mentioned here in case people are already using it.
If `finalize` does matter, then we should do #2240 instead.
Simpler variant that just calls `DataLoader(mapobs(gpu, data), ...)`. What this misses compared to more complex CuIterator-like things is that it does not call `finalize` afterwards. But perhaps that doesn't matter, since each call of the model will allocate so much more, and that is also not `finalize`d explicitly.

Closes #2240, closes #2186
PR Checklist