I need to insert multiple 32x32 arrays into 1024x1024 array at predetermined random locations. How to do this efficiently?

Question

Kevin Hughes on 28 May 2016

1
Link

Direct link to this question

https://in.mathworks.com/matlabcentral/answers/286258-i-need-to-insert-multiple-32x32-arrays-into-1024x1024-array-at-predetermined-random-locations-how-t

Commented: Joss Knight on 9 Jun 2016

I need to insert many (10^6) 32x32 arrays into single 1024x1024 array and sum them. Insertion location is random, but predetermined. Simple for loops result in long execution time. How can I optimize? I do have parallel processing and gpu available.

Nx=ceil(rand(1e6,1)*(1024-32)); %avoiding partial arrays at edges
Ny=ceil(rand(1e6,1)*(1024-32));
smallArray=ones(32);
largeArray=zeros(1024);
for ii=1:1e6
    largeArray(Nx(ii):Nx(ii)+31,Ny(ii):Ny(ii)+31)=largeArray(Nx(ii):Nx(ii)+31,Ny(ii):Ny(ii)+31)+smallArray;
end

2 Comments
Show NoneHide None

Image Analyst on 29 May 2016

How long?

Ahmet Cecen on 29 May 2016

On the other hand, I am not entirely sure why this would take any longer than 2 seconds. Is that slow?

Sign in to comment.

Sign in to answer this question.

Answer 1

Joss Knight on 8 Jun 2016

1
Link

Direct link to this answer

https://in.mathworks.com/matlabcentral/answers/286258-i-need-to-insert-multiple-32x32-arrays-into-1024x1024-array-at-predetermined-random-locations-how-t#answer_224822

Open in MATLAB Online

Use accumarray (and randi):

N = 1024;
Nx = randi([1 (N-32)], 1e5, 1);
Ny = randi([1 (N-32)], 1e5, 1);
linIndStart = sub2ind([N N], Nx, Ny);
smallArray = gpuArray.ones(32);
[I, J] = ndgrid(1:32);
linOffsets = sub2ind([N N], I(:), J(:)) - 1;
linInd = bsxfun(@plus, linIndStart, linOffsets');
values = repmat(smallArray(:)', numel(Nx), 1); % Replace with actual vals
largeArray = accumarray(linInd(:), values(:), [N*N 1]);
largeArray = reshape(largeArray, N, N);

If your submatrices are genuinely all just ones, then just replace the values(:) with 1.

2 Comments
Show NoneHide None

Kevin Hughes on 8 Jun 2016

Edited: Kevin Hughes on 8 Jun 2016

Open in MATLAB Online

Joss, Thanks. Actually got slightly better (8.3 vs 9.7) results with:

Nx=1024;
Np=100000;
Ns=32;
smallArray=gpuArray.ones(Ns)+1i*gpuArray.ones(Ns);
smallArrayr=reshape(smallArray,1,Ns^2);
smallArrayrr=gpuArray.ones(Np,1)*smallArrayr;
% [x(:) y(:)] will be a list of the Ns x Ns offset coordinates for
% each box element
[x, y] = meshgrid(1:Ns);
x=gpuArray(x);
y=gpuArray(y);
%random insertion locations
insertX=gpuArray(round(ceil(rand(Np,1)*(Nx-Ns))));
insertY=gpuArray(round(ceil(rand(Np,1)*(Nx-Ns))));
nx=gpuArray(insertX);
ny=gpuArray(insertY);
% Add every offset to every coordinate. The result is 256x16, that
% is one row per box, with 16 elements in each box. Now x and y
% identify every element of every box.
xx = bsxfun(@plus, nx(:), x(:)');
yy = bsxfun(@plus, ny(:), y(:)');
% Now we have arrays that identify what to add together and where
% to put the results. accumarray can do the accumulation based
% on the list of coordinates [x(:) y(:)] and the data going into
% each element bmatrixx(:) and bmatrixy(:).
largeArray = accumarray([xx(:) yy(:)], smallArrayrr(:), [Nx Nx], @sum);

Code found on this site at related link. (Actually your answer to someone else with similar issue) http://www.mathworks.com/matlabcentral/answers/264457-why-for-gpu-is-slower-than-cpu-for-this-code-is-it-because-of-sparsity-or-because-of-for-loop Sets up smallArrays a little differently, but uses accumarray.

Joss Knight on 9 Jun 2016

Oh well done - I knew I'd answered basically the same question before but I couldn't unearth the old answer!

Sign in to comment.

Answer 2

Ahmet Cecen on 29 May 2016

1
Link

Direct link to this answer

https://in.mathworks.com/matlabcentral/answers/286258-i-need-to-insert-multiple-32x32-arrays-into-1024x1024-array-at-predetermined-random-locations-how-t#answer_223747

If your small array will always be the same, you might be able to do this lightning fast with a convolution. Instead of using the random ok indices as corners, you would use them as centers. Do something like:

LargeArray(randindexvector)=1;

You can separately find repetitions using unique and add them if you happen to get the same center twice. Much smaller for loop.

Then you convolve with a 16x16 ones matrix.

There might be some rough edges in this approach that needs some smoothing, but if you can get it to work, it would be milliseconds.

2 Comments
Show NoneHide None

Kevin Hughes on 29 May 2016

Thanks. Small array is actually complex. As written I get about 20 seconds execution time. But 10^6 could be order of magnitude low and I'll need to repeat with variations on insertion locations a few thousand times which puts total at 160 hours...Not sure assumption of constant small array holds or convolution would be excellent way to go.

The first related question links to a vectorized answer that also shows promise.

Ahmet Cecen on 29 May 2016

Open in MATLAB Online

You can cascade on stages of uniqueness and vectorize the fill. Say you created 1000 random values, and say 100 of them are duplicates. You first work on the 900, then 100, then any duplicates within the 100 etc. You do something like:

 for i = 1:BiggestNumberofRepetations
    WORK HERE
 end

Hopefully this is an orders of magnitude smaller iteration. This allows you to iterate over the dimensions of the smaller matrix instead of the number of random numbers. I will burst the memory here but you can work around it. It should look something like:

 RandInd = sub2ind([1024 1024],Nx,Ny);
 for i = 1:32*32
     largeArray(RandInd+floor((i-1)/32)*1024)+mod(i,32)) = largeArray(RandInd+floor((i-1)/32)*1024)+mod(i,32)) + smallArray(i);
end

Now there is some nasty indexing there so double check that. Now your iteration count is MaximumRepetationsCount*numel(smallArray), which in most cases should be still orders of magnitude smaller.

Sign in to comment.

I need to insert multiple 32x32 arrays into 1024x1024 array at predetermined random locations. How to do this efficiently?

2 Comments
Show NoneHide None

Accepted Answer

2 Comments
Show NoneHide None

More Answers (1)

2 Comments
Show NoneHide None

See Also

Categories

Tags

Community Treasure Hunt

I need to insert multiple 32x32 arrays into 1024x1024 array at predetermined random locations. How to do this efficiently?

2 Comments Show NoneHide None

Accepted Answer

2 Comments Show NoneHide None

More Answers (1)

2 Comments Show NoneHide None

See Also

Categories

Tags

Community Treasure Hunt

2 Comments
Show NoneHide None

2 Comments
Show NoneHide None

2 Comments
Show NoneHide None