Optimize vectorized sorting - reduce code size, improve speed for large heaps by PeterSolMS · Pull Request #40613 · dotnet/runtime

PeterSolMS · 2020-08-10T13:36:38Z

There are two optimizations in this PR:

reduction of code size in the bitonic sorters: by limiting the amount of inlining in this code, we can reduce overall code size in coreclr.dll by about 180 kB.
dynamic packing: during sorting, we can switch to 32-bit sorting as soon as the address range in a partition is less 32 GB. This will only have an impact on large heaps or machines with many processors, because we already have a similar, but static optimization where we use 32-bit sorting if the overall address range in the ephemeral region is less than 32 GB. So this additional optimization will give improvements if the overall address range is greater than 32 GB initially, but becomes less during the sort. In this case, we get about a 1.6x improvement in sorting speed.

…npacking.

ghost · 2020-08-10T13:36:43Z

Tagging subscribers to this area: @dotnet/gc
See info in area-owners.md if you want to be subscribed.

Maoni0 · 2020-08-12T02:41:05Z

 #include <immintrin.h>
 //#include <stdexcept>
+//#include <limits>
 #include <assert.h>


any reason to keep these 2 lines?

- remove commented out #include lines - remove unreferenced template specializations for float, double, uint32_t and uint64_t Mitigated trap with our implementation of numeric_limits - make it so we get a compile error when an instantiation of numeric_limits is referenced that is not specialized.

PeterSolMS added 3 commits July 23, 2020 10:25

Improved vectorized sort - smaller bitonic sorters, dynamic packing/u…

2eed66b

…npacking.

Merge branch 'master' into vxsort-opt

ceb47d8

Merge branch 'master' into vxsort-opt

bca389d

PeterSolMS requested a review from Maoni0 August 10, 2020 13:36

Dotnet-GitSync-Bot added the area-GC-coreclr label Aug 10, 2020

Maoni0 reviewed Aug 12, 2020

View reviewed changes

Maoni0 approved these changes Aug 13, 2020

View reviewed changes

PeterSolMS merged commit 2fd135f into dotnet:master Aug 13, 2020

mangod9 mentioned this pull request Aug 14, 2020

coreclr.dll regressed by ~300kB due to vxsort #39600

Closed

karelz added this to the 5.0.0 milestone Aug 18, 2020

ghost locked as resolved and limited conversation to collaborators Dec 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize vectorized sorting - reduce code size, improve speed for large heaps#40613

Optimize vectorized sorting - reduce code size, improve speed for large heaps#40613
PeterSolMS merged 4 commits intodotnet:masterfrom
PeterSolMS:vxsort-opt

PeterSolMS commented Aug 10, 2020

Uh oh!

ghost commented Aug 10, 2020

Uh oh!

Maoni0 Aug 12, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

PeterSolMS commented Aug 10, 2020

Uh oh!

ghost commented Aug 10, 2020

Uh oh!

Maoni0 Aug 12, 2020

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants