8318538: Add a way to obtain a strided var handle from a layout

Reviewed-by: jvernee, pminborg
This commit is contained in:
Maurizio Cimadamore 2023-10-20 10:35:09 +00:00
parent b07da3ae15
commit 848ecc1621
6 changed files with 142 additions and 44 deletions

View File

@ -269,19 +269,87 @@ import jdk.internal.foreign.layout.UnionLayoutImpl;
* access modes. All other access modes will result in {@link UnsupportedOperationException} being thrown. Moreover, * access modes. All other access modes will result in {@link UnsupportedOperationException} being thrown. Moreover,
* while supported, access modes {@code get} and {@code set} might lead to word tearing. * while supported, access modes {@code get} and {@code set} might lead to word tearing.
* *
* <h2 id="variable-length">Working with variable-length structs</h2> * <h2 id="variable-length">Working with variable-length arrays</h2>
* *
* Memory layouts allow clients to describe the contents of a region of memory whose size is known <em>statically</em>. * We have seen how sequence layouts are used to describe the contents of an array whose size is known <em>statically</em>.
* There are, however, cases, where the size of a region of memory is only known <em>dynamically</em>, as it depends * There are cases, however, where the array size is only known <em>dynamically</em>. We call such arrays <em>variable-length arrays</em>.
* on the value of one or more struct fields. Consider the following struct declaration in C: * There are two common kinds of variable-length arrays:
* <ul>
* <li>a <em>toplevel</em> variable-length array whose size depends on the value of some unrelated variable, or parameter;</li>
* <li>an variable-length array <em>nested</em> in a struct, whose size depends on the value of some other field in the enclosing struct.</li>
* </ul>
* While variable-length arrays cannot be modelled directly using sequence layouts, clients can still enjoy structured
* access to elements of variable-length arrays using var handles as demonstrated in the following sections.
*
* <h3 id="variable-length-toplevel">Toplevel variable-length arrays</h3>
*
* Consider the following struct declaration in C:
*
* {@snippet lang=c :
* typedef struct {
* int x;
* int y;
* } Point;
* }
*
* In the above code, a point is modelled as two coordinates ({@code x} and {@code y} respectively). Now consider
* the following snippet of C code:
*
* {@snippet lang=c :
* int size = ...
* Point *points = (Point*)malloc(sizeof(Point) * size);
* for (int i = 0 ; i < size ; i++) {
* ... points[i].x ...
* }
* }
*
* Here, we allocate an array of point ({@code points}). Crucially, the size of the array is dynamically bound to the value
* of the {@code size} variable. Inside the loop, the {@code x} coordinate of all the points in the array is accessed.
* <p>
* To model this code in Java, let's start by defining a layout for the {@code Point} struct, as follows:
*
* {@snippet lang=java :
* StructLayout POINT = MemoryLayout.structLayout(
* ValueLayout.JAVA_INT.withName("x"),
* ValueLayout.JAVA_INT.withName("y")
* );
* }
*
* Since we know we need to create and access an array of points, it would be tempting to create a sequence layout modelling
* the variable-length array, and then derive the necessary access var handles from the sequence layout. But this approach
* is problematic, as the size of the variable-length array is not known. Instead, a var handle that provides structured
* access to the elements of a variable-length array can be obtained directly from the layout describing the array elements
* (e.g. the point layout), as demonstrated below:
*
* {@snippet lang=java :
* VarHandle POINT_ARR_X = POINT.arrayElementVarHandle(PathElement.groupElement("x"));
*
* int size = ...
* MemorySegment points = ...
* for (int i = 0 ; i < size ; i++) {
* ... POINT_ARR_X.get(segment, 0L, (long)i) ...
* }
* }
*
* Here, the coordinate {@code x} of subsequent point in the array is accessed using the {@code POINT_ARR_X} var
* handle, which is obtained using the {@link #arrayElementVarHandle(PathElement...)} method. This var handle
* features two {@code long} coordinates: the first is a base offset (set to {@code 0L}), while the
* second is a logical index that can be used to stride over all the elements of the point array.
* <p>
* The base offset coordinate allows clients to express complex access operations, by injecting additional offset
* computation into the var handle (we will see an example of that below). In cases where the base offset is constant
* (as in the previous example) clients can, if desired, drop the base offset parameter and make the access expression
* simpler. This is achieved using the {@link java.lang.invoke.MethodHandles#insertCoordinates(VarHandle, int, Object...)}
* var handle adapter.
*
* <h3 id="variable-length-nested">Nested variable-length arrays</h3>
*
* Consider the following struct declaration in C:
* *
* {@snippet lang=c : * {@snippet lang=c :
* typedef struct { * typedef struct {
* int size; * int size;
* struct { * Point points[];
* int x;
* int y;
* } points[];
* } Polygon; * } Polygon;
* } * }
* *
@ -290,45 +358,37 @@ import jdk.internal.foreign.layout.UnionLayoutImpl;
* the size of the {@code points} array is left <em>unspecified</em> in the C declaration, using a <em>Flexible Array Member</em> * the size of the {@code points} array is left <em>unspecified</em> in the C declaration, using a <em>Flexible Array Member</em>
* (a feature standardized in C99). * (a feature standardized in C99).
* <p> * <p>
* Memory layouts do not support sequence layouts whose size is unknown. As such, it is not possible to model * Again, clients can perform structured access to elements in the nested variable-length array using the
* the above struct directly. That said, clients can still enjoy structured access provided by memory layouts, as * {@link #arrayElementVarHandle(PathElement...)} method, as demonstrated below:
* demonstrated below:
* *
* {@snippet lang=java : * {@snippet lang=java :
* StructLayout POINT = MemoryLayout.structLayout(
* ValueLayout.JAVA_INT.withName("x"),
* ValueLayout.JAVA_INT.withName("y")
* );
*
* StructLayout POLYGON = MemoryLayout.structLayout( * StructLayout POLYGON = MemoryLayout.structLayout(
* ValueLayout.JAVA_INT.withName("size"), * ValueLayout.JAVA_INT.withName("size"),
* MemoryLayout.sequenceLayout(0, POINT).withName("points") * MemoryLayout.sequenceLayout(0, POINT).withName("points")
* ); * );
* *
* VarHandle POLYGON_SIZE = POLYGON.varHandle(0, PathElement.groupElement("size")); * VarHandle POLYGON_SIZE = POLYGON.varHandle(0, PathElement.groupElement("size"));
* VarHandle POINT_X = POINT.varHandle(PathElement.groupElement("x"));
* long POINTS_OFFSET = POLYGON.byteOffset(PathElement.groupElement("points")); * long POINTS_OFFSET = POLYGON.byteOffset(PathElement.groupElement("points"));
* } * }
* *
* Note how we have split the polygon struct in two. The {@code POLYGON} layout contains a sequence layout * The {@code POLYGON} layout contains a sequence layout of size <em>zero</em>. The element layout of the sequence layout
* of size <em>zero</em>. The element layout of the sequence layout is the {@code POINT} layout, which defines * is the {@code POINT} layout, shown previously. The polygon layout is used to obtain a var handle
* the {@code x} and {@code y} coordinates, accordingly. The first layout is used to obtain a var handle * that provides access to the polygon size, as well as an offset ({@code POINTS_OFFSET}) to the start of the
* that provides access to the polygon size; the second layout is used to obtain a var handle that provides * variable-length {@code points} array.
* access to the {@code x} coordinate of a point struct. Finally, an offset to the start of the variable-length
* {@code points} array is also obtained.
* <p> * <p>
* The {@code x} coordinates of all the points in a polygon can then be accessed as follows: * The {@code x} coordinates of all the points in a polygon can then be accessed as follows:
* {@snippet lang=java : * {@snippet lang=java :
* MemorySegment polygon = ... * MemorySegment polygon = ...
* int size = POLYGON_SIZE.get(polygon, 0L); * int size = POLYGON_SIZE.get(polygon, 0L);
* for (int i = 0 ; i < size ; i++) { * for (int i = 0 ; i < size ; i++) {
* int x = POINT_X.get(polygon, POINT.scaleOffset(POINTS_OFFSET, i)); * ... POINT_ARR_X.get(polygon, POINTS_OFFSET, (long)i) ...
* } * }
* } * }
* Here, we first obtain the polygon size, using the {@code POLYGON_SIZE} var handle. Then, in a loop, we read * Here, we first obtain the polygon size, using the {@code POLYGON_SIZE} var handle. Then, in a loop, we read
* the {@code x} coordinates of all the points in the polygon. This is done by providing a custom base offset to * the {@code x} coordinates of all the points in the polygon. This is done by providing a custom offset
* the {@code POINT_X} var handle. The custom offset is computed as {@code POINTS_OFFSET + (i * POINT.byteSize())}, where * (namely, {@code POINTS_OFFSET}) to the offset coordinate of the {@code POINT_ARR_X} var handle. As before,
* {@code i} is the loop induction variable. * the loop induction variable {@code i} is passed as the index of the {@code POINT_ARR_X} var handle,
* to stride over all the elements of the variable-length array.
* *
* @implSpec * @implSpec
* Implementations of this interface are immutable, thread-safe and <a href="{@docRoot}/java.base/java/lang/doc-files/ValueBased.html">value-based</a>. * Implementations of this interface are immutable, thread-safe and <a href="{@docRoot}/java.base/java/lang/doc-files/ValueBased.html">value-based</a>.
@ -544,6 +604,49 @@ public sealed interface MemoryLayout permits SequenceLayout, GroupLayout, Paddin
*/ */
VarHandle varHandle(PathElement... elements); VarHandle varHandle(PathElement... elements);
/**
* Creates a var handle that accesses adjacent elements in a memory segment at offsets selected by the given layout path,
* where the accessed elements have this layout, and where the initial layout in the path is this layout.
* <p>
* The returned var handle has the following characteristics:
* <ul>
* <li>its type is derived from the {@linkplain ValueLayout#carrier() carrier} of the
* selected value layout;</li>
* <li>it has a leading parameter of type {@code MemorySegment} representing the accessed segment</li>
* <li>a following {@code long} parameter, corresponding to the base offset, denoted as {@code B};</li>
* <li>a following {@code long} parameter, corresponding to the array index, denoted as {@code I0}. The array
* index is used to scale the accessed offset by this layout size;</li>
* <li>it has zero or more trailing access coordinates of type {@code long}, one for each
* <a href=#open-path-elements>open path element</a> in the provided layout path, denoted as
* {@code I1, I2, ... In}, respectively. The order of these access coordinates corresponds to the order
* in which the open path elements occur in the provided layout path.
* </ul>
* <p>
* If the provided layout path {@code P} contains no dereference elements, then the offset {@code O} of the access
* operation is computed as follows:
*
* {@snippet lang = "java":
* O = this.offsetHandle(P).invokeExact(this.scale(B, I0), I1, I2, ... In);
* }
* <p>
* More formally, this method can be obtained from the {@link #varHandle(PathElement...)}, as follows:
* {@snippet lang = "java":
* MethodHandles.collectCoordinates(varHandle(elements), 1, scaleHandle())
* }
*
* @apiNote
* As the leading index coordinate {@code I0} is not bound by any sequence layout, it can assume <em>any</em> non-negative
* value - provided that the resulting offset computation does not overflow, or that the computed offset does not fall
* outside the spatial bound of the accessed memory segment. As such, the var handles returned from this method can
* be especially useful when accessing <a href="#variable-length">variable-length arrays</a>.
*
* @param elements the layout path elements.
* @return a var handle that accesses adjacent elements in a memory segment at offsets selected by the given layout path.
* @throws IllegalArgumentException if the layout path is not <a href="#well-formedness">well-formed</a> for this layout.
* @throws IllegalArgumentException if the layout selected by the provided path is not a {@linkplain ValueLayout value layout}.
*/
VarHandle arrayElementVarHandle(PathElement... elements);
/** /**
* Creates a method handle which, given a memory segment, returns a {@linkplain MemorySegment#asSlice(long,long) slice} * Creates a method handle which, given a memory segment, returns a {@linkplain MemorySegment#asSlice(long,long) slice}
* corresponding to the layout selected by the given layout path, where the initial layout in the path is this layout. * corresponding to the layout selected by the given layout path, where the initial layout in the path is this layout.

View File

@ -27,8 +27,6 @@ package java.lang.foreign;
import java.io.UncheckedIOException; import java.io.UncheckedIOException;
import java.lang.foreign.ValueLayout.OfInt; import java.lang.foreign.ValueLayout.OfInt;
import java.lang.invoke.MethodHandle;
import java.lang.invoke.VarHandle;
import java.nio.Buffer; import java.nio.Buffer;
import java.nio.ByteBuffer; import java.nio.ByteBuffer;
import java.nio.ByteOrder; import java.nio.ByteOrder;
@ -46,8 +44,6 @@ import java.util.stream.Stream;
import jdk.internal.foreign.AbstractMemorySegmentImpl; import jdk.internal.foreign.AbstractMemorySegmentImpl;
import jdk.internal.foreign.MemorySessionImpl; import jdk.internal.foreign.MemorySessionImpl;
import jdk.internal.foreign.SegmentFactories; import jdk.internal.foreign.SegmentFactories;
import jdk.internal.foreign.StringSupport;
import jdk.internal.foreign.Utils;
import jdk.internal.javac.Restricted; import jdk.internal.javac.Restricted;
import jdk.internal.reflect.CallerSensitive; import jdk.internal.reflect.CallerSensitive;
import jdk.internal.vm.annotation.ForceInline; import jdk.internal.vm.annotation.ForceInline;
@ -134,21 +130,18 @@ import jdk.internal.vm.annotation.ForceInline;
* int value = (int) intAtOffsetHandle.get(segment, 10L); // segment.get(ValueLayout.JAVA_INT, 10L) * int value = (int) intAtOffsetHandle.get(segment, 10L); // segment.get(ValueLayout.JAVA_INT, 10L)
* } * }
* *
* The var handle returned by {@link ValueLayout#varHandle()} features a <em>base offset</em> parameter. This parameter * Alternatively, a var handle that can be used to access an element of an {@code int} array at a given logical
* allows clients to express complex access operations, by injecting additional offset computation into the var handle.
* For instance, a var handle that can be used to access an element of an {@code int} array at a given logical
* index can be created as follows: * index can be created as follows:
* *
* {@snippet lang=java: * {@snippet lang=java:
* MethodHandle scale = ValueLayout.JAVA_INT.scaleHandle(); // (long, long)long
* VarHandle intAtOffsetAndIndexHandle = * VarHandle intAtOffsetAndIndexHandle =
* MethodHandles.collectCoordinates(intAtOffsetHandle, 1, scale); // (MemorySegment, long, long) * ValueLayout.JAVA_INT.arrayElementVarHandle(); // (MemorySegment, long, long)
* int value = (int) intAtOffsetAndIndexHandle.get(segment, 2L, 3L); // segment.get(ValueLayout.JAVA_INT, 2L + (3L * 4L)) * int value = (int) intAtOffsetAndIndexHandle.get(segment, 2L, 3L); // segment.get(ValueLayout.JAVA_INT, 2L + (3L * 4L))
* } * }
* *
* <p> * <p>
* Clients can also drop the base offset parameter, in order to make the access expression simpler. This can be used to * Clients can also drop the base offset parameter, in order to make the access expression simpler. This can be used to
* implement access operation such as {@link #getAtIndex(OfInt, long)}: * implement access operations such as {@link #getAtIndex(OfInt, long)}:
* *
* {@snippet lang=java: * {@snippet lang=java:
* VarHandle intAtIndexHandle = * VarHandle intAtIndexHandle =

View File

@ -196,6 +196,10 @@ public abstract sealed class AbstractLayout<L extends AbstractLayout<L> & Memory
Set.of(), elements); Set.of(), elements);
} }
public VarHandle arrayElementVarHandle(PathElement... elements) {
return MethodHandles.collectCoordinates(varHandle(elements), 1, scaleHandle());
}
public MethodHandle sliceHandle(PathElement... elements) { public MethodHandle sliceHandle(PathElement... elements) {
return computePathOp(LayoutPath.rootPath((MemoryLayout) this), LayoutPath::sliceHandle, return computePathOp(LayoutPath.rootPath((MemoryLayout) this), LayoutPath::sliceHandle,
Set.of(PathKind.DEREF_ELEMENT), elements); Set.of(PathKind.DEREF_ELEMENT), elements);

View File

@ -82,8 +82,8 @@ public class TestAdaptVarHandles {
} }
} }
static final VarHandle intHandleIndexed = MethodHandles.collectCoordinates(ValueLayout.JAVA_INT.varHandle(), static final VarHandle intHandleIndexed = MethodHandles.insertCoordinates(
1, MethodHandles.insertArguments(ValueLayout.JAVA_INT.scaleHandle(), 0, 0L)); ValueLayout.JAVA_INT.arrayElementVarHandle(), 1, 0L);
static final VarHandle intHandle = MethodHandles.insertCoordinates(ValueLayout.JAVA_INT.varHandle(), 1, 0L); static final VarHandle intHandle = MethodHandles.insertCoordinates(ValueLayout.JAVA_INT.varHandle(), 1, 0L);

View File

@ -292,8 +292,7 @@ public class TestArrayCopy {
} }
private static VarHandle arrayVarHandle(ValueLayout layout) { private static VarHandle arrayVarHandle(ValueLayout layout) {
return MethodHandles.collectCoordinates(layout.varHandle(), return MethodHandles.insertCoordinates(layout.arrayElementVarHandle(), 1, 0L);
1, MethodHandles.insertArguments(layout.scaleHandle(), 0, 0L));
} }
public static MemorySegment truthSegment(MemorySegment srcSeg, CopyHelper<?, ?> helper, int indexShifts, CopyMode mode) { public static MemorySegment truthSegment(MemorySegment srcSeg, CopyHelper<?, ?> helper, int indexShifts, CopyMode mode) {

View File

@ -44,7 +44,6 @@ public class JavaLayouts {
static final VarHandle VH_LONG = arrayVarHandle(JAVA_LONG); static final VarHandle VH_LONG = arrayVarHandle(JAVA_LONG);
private static VarHandle arrayVarHandle(ValueLayout layout) { private static VarHandle arrayVarHandle(ValueLayout layout) {
return MethodHandles.collectCoordinates(layout.varHandle(), return MethodHandles.insertCoordinates(layout.arrayElementVarHandle(), 1, 0L);
1, MethodHandles.insertArguments(layout.scaleHandle(), 0, 0L));
} }
} }