关于gpu:WebGPU-规范篇-01-规范基础

62次阅读

共计 7086 个字符，预计需要花费 18 分钟才能阅读完成。

系列博客总目录：https://segmentfault.com/a/1190000040716735

WebGPU 基本概念节选翻译，选自 WebGPU Spec 3 Fundamentals，本文列举一部分简略的内容，其余内容当前独自拉出去和其余材料一起谈。

节选自 WebGPU Spec 3.1 Conventions

WebGPU 接口是外部对象的公开拜访模式，和大多数语言的接口语法提供的性能差不多。

这里只需注意，继承了 GPUObjectBase 的接口都是 WebGPU 接口。

interface mixin GPUObjectBase {attribute USVString? label;};

label 字段用来形容对象本身，可空。

对象形容者蕴含创立一个对象所需的信息，通常调用 GPUDevice 的 createXXX 办法实现。

dictionary GPUObjectDescriptorBase {USVString label;};

此处的 label 和 GPUObjectBase 的 label 含意差不多。

节选自 WebGPU Spec 3.3 Coordinate Systems

在 NDC 中，Y 轴朝上。NDC 的点 (-1.0, -1.0) 位于 NDC 的左下角。NDC 的 X 和 Y 的最大最小值为 1 和 -1，Z 的取值范畴是 [0, 1]。NDC 中超出此范畴的点会被剪裁。
在 Framebuffer、视口坐标系和 Fragment/ 像素坐标系中 Y 轴朝下。原点 (0, 0) 位于这几个坐标系的左上角。
视窗坐标和帧缓存（Framebuffer）坐标相匹配。
纹理坐标的原点 (0, 0) 示意纹理数据的第一个纹素（texel）。

WebGPU 的坐标系和 DirectX 坐标系匹配。

节选自 WebGPU Spec 3.5 Core Internal Objects

适配器对象示意 WebGPU 在具体零碎中古代图形接口的实现。如果适配器对象变得不可用了，那就会被标记为有效，并且永远不会再变无效，并且，连带着由它创立的设施等对象一并变得有效。

适配器对象能够简略了解为图形处理器，既能够是真的物理显卡，也能够是由模仿进去的的软显卡。不同的适配器对象容许指向同一个显卡，譬如适配器 A 和适配器 B，只是在申请参数上略有不同。

适配器对象在 WebGPU 中由 GPUAdapter 接口实现。

对于适配器对象的申请与创立过程，参考初始化局部的内容。

设施是适配器的逻辑性能上的实现，所有的 WebGPU 子对象均由设施对象创立，它能够在专用型 WebWorker 中应用。

如果设施对象生效了，那么由设施对象创立的子对象也会变得不可用。

设施对象在 WebGPU 中由 GPUDevice 接口实现。

对于设施对象的申请与创立过程，参考初始化局部的内容。

节选自 WebGPU Spec 3.6 Optional Capabilities

性能，指的是一组可选的 WebGPU 性能，并不是所有的平台都反对，通常是操作系统或者显卡硬件不反对。

每个 GPUAdapter 都会带有一个 features 对象，在申请设施对象时只能用这个对象内的性能，也只有在申请设施对象时，能力设置这个行将被创立的设施对象能有哪些性能。

无关 WebGPU 性能的具体形容，参考 WebGPU Spec 23. Feature Index 后列举如下：

译者注

有点蛋疼的是，你间接拜访 adapter.features 它只是个相似 Set 的对象，而不是一般的对象能看到外面有什么反对的性能，要你本人去拜访 values …

启用这个性能时，GPUPrimitiveState 接口的 clampDepth 属性能力被设置：

 const renderPipeline = device.createRenderPipeline({
  /*...*/,
  primitive: {
    /* ... */,
    clampDepth: true
  }
})

启用这个性能时，能力创立 “depth24unorm-stencil8” 格局的纹理对象：

 const texture = device.createTexture({
  /* ... */,
  format: "depth24unorm-stencil8"
})

启用这个性能时，能力创立 “depth32float-stencil8” 格局的纹理对象：

 const texture = device.createTexture({
  /* ... */,
  format: "depth32float-stencil8"
})

启用这个性能时，枚举类型 GPUQueryType 能力用。

启用这个性能时，能力创立 BC 格局的纹理：

 const texture = device.createTexture({
  /* ... */,
  format: "bc1-rgba-unorm" / "bc1-rgba-unorm-srgb" / "bc2-rgba-unorm" / "bc2-rgba-unorm-srgb" / "bc3-rgba-unorm" / "bc3-rgba-unorm-srgb" / "bc4-r-unorm" / "bc4-r-snorm" / "bc5-rg-unorm" / "bc5-rg-snorm" / "bc6h-rgb-ufloat" / "bc6h-rgb-float" / "bc7-rgba-unorm" / "bc7-rgba-unorm-srgb"
})

启用这个性能后，枚举类型 GPUQueryType 中的 "timestamp" 枚举能力应用。

每个限度属性的值类型都是数字。每个限度对象（适配器对象和设施对象的一个属性）上的限度属性个数都是一样的，区别就是他们的值有可能不太一样。

设施对象的限度优先级高于适配器的，因为大多时候用的是设施对象，而且设施对象是由适配器对象申请的，适配器对象申请设施对象的时候能够传递对设施对象的限度需要。

留神，每条限度要与硬件适配，不要在挪动设施上把限度放得很宽，否则容易影响性能。正当的限度须要因具体硬件而定。

每条限度都有一个默认值，如果适配器在申请设施时，没有传递限度参数 requiredLimits，那么就会应用默认值。

当初，对限度值进行逐条解释。

限度名	类型	倡议值	默认值	解释
maxTextureDimension1D	unsigned long	可设更高	8192	创立一维纹理对象时，`size` 属性最大允许值
maxTextureDimension2D	unsigned long	可设更高	8192	创立二维纹理对象时，`size` 属性最大允许值
maxTextureDimension3D	unsigned long	可设更高	8192	创立三维纹理对象时，`size` 属性最大允许值
maxTextureArrayLayers	unsigned long	可设更高	256	`GPUExtent3DDict` 对象的 `depthOrArrayLayers` 属性的最大值
maxBindGroups	unsigned long	可设更高	4	创立 `GPUPipelineLayout` 时，绑定组的布局对象数量（也即绑定组的最大个数）
maxDynamicUniformBuffersPerPipelineLayout	unsigned long	可设更高	8	创立 `GPUPipelineLayout` 时，每个管线的布局对象中容许设置带有动静偏移值的 UBO 的最大数量
maxDynamicStorageBuffersPerPipelineLayout	unsigned long	可设更高	4	创立 `GPUPipelineLayout` 时，每个管线的布局对象中容许设置带有动静偏移值的存储型 GPUBuffer 的最大数量
maxSampledTexturesPerShaderStage	unsigned long	可设更高	16	（在其可能用到的着色器阶段中）绑定组的布局对象中容许设置采样纹理的最大个数
maxStorageBuffersPerShaderStage	unsigned long	可设更高	8	（在其可能用到的着色器阶段中）绑定组的布局对象中容许设置的存储型 GPUBuffer 的最大个数
maxStorageTexturesPerShaderStage	unsigned long	可设更高	4	（在其可能用到的着色器阶段中）绑定组的布局对象中容许设置的存储型纹理的最大个数
maxUniformBuffersPerShaderStage	unsigned long	可设更高	12	（在其可能用到的着色器阶段中）绑定组的布局对象中容许设置的 UBO 的最大个数
maxUniformBufferBindingSize	unsigned longlong	可设更高	16384	绑定组布局对象的 entry 中 UBO 的 `size` 属性的最大值
maxStorageBufferBindingSize	unsigned longlong	可设更高	13427728(128MB)	绑定组布局对象的 entry 中存储型或只读存储型 (`storage` 或 `read-only-storage`) GPUBuffer 的 `size` 属性的最大值
minUniformBufferOffsetAlignment	unsigned long	尽量设小	256	绑定组布局对象的 entry 中 UBO 的 offset 值，以及 `setBindGroup` 办法的动静 offset 值，这是一个对齐值
minStorageBufferOffsetAlignment	unsigned long	尽量设小	256	绑定组布局对象的 entry 中存储型或只读存储型 (`storage` 或 `read-only-storage`) GPUBuffer 的 offset 值，以及 `setBindGroup` 办法的动静 offset 值，这是一个对齐值
maxVertexBuffers	unsigned long	可设更高	8	渲染管线能创立最大 VBO 的个数
maxVertexAttributes	unsigned long	可设更高	16	渲染管线中 VBO 的最大顶点属性个数
maxVertexBufferArrayStride	unsigned long	可设更高	2048	渲染管线中 VBO 的最大字节偏移量
maxInterStageShaderComponents	unsigned long	可设更高	60	不同渲染阶段通信时，在通信中传递的变量中容许的最大组件个数（例如顶点阶段到片元阶段）
maxComputeWorkgroupStorageSize	unsigned long	可设更高	16352	计算着色器入口函数中工作组容许应用的最大空间（单位：字节）
maxComputeInvocationsPerWorkgroup	unsigned long	可设更高	256	计算着色器入口函数中 workgroup_size 乘积的最大值
maxComputeWorkgroupSizeX	unsigned long	可设更高	256	计算着色器入口函数中 workgroup_size 中 X 维度的最大值
maxComputeWorkgroupSizeY	unsigned long	可设更高	256	计算着色器入口函数中 workgroup_size 中 Y 维度的最大值
maxComputeWorkgroupSizeZ	unsigned long	可设更高	64	计算着色器入口函数中 workgroup_size 中 Z 维度的最大值
maxComputeWorkgroupsPerDimension	unsigned long	可设更高	65535	计算通道编码器的 `dispatch(x, y, z)` 函数参数中的最大值

GPUSupportedLimits 接口裸露了适配器或者设施对象的最大限度，间接拜访 device.limits 或 adapter.limits 即可，是一个对象。

 [Exposed=(Window, DedicatedWorker)]
interface GPUSupportedLimits {
  readonly attribute unsigned long maxTextureDimension1D;
  readonly attribute unsigned long maxTextureDimension2D;
  readonly attribute unsigned long maxTextureDimension3D;
  readonly attribute unsigned long maxTextureArrayLayers;
  readonly attribute unsigned long maxBindGroups;
  readonly attribute unsigned long maxDynamicUniformBuffersPerPipelineLayout;
  readonly attribute unsigned long maxDynamicStorageBuffersPerPipelineLayout;
  readonly attribute unsigned long maxSampledTexturesPerShaderStage;
  readonly attribute unsigned long maxSamplersPerShaderStage;
  readonly attribute unsigned long maxStorageBuffersPerShaderStage;
  readonly attribute unsigned long maxStorageTexturesPerShaderStage;
  readonly attribute unsigned long maxUniformBuffersPerShaderStage;
  readonly attribute unsigned long long maxUniformBufferBindingSize;
  readonly attribute unsigned long long maxStorageBufferBindingSize;
  readonly attribute unsigned long minUniformBufferOffsetAlignment;
  readonly attribute unsigned long minStorageBufferOffsetAlignment;
  readonly attribute unsigned long maxVertexBuffers;
  readonly attribute unsigned long maxVertexAttributes;
  readonly attribute unsigned long maxVertexBufferArrayStride;
  readonly attribute unsigned long maxInterStageShaderComponents;
  readonly attribute unsigned long maxComputeWorkgroupStorageSize;
  readonly attribute unsigned long maxComputeInvocationsPerWorkgroup;
  readonly attribute unsigned long maxComputeWorkgroupSizeX;
  readonly attribute unsigned long maxComputeWorkgroupSizeY;
  readonly attribute unsigned long maxComputeWorkgroupSizeZ;
  readonly attribute unsigned long maxComputeWorkgroupsPerDimension;
};

每项的解释见上文。

GPUSupportedFeatures 即 device.features 和 adapter.features 对象所实现的接口。

这个对象不是一般的 JavaScript 对象，而是相似 ES6 中 Set 一样的汇合，你能够应用 has 办法来查看有什么性能。它只能增加 GPUFeatureName（定义在 WebGPU Spec 4.4.1.1 GPUFeatureName）枚举中的值。

 [Exposed=(Window, DedicatedWorker)]
interface GPUSupportedFeatures {readonly setlike<DOMString>;};
 
enum GPUFeatureName {
  "depth-clamping",
  "depth24unorm-stencil8",
  "depth32float-stencil8",
  "pipeline-statistics-query",
  "texture-compression-bc",
  "timestamp-query",
};

留神，GPUFeatureName 会缓缓增加，然而浏览器实现不肯定就跟得上进度，这时候能够用判断优雅地解决：

 if (adapter.features.has('unknown-feature')) {// Use unknown-feature} else {console.warn('unknown-feature is not supported by this adapter.');
}

正文完

gpu

发表至： gpu

2021-09-22

0

关于gpu:WebGPU-的类型约定之WebIDL

关于gpu:优优工场云原生的GPU虚拟化AI平台产品-XPU-通过亚马逊-AWS-Marketplace对外发布

关于gpu:如何评估计算机的算力从-CPU-到-GPU-的演变

关于gpu:基于阿里云GPU云服务器的AIACC助力UC搜索业务性能提效380每年节省数千万成本

关于gpu:WebGPU-的类型约定之WebIDL

关于gpu:WebGPU-规范篇-01-规范基础

1 公共约定

① WebGPU 接口

② 对象形容者

2 坐标零碎

3 外围对象

Adapter（适配器）

Devices（设施）

4 WebGPU 的性能列表及其最大限度

4.2 性能

① depth-clamping

② depth24unorm-stencil8

③ depth32float-stencil8

④ pipeline-statistics-query

⑤ texture-compression-bc

⑥ timestamp-query

4.1 限度

4.1.1 GPUSupportedLimits

4.1.2 GPUSupportedFeatures

Just My Socks（注册教程内含优惠码）

	const renderPipeline = device.createRenderPipeline({
	/.../,
	primitive: {
	/* ... */,
	clampDepth: true
	}
	})

	const texture = device.createTexture({
	/* ... */,
	format: "depth24unorm-stencil8"
	})

	const texture = device.createTexture({
	/* ... */,
	format: "depth32float-stencil8"
	})

	const texture = device.createTexture({
	/* ... */,
	format: "bc1-rgba-unorm" / "bc1-rgba-unorm-srgb" / "bc2-rgba-unorm" / "bc2-rgba-unorm-srgb" / "bc3-rgba-unorm" / "bc3-rgba-unorm-srgb" / "bc4-r-unorm" / "bc4-r-snorm" / "bc5-rg-unorm" / "bc5-rg-snorm" / "bc6h-rgb-ufloat" / "bc6h-rgb-float" / "bc7-rgba-unorm" / "bc7-rgba-unorm-srgb"
	})

	[Exposed=(Window, DedicatedWorker)]
	interface GPUSupportedLimits {
	readonly attribute unsigned long maxTextureDimension1D;
	readonly attribute unsigned long maxTextureDimension2D;
	readonly attribute unsigned long maxTextureDimension3D;
	readonly attribute unsigned long maxTextureArrayLayers;
	readonly attribute unsigned long maxBindGroups;
	readonly attribute unsigned long maxDynamicUniformBuffersPerPipelineLayout;
	readonly attribute unsigned long maxDynamicStorageBuffersPerPipelineLayout;
	readonly attribute unsigned long maxSampledTexturesPerShaderStage;
	readonly attribute unsigned long maxSamplersPerShaderStage;
	readonly attribute unsigned long maxStorageBuffersPerShaderStage;
	readonly attribute unsigned long maxStorageTexturesPerShaderStage;
	readonly attribute unsigned long maxUniformBuffersPerShaderStage;
	readonly attribute unsigned long long maxUniformBufferBindingSize;
	readonly attribute unsigned long long maxStorageBufferBindingSize;
	readonly attribute unsigned long minUniformBufferOffsetAlignment;
	readonly attribute unsigned long minStorageBufferOffsetAlignment;
	readonly attribute unsigned long maxVertexBuffers;
	readonly attribute unsigned long maxVertexAttributes;
	readonly attribute unsigned long maxVertexBufferArrayStride;
	readonly attribute unsigned long maxInterStageShaderComponents;
	readonly attribute unsigned long maxComputeWorkgroupStorageSize;
	readonly attribute unsigned long maxComputeInvocationsPerWorkgroup;
	readonly attribute unsigned long maxComputeWorkgroupSizeX;
	readonly attribute unsigned long maxComputeWorkgroupSizeY;
	readonly attribute unsigned long maxComputeWorkgroupSizeZ;
	readonly attribute unsigned long maxComputeWorkgroupsPerDimension;
	};

	[Exposed=(Window, DedicatedWorker)]
	interface GPUSupportedFeatures {readonly setlike<DOMString>;};

	enum GPUFeatureName {
	"depth-clamping",
	"depth24unorm-stencil8",
	"depth32float-stencil8",
	"pipeline-statistics-query",
	"texture-compression-bc",
	"timestamp-query",
	};

	if (adapter.features.has('unknown-feature')) {// Use unknown-feature} else {console.warn('unknown-feature is not supported by this adapter.');
	}

关于gpu:WebGPU-规范篇-01-规范基础

1 公共约定

① WebGPU 接口

② 对象形容者

2 坐标零碎

3 外围对象

Adapter（适配器）

Devices（设施）

4 WebGPU 的性能列表及其最大限度

4.2 性能

① depth-clamping

② depth24unorm-stencil8

③ depth32float-stencil8

④ pipeline-statistics-query

⑤ texture-compression-bc

⑥ timestamp-query

4.1 限度

4.1.1 GPUSupportedLimits

4.1.2 GPUSupportedFeatures

Just My Socks（注册教程 内含优惠码）

Just My Socks（注册教程内含优惠码）