Since it's Friday, I'll be holding a Q&A stream tonight at 19:00 MSK. I'll be glad to answer everyone. You can ask questions in the comments under this post or directly on the stream.
👍6
You've surely got plenty of questions piled up.
So here's a stream where you can ask anything. Post questions in the YouTube chat or under this post. Starting in 5 minutes.

https://www.youtube.com/live/Oy2LpjdmjFg?feature=shared
👍3
Forwarded from viruseg
I wrote an article on how to work with the Gradient class from Burst. Along the way I was rather stunned that my implementation of the Evaluate method turned out to be several times faster than the C++ implementation.
https://habr.com/ru/articles/761572/
👍14😐3
Finally!
There's a new article on .NET object layouts.
Part 1 covers the layouts of:
1. System.Object
2. T[] array
3. string

The article also shows how to access these types in an unsafe or unmanaged environment: how to change an object's type, or get a pointer to an array/string length so it can be changed later.

https://meetemq.com/2023/09/27/managed-primitives-part-i/
🔥6👍1
👋 Stream announcement

Hi everyone, it's been a while since there was any news here. That's because I've been working a lot and implementing some very interesting things.

I'm more or less done with work, so I'm launching a Q&A stream about unsafe and all that. You can post questions ahead of time in the comments under this post.

When? 16.01.2024
18:00 CET (20:00 MSK, 19:00 Kyiv)

I'll post the link closer to the start.
🔥12
We start in an hour and a half.
Post your questions in the comments under this post or the previous one.

18:00 CET
20:00 MSK
19:00 Kyiv

https://www.youtube.com/watch?v=vTXDPntqs6Y
🔥9👍2
Everyone knows what happened.
To those who sympathize: my condolences. I'm only just recovering myself.
😢19
OK, so I've managed to call MDI rendering on DX12 and Vulkan.
Previously I worked on MDI for DX11 via NvAPI, and it worked, though NVIDIA doesn't have an API for passing an indirect count; at least the CPU-side count worked fine.

Unity mentioned that they might support MDI in BRG, but they actually don't, and they don't even plan to support it in Unity 6, so I made my own little takeover.

There's a working proof of concept.
The material, shader, and render targets (basically a PSO) are all set by Unity, so we don't need to go fully native.

Writing native rendering plugins for Unity is hard, mostly due to the lack of documentation, and even where documentation exists, there's a big chance it will not work.

😁 You are welcome to ask any questions in the comments; I will answer them in the upcoming stream.
👍9🔥3
Today I'll do a stream with an overview of the Unity Native Rendering Plugin
and answers to questions.

Join in!

19:00 MSK
19:00 Kyiv
18:00 CEST

https://www.youtube.com/watch?v=XcjNVTHRxqI
👍6🔥32
Starting in 3 minutes!
Interesting discovery about Unity when using Vulkan.
See, Vulkan has two entry points for resolving functions: vkGetInstanceProcAddr and vkGetDeviceProcAddr.
vkGetInstanceProcAddr returns functions for a given VkInstance.
vkGetDeviceProcAddr, on the other hand, returns functions for a given VkDevice OR a device child, e.g. VkQueue and VkCommandBuffer.
According to the docs, vkGetDeviceProcAddr is preferred when resolving device/-child functions, because the returned address incurs less overhead (pointers obtained through vkGetInstanceProcAddr may go through dispatch code that has to find the right implementation for the device first).

So, Unity requests vkGetDeviceProcAddr from Vulkan (or from the native plugin if hooked with InterceptVulkanInitialization), but never actually uses it.
That means all of the device/-child functions carry that overhead: vkCmd* functions, vkQueue* functions, etc. Those functions are called with insane frequency: in draw calls, when uploading constants, binding buffer ranges, etc.

The good thing is that, via a native plugin, we can reroute vkGetInstanceProcAddr to return the same pointers vkGetDeviceProcAddr would. That can give a potential performance increase when the event count is high.

How much performance? Well, you never know before you try. I think for 10K calls on, say, Android it could be measurable, like 0.5ms or something, but that's just my speculation.
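
To make the rerouting idea concrete, here is a toy C++ model of it. It does not use the real Vulkan headers: Device, CmdDrawFn, getInstanceProc, getDeviceProc and hookedGetInstanceProc are all hypothetical stand-ins invented for this sketch, and the "trampoline" only imitates the extra dispatch hop an instance-level pointer may go through. A real plugin would also have to capture the VkDevice once it is created; here the device is simply passed in.

```cpp
#include <cassert>
#include <cstring>

// Toy stand-in for a VkDevice; NOT a real Vulkan type.
struct Device {
    int drawCalls = 0;     // how many times the "driver" function ran
    int dispatchHops = 0;  // how many times we went through the trampoline
};

using CmdDrawFn = void (*)(Device&);

// The actual "driver" implementation of a device-level command.
inline void driverCmdDraw(Device& d) { d.drawCalls++; }

// Imitates loader dispatch code: one extra hop before the driver function.
inline void trampolineCmdDraw(Device& d) {
    d.dispatchHops++;
    driverCmdDraw(d);
}

// Models an instance-level resolver: hands out the dispatching trampoline.
inline CmdDrawFn getInstanceProc(const char* name) {
    assert(name != nullptr);
    return std::strcmp(name, "cmdDraw") == 0 ? &trampolineCmdDraw : nullptr;
}

// Models a device-level resolver: hands out the direct driver entry.
inline CmdDrawFn getDeviceProc(const Device&, const char* name) {
    assert(name != nullptr);
    return std::strcmp(name, "cmdDraw") == 0 ? &driverCmdDraw : nullptr;
}

// The hook: answer instance-level queries with device-level pointers
// whenever a device is already known and it can resolve the name.
inline CmdDrawFn hookedGetInstanceProc(const Device* device, const char* name) {
    if (device != nullptr) {
        if (CmdDrawFn direct = getDeviceProc(*device, name)) {
            return direct;
        }
    }
    return getInstanceProc(name);  // fall back to the original behaviour
}
```

In this model, a call through the pointer from getInstanceProc bumps dispatchHops, while a call through the pointer from hookedGetInstanceProc goes straight to the driver function. Whether the saved hop is measurable in a real Unity build is exactly the open question from the post.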
👍4
Understanding GPU Virtual Addressing and Sparse Images/Buffers

Since the days of DirectX 11, and possibly even earlier, it's been possible to allocate memory on a GPU without actually using physical memory right away. But what does this mean?

Imagine you can create a buffer with a size of 64GB, even if your GPU only has 4GB of actual VRAM. How is this possible?

This works similarly to how virtual addresses work on a CPU. When you ask the operating system for memory, it doesn't immediately use real physical memory (RAM). Instead, it gives you a virtual address. The actual physical memory is only used when you start using that memory.

When you create a sparse buffer on a GPU, it only allocates a mapping table that looks something like this:
Page 0 = Address0
Page 1 = Address1
...


If you try to read this memory before it is backed by real memory, it will typically return zero, because the memory doesn't actually exist yet (the exact behavior depends on the API and the hardware feature tier).

Next, you allocate real, physical memory. This memory is usually aligned in pages (typically 64KB on modern GPUs). For example, let's say we allocate 2 pages, which equals 128KB. Then, we can bind these pages to the virtual address.

You can tell the GPU: "Bind my BufferAddress + 1GB (16384 pages) to the start of my allocated data." The mapping table then updates like this:

Page 16383 = NULL [previous value]
Page 16384 = AllocatedData + 0
Page 16385 = AllocatedData + 65536 Bytes
Page 16386 = NULL [previous value]


After binding the real memory to the virtual address, you can read or write to it in your shaders, compute passes, etc. Essentially, your 64GB buffer only takes up the size of the mapping table plus the 128KB of allocated real memory.
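
The mapping-table idea above can be sketched on the CPU. This is only a simulation under stated assumptions, not a GPU API: SparseBuffer, bind, read, write and boundPages are hypothetical names made up for the example, the 64KB page size is the typical value mentioned in the post, and the table is a hash map (a real mapping table would not be) so that a 64GB virtual range costs nothing until pages are bound. Reads from unbacked pages return zero here to match the description in the post.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <vector>

constexpr std::uint64_t kPageSize = 64 * 1024;  // 64KB pages (assumed)

// CPU-side simulation of a sparse buffer: a virtual range plus a page table.
class SparseBuffer {
public:
    explicit SparseBuffer(std::uint64_t virtualSize) : virtualSize_(virtualSize) {}

    // Bind pageCount pages of real memory starting at a page-aligned virtual offset.
    void bind(std::uint64_t virtualOffset, std::uint8_t* memory, std::uint64_t pageCount) {
        assert(virtualOffset % kPageSize == 0);
        assert(virtualOffset + pageCount * kPageSize <= virtualSize_);
        const std::uint64_t firstPage = virtualOffset / kPageSize;
        for (std::uint64_t i = 0; i < pageCount; ++i) {
            pageTable_[firstPage + i] = memory + i * kPageSize;
        }
    }

    // Reading an unbacked page yields zero in this model.
    std::uint8_t read(std::uint64_t address) const {
        auto it = pageTable_.find(address / kPageSize);
        if (it == pageTable_.end()) return 0;
        return it->second[address % kPageSize];
    }

    // Writing to an unbacked page is dropped; returns whether the write landed.
    bool write(std::uint64_t address, std::uint8_t value) {
        auto it = pageTable_.find(address / kPageSize);
        if (it == pageTable_.end()) return false;
        it->second[address % kPageSize] = value;
        return true;
    }

    std::size_t boundPages() const { return pageTable_.size(); }

private:
    std::uint64_t virtualSize_;
    std::unordered_map<std::uint64_t, std::uint8_t*> pageTable_;  // page index -> real memory
};
```

Usage mirrors the example in the post: create a SparseBuffer of 64GB, allocate a 128KB std::vector as the "physical" memory, and bind its two pages at offset 1GB. Pages 16384 and 16385 then resolve to the vector's storage, while every other address still reads as zero.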
🔥72👍2
Implemented bindless for Unity. Compute and fragment shaders are supported for now. No weird trickery with compiling shaders manually or anything else. Just normal Unity shaders and a native plugin.
👍15
Unity LockBufferForWrite: When You Should Prefer Them and Why Your Choice Matters

I wrote a short summary post on when to use LockBufferForWrite and when not to, and how it differs from regular GraphicsBuffers.

https://meetemq.com/2025/01/26/lockbufferforwrite-vs-other-buffer-types/
🔥63👍1👏1💅1
Compiling Shaders on Device with Fully Dynamic Shader

The idea is that in Unity you can load a shader asset via an asset bundle. In this asset, there is either compiled bytecode (for Metal or Vulkan, as well as DX11 DXBC / DX12 DXIL) or text.
The binary format of the asset bundle is known — there are plenty of open-source rippers on GitHub.
The binary format of the shader is also known.

This leaves only compiling the shader. The simplest case is when you have an Android device, your code is in GLSL, and you only need OpenGL ES. In that case, simply write the text into the shader asset.

[First you need to add the available shader variants to the shader asset, since they are always stored there; this has to be done in any case.]

It's more complicated when you have GLSL and Vulkan:
You then need to build a GLSL-to-SPIR-V compiler for Android (glslang or shaderc; SPIRV-Cross itself goes the other way, from SPIR-V back to source). These are written in C++, so there shouldn't be any issues.
If you prefer HLSL, feel free to port DXC to Android. That shouldn't be too hard either.

The resulting output is also written into the shader inside the asset bundle.

Then load the asset bundle into Unity.

??????

PROFIT!
🔥5
A long time ago I made a proof of concept for bindless textures in Unity.
Now it's open sourced and available for public use!

Bindless resources are the core of any GPU Driven Rendering pipeline (along with MDI).
An MDI plugin can follow if there are requests.

https://github.com/Meetem/DX12BindlessUnity
🔥16
2025/10/24 10:06:30