skyline

mirror of https://github.com/skyline-emu/skyline.git synced 2024-12-27 09:31:52 +01:00

Author	SHA1	Message	Date
Billy Laws	7e088ca465	Fix constbuf updates to actually increment the write offset Uses the register directly now as when we modify it we want the changes to be visible from macros too.	2022-04-14 14:14:52 +05:30
PixelyIon	d2f3479610	Use `eB5G6R5UnormPack16` VkFormat for `B5G6R5Unorm` and `R5G6B5Unorm` Using `eB5G6R5UnormPack16` (with a swizzle for `R5G6B5Unorm`) removes the need for `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` when those formats are aliased which happens in Sonic Mania among other titles.	2022-04-14 14:14:52 +05:30
PixelyIon	24d7066d8b	Add quirk to avoid `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` on Adreno GPUs Adreno GPUs have significant performance penalties from usage of `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` which require disabling UBWC and on Turnip, forces linear tiling. As a result, it's been made an optional quirk which doesn't supply the flag in `VkImageCreateInfo` and logs a warning if a view with a different Vulkan format from the original image is created.	2022-04-14 14:14:52 +05:30
PixelyIon	731d06010d	Set `eMutableFormat` in Texture Image Creation We often need to alias the underlying data as multiple Vulkan formats which requires the `eMutableFormat` bit to be set in `VkImageCreateInfo`, without doing this there'll be validation layer errors and potentially GPU bugs.	2022-04-14 14:14:52 +05:30
PixelyIon	dafcfa68ca	Transition texture layout to `eGeneral` after creation As we no longer set the layout to general inside the Texture constructor, yet, we need it to be set prior to the image being used as an attachment. We need to transition the layout to `eGeneral` after creation of the texture object.	2022-04-14 14:14:52 +05:30
MK73DS	647cb07dc8	Stub functions in IAccountServiceForApplication: - GetUserCount - InitializeApplicationInfo - IsUserAccountSwitchLocked	2022-04-14 14:14:52 +05:30
PixelyIon	730bf504f8	Correct Adreno texture binding quirk We incorrectly determined an Adreno driver bug to require padding between binding slots but the real issue was not supporting consecutive binding writes for `VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER` and was fixed by the padding slot unintentionally requiring individual writes. The quirk has now been corrected to explicitly specify this as the bug and the solution is more apt.	2022-04-14 14:14:52 +05:30
PixelyIon	da8cb48933	Fix Interval Map `GetAlignedRecursiveRange` lookup bug Any lookups done using `GetAlignedRecursiveRange` incorrectly added intervals in the exclusive interval entry lookups as the condition for adding them was the reverse of what it should've been due to a last minute refactor, it led to graphical glitches and crashes. This has been fixed and the lookups should return the correct results.	2022-04-14 14:14:52 +05:30
PixelyIon	f2faa74707	Fix crashes due to `SEGV_ACCERR` check On certain devices, accesses to a protected memory region can return `si_code` as non-`SEGV_ACCERR` values, this leads to a crash as we only pass access violations to the trap handler and would lead to not doing so on those devices which would then result in going to the crash handler.	2022-04-14 14:14:52 +05:30
PixelyIon	77e2797219	Delete expired `weak_ptr`s for Texture/Buffer views A large amount of Texture/Buffer views would expire before reuse could occur in `Texture::GetView`/`Buffer::GetView`. These can lead to a substantial memory allocation given enough time and they are now deleted during the lookup while iterating on all entries. It should be noted that there are a lot of duplicate views that don't live long enough to be reused and the ultimate solution here is to make those views live long enough to be reused.	2022-04-14 14:14:52 +05:30
PixelyIon	881bb969c4	Implement access-driven Buffer synchronization Similar to constant redundant synchronization for textures, there is a lot of redundant synchronization of buffers. Albeit, buffer synchronization is far cheaper than texture synchronization it still has associated costs which have now been reduced by only synchronizing on access.	2022-04-14 14:14:52 +05:30
PixelyIon	7532eaf050	Attach Texture to Cycle in `Texture::TransitionLayout` Not doing so could result in the texture being destroyed before the completion of a transition and lead to undefined behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	3268b3779a	Implement access-driven Texture synchronization There was a lot of redundant synchronization of textures to and from host constantly as we were not aware of guest memory access, this has now been averted by tracking any memory accesses to the texture memory using the NCE Memory Trapping API and synchronizing only when required.	2022-04-14 14:14:52 +05:30
PixelyIon	3e33d49faf	Implement NCE Memory Trapping API An API for trapping accesses to guest memory and performing callbacks based on those accesses alongside managing protection of the memory. This is a fundamental building block for avoiding redundant synchronization of resources from the guest and host. Note: All accesses are treated as write accesses at the moment, support for picking up read accesses will be implemented later	2022-04-14 14:14:52 +05:30
PixelyIon	08510d75b0	Implement Interval Map An interval map is a crucial piece of infrastructure required for memory faulting to track any regions that have an associated callback and their protection. Additionally, efficient page-aligned lookups with semantics optimal for memory faulting are also a requirement and the ability to associate multiple regions with a single callback/protection entry rather than doing so on a per-region basis as we deal with split-mapping resources.	2022-04-14 14:14:52 +05:30
PixelyIon	5c9e42e384	Use mirror mappings for Textures and Buffers This is a prerequisite to memory trapping as we need to write to the mirror to avoid a race condition with external threads writing to a texture/buffer while we do so ourselves for the sync on a read/write, it also avoids an additional `mprotect` to `-WX`/`RWX` on a read access. An additional advantage for textures especially is that we now support split-mapping textures due to laying them out in a contiguous mirror and they will not require costly algorithmic changes. Buffers should also benefit from not needing to iterate over every region when they are split into multiple mappings.	2022-04-14 14:14:52 +05:30
PixelyIon	577a67babd	Support mirrors of multiple non-contiguous memory regions `CreateMirror` is limited to creating a mirror of a single contiguous region which does not work when creating a contiguous mirror of multiple non-contiguous regions. To support this functionality, `CreateMirrors` which expects a list of page-aligned regions and maps them into a contiguous mirror.	2022-04-14 14:14:52 +05:30
PixelyIon	e35ab6d1e0	Move to mapping guest AS as shared memory We want to create arbitrary mirrors in the guest address space and to make this possible, we map the entire address space as a shared memory file. A mirror is mapped by using `mmap` with the offset into the guest address space.	2022-04-14 14:14:52 +05:30
Billy Laws	a5dd961f01	Add support for batched method sending Important for constbuf updates which would be very slow if done one at a time.	2022-04-14 14:14:52 +05:30
Robin Kertels	43879e2476	Round up when calculating size of compressed texture in bytes	2022-04-14 14:14:52 +05:30
Robin Kertels	d889550e84	Don't set COLOR_ATTACHMENT_BIT for compressed formats. The better solution would be to only set this for formats that support it on original HW but this will get rid of the validation errors for now.	2022-04-14 14:14:52 +05:30
Robin Kertels	82296ac5b8	Use buffer size instead of allocation size for Buffer constructor Fixes a validation error.	2022-04-14 14:14:52 +05:30
Robin Kertels	752245c3c8	Enable provoking vertex feature	2022-04-14 14:14:52 +05:30
Robin Kertels	dd45d054e7	Enable shaderDrawParameters	2022-04-14 14:14:52 +05:30
Billy Laws	7e16c1f989	Heavily optimise GPFIFO command dispatch to reduce redundant checks Previously for methods with count > 1 the subchannel and engine would be looked up for each part of the method rather than only doing so at the start. Each call also needed to be looked up to see if it touched a macro or GPFIFO method. Fix this by doing checks outside of the main dispatch loop with templated helper lambdas to avoid needing to repeat lots of code. Maxwell3D is the only subchannel with a fast path for now but more can be added later if needed.	2022-04-14 14:14:52 +05:30
Billy Laws	b4927d0138	Add support for turnip and driver file redirection via libadrenotools	2022-04-14 14:14:52 +05:30
Billy Laws	dd91d063a5	Pass native library dir to OS + reorder OS init order so paths are first This is required for integrating libadrenotools, which needs access to library and app directories in the GPU class constructor.	2022-04-14 14:14:52 +05:30
Billy Laws	011de98940	Rework formats to support passing through guest swizzle values Almost every Maxwell format now directly corresponds to a Vulkan format. This allows formats to be passed through and the swizzle used directly from guest (with some extra swizzle handling for edge cases) thus saving the need to explicitly support each swizzle combination which is adds a lot of code bloat. The format header is additionally reordered with line breaks to separate formats by their bits-per-block.	2022-04-14 14:14:52 +05:30
Billy Laws	6f17d1351f	Fixup ordering for B10G11R11Float texture format	2022-04-14 14:14:52 +05:30
Billy Laws	78238d550a	Add 6 channel downmixing support for Audout The specific attenuations used for each channel are taken from Ryujinx.	2022-04-14 14:14:52 +05:30
Billy Laws	2e1a1a965d	Fixup AudioTrack locking	2022-04-14 14:14:52 +05:30
PixelyIon	727f83e969	Fix Incorrect Vertex Binding Divisor State Submission We always submit pipeline divisor descriptions regardless of binding input rate being vertex rather than instance. This is invalid behavior and has been fixed by only submitting binding descriptors when the input rate is per-instance.	2022-04-14 14:14:52 +05:30
PixelyIon	9f7e80cf8f	Fix Adreno Texture Sampler Binding Bug Adreno proprietary drivers suffer from a bug where `VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER` requires 2 descriptor slots rather than one, we add a padding slot to fix this issue. `QuirkManager` was introduced to handle per-vendor/per-device errata and allow enabling this on Adreno proprietary drivers specifically as to not affect the performance of other devices.	2022-04-14 14:14:52 +05:30
PixelyIon	ddb2ba8a1b	Rename `QuirkManager` to `TraitManager` Quirk terminology was deemed to be inappropriate for describing the features/extensions of a device. It has been replaced with traits which is far more fitting but quirks will be used as a terminology for errata in devices.	2022-04-14 14:14:52 +05:30
PixelyIon	0b2ce6a8f3	Fix Texture Handle Offset Calculation The texture handle offset calculation involved an incorrect shift by descriptor size which was found to be unnecessary and would result in an invalid handle that had the wrong TIC/TSC index and caused broken rendering.	2022-04-14 14:14:52 +05:30
PixelyIon	aa57ec6d55	Destroy `CommandExecutor` Nodes Before Waiting on Execution `nodes` and `syncTextures` were cleared after waiting on the `CommandExecutor` fence rather than before, this wasted execution time after the wait for something that could be performed prior to the wait.	2022-04-14 14:14:52 +05:30
PixelyIon	90a1b3348c	Implement D24S8 + R11G11B10 Formats	2022-04-14 14:14:52 +05:30
PixelyIon	bd718175ce	Enable `VK_KHR_uniform_buffer_standard_layout` when available We now attempt to enable `VK_KHR_uniform_buffer_standard_layout` when present as lax UBO layout significantly reduces complexity. If a device doesn't support this extension, we still assume that the device supports it implicitly as this has proven to be true across all major mobile GPU vendors regardless of the driver version but enabling this prevents validation layer errors.	2022-04-14 14:14:52 +05:30
PixelyIon	22ce531e6f	Force Memory Barrier at `VkRenderPass` Start We depend on past commands to have completed execution in a renderpass, a subpass dependency on all graphics stages from `VK_SUBPASS_EXTERNAL` to subpass #0 is used to enforce this. Nvidia and Adreno proprietary drivers implicitly do this but Turnip or Mali drivers require this or they execute out of order.	2022-04-14 14:14:52 +05:30
PixelyIon	35fde2cd0b	Rework Blocklinear Texture Deswizzling Blocklinear texture decoding was broken for padding blocks and would incorrectly decode them resulting in major texture corruption for any textures with their widths not aligned to 64 bytes. This has now been fixed with neater code which avoids redundant repetition of any code using lambdas and functions where necessary.	2022-04-14 14:14:52 +05:30
PixelyIon	043be4d8f7	Implement Maxwell3D Two-Side Stencil Toggle Stencil operations are configurable to be the same for both sides or have independent stencil state for both sides. It is controlled via the previously unimplemented `stencilTwoSideEnable`.	2022-04-14 14:14:52 +05:30
PixelyIon	80ae7b255a	Implement Maxwell3D Front Face Flip	2022-04-14 14:14:52 +05:30
PixelyIon	40a3887695	Implement Maxwell3D Viewport Y Swizzle & Lower-Left Origin	2022-04-14 14:14:52 +05:30
Billy Laws	3be30e68c3	Add D16 depth format and ZF32 TIC format Used by One Piece Unlimited World Red	2022-04-14 14:14:52 +05:30
Billy Laws	be007c4ccc	Fixup texture swizzling to actually function Before this we were not applying the supplied swizzles, will be superseeded in the future by using guest swizzle values.	2022-04-14 14:14:52 +05:30
Billy Laws	6e48460c0d	Add BC2/3 format support	2022-04-14 14:14:52 +05:30
Billy Laws	2253bc3151	Reorder GPU quirks member to prevent it constructing after device init	2022-04-14 14:14:52 +05:30
Billy Laws	62db21fb78	Rework GPFIFO method distribution and macros to support multiple engines Fermi2D supports macros in addition to Maxwell3D, these both share code memory. To support this we rework the macro interpreter to support passing in a target engine and abstract the communications out into an interface that can be implemented by applicable engines. ``` GPFIFO <-> MME <-> Maxwell3D ^ ^---> Fermi2D X------------> I2M X------------> MaxwellComputeB X--Flush-----> MaxwellDMA ```	2022-04-14 14:14:52 +05:30
Billy Laws	8d5463ef28	Drop engine base class usage from GPFIFO This class does nothing since we made stopped GPFIFO submits from using virtual functions so it can be dropped.	2022-04-14 14:14:52 +05:30
PixelyIon	41aad83c33	Tie Shader `ObjectPool` Lifetime to Shader `Program` Shader programs allocate instructions and blocks within an `ObjectPool`, there was a global pool prior that was never reaped aside from on destruction. This led to a leak where the pool would contain resources from shader programs that had been deleted, to avert this the pools are now tied to shader programs.	2022-04-14 14:14:52 +05:30
PixelyIon	e747de37cf	Implement Blocklinear TIC Type	2022-04-14 14:14:52 +05:30
PixelyIon	723189a948	Calculate Blocklinear Texture Aligned Size Correctly The size of blocklinear textures did not consider alignment to Block/ROB boundaries before, it is aligned to them now. Incorrect sizes led to textures not being aliased correctly due to different size calculations for GraphicBufferProducer surfaces and Maxwell3D color RTs.	2022-04-14 14:14:52 +05:30
Billy Laws	95685b8207	Avoid iterator invalidation segfault when unregistering a syncpt waiter erase invalidated `it` leading to a potential segfault if the GPU was very far behind, bail out early to avoid that since there can only be one occurence at most in the buffer anyway.	2022-04-14 14:14:52 +05:30
Billy Laws	e7bfd93541	Implement BC7 format support Used by ARMS	2022-04-14 14:14:52 +05:30
Billy Laws	99652c5eda	Support partially mapped cbufs Buggy games sometimes supply an incorrect cbuf size so limit buffers to the first unmapped region.	2022-04-14 14:14:52 +05:30
PixelyIon	6a6f51ea84	Implement Maxwell3D Depth/Stencil State Implements the entirety of Maxwell3D Depth/Stencil state for both faces including compare/write masks and reference value. Maxwell3D register `stencilTwoSideEnable` is ignored as its behavior is unknown and could mean the same behavior for both stencils or the back facing stencil being disabled as a result of this it is unimplemented.	2022-04-14 14:14:52 +05:30
PixelyIon	9f5c3d8ecd	Force Textures to be Optimal on Host GPU We don't respect the host subresource layout in synchronizing linear textures from the guest to the host when mapped to memory directly, this leads to texture corruption and while the real fix would involve respecting the host subresource layout, this has been deferred for later as real world performance advantages/disadvantages associated with this change can be observed more carefully to determine if it's worth it.	2022-04-14 14:14:52 +05:30
Billy Laws	ab4962c4e4	Implement additional texture formats, including BCn BCeNabler is required for BCn textures, the pre-swizzled formats will be removed when arbitary swizzle support is added later.	2022-04-14 14:14:52 +05:30
Billy Laws	600b94505c	Fix A2R10G10B10 render target format This was wrongly described as R10G10B10A2 in the enum when it's actually A2R10G10B10, a format natively supported in Vulkan with just a swizzle.	2022-04-14 14:14:52 +05:30
Billy Laws	175ba11f07	Integrate BCeNabler support into QuirkManager Allows using BCn format textures on devices where they are unsupported by the driver.	2022-04-14 14:14:52 +05:30
Billy Laws	47d920d91e	Make GPU private static functions file-local	2022-04-14 14:14:52 +05:30
PixelyIon	edd51c3dfa	Fix Color RT Disabling Bug Color RTs are disabled by setting their format as `None`, it was removed while transitioning to macros and resulted in a missing format exception. It has been readded as several applications depend on this behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	a2285669b3	Use static vector for shader bytecode to prevent constant reallocation Using `std::vector` for shader bytecode led to a lot of reallocation due to constant resizing, switching over the static vector allows for a single static allocation of the maximum possible guest shader size (1 MiB) to be done for every stage resulting in a 6 MiB preallocation which is unnoticeable given the total memory overhead of running a Switch application.	2022-04-14 14:14:52 +05:30
PixelyIon	21a6866def	Fix Maxwell3D Blend Enum Conversion Bugs The `OneMinusSourceAlpha` blending factor was converted to `eOneMinusSrcColor` rather than `eOneMinusSrcAlpha` leading to incorrect blending behavior in certain titles. A similar issue with the order of `MinimumGL`/`MaximumGL` and `SubtractGL`/`ReverseSubtractGL` being the opposite of what it should've been, both of these issues have been fixed.	2022-04-14 14:14:52 +05:30
PixelyIon	0a506088f4	Fix `NextSubpassNode` Subpass Index Bug `NextSubpassNode` didn't increment `subpassIndex` which runs commands with the wrong subpass index resulting in them accessing invalid attachments or other bugs that may arise from using the wrong subpass.	2022-04-14 14:14:52 +05:30
PixelyIon	defbfe8f78	Serialize Maxwell3D Draw State for Subpass All Maxwell3D state was passed by reference to the draw command lambda, this would break if there was more than one pass or the state was changed in any way before execution. All state has now been serialized by value into the draw command lambda capture, retaining state regardless of mutations of the class state.	2022-04-14 14:14:52 +05:30
PixelyIon	934130b3e6	Remove Implicit Command Executor Resource Attachment Any usage of a resource in a command now requires attaching that resource externally and will not be implicitly attached on usage, this makes attaching of resources consistent and allows for more lax locking requirements on resources as they can be locked while attaching and don't need to be for any commands, it also avoids redundantly attaching a resource in certain cases.	2022-04-14 14:14:52 +05:30
PixelyIon	f0e9c42097	Fix Fence Cycle Double Insertion Lifetime Bug If an object is attached to a `FenceCycle` twice then it would cause `FenceCycleDependency::next` to be overwritten and lead to destruction of dependencies prior to the fence being signaled causing usage of deleted resources. This commit fixes this by tracking what fence cycle a dependency is currently attached to and doesn't reattach if it's already attached to the current fence cycle.	2022-04-14 14:14:52 +05:30
PixelyIon	6a831f6ed7	Add `VK_EXT_shader_demote_to_helper_invocation` Quirk An assumption was hardcoded into `Shader::Profile` regarding devices supporting demotion of shader invocations to helpers. This assumption wasn't backed by enabling the `VK_EXT_shader_demote_to_helper_invocation` extension via a quirk leading to assertions when it was used by the shader compiler, a quirk has now been added for the extension and is supplied to the shader compiler accordingly.	2022-04-14 14:14:52 +05:30
Billy Laws	3ff8075151	Move vertex and RT format conv to macros and fill them fully in Makes the format conversions easier to read and shorter, and adds in some new formats needed to complete the RT table properly.	2022-04-14 14:14:52 +05:30
PixelyIon	98b366c1f5	Fix Texture Synchronization Bug Fixes texture corruption due to incorrect synchronization, the barrier would not enforce waiting till the texture was entirely rendered causing an incomplete texture to be downloaded which lead to rendering bugs for certain GPUs including ARM's Mali GPUs.	2022-04-14 14:14:52 +05:30
PixelyIon	aea40e6496	Fix `enabledFeature2` Unlinking Assertion Bug A bug caused an assertion if both `VK_EXT_custom_border_color` and `VK_EXT_vertex_attribute_divisor` due to mistakenly unlinking `PhysicalDeviceVertexAttributeDivisorFeaturesEXT` instead of `PhysicalDeviceCustomBorderColorFeaturesEXT` when `VK_EXT_custom_border_color` isn't supported which would potentially lead to unlinking the same structure twice and cause the assertion.	2022-04-14 14:14:52 +05:30
Billy Laws	68f31c3688	Use macros for defining texture formats and their conversions Avoids the need to repeat all the possible component types for each texture format while also making them simpler to add and easier to read.	2022-04-14 14:14:52 +05:30
PixelyIon	bc29b23972	Implement CPU-only Maxwell3D Inline Constant Buffer Updates Implements inline constant buffer updates that are written to the CPU copy of the buffer rather than generating an actual inline buffer write, this works for TIC/TSC index updates but won't work when the buffer is expected to actually be updated inline with regard to sequence rather than just as a buffer upload prior to rendering. GPU-sided constant buffer updates will be implemented later with optimizations for updating an entire range by handling GPFIFO `Inc`/`NonInc`directly and submitting it as a host inline buffer update.	2022-04-14 14:14:52 +05:30
PixelyIon	08f29f7da4	Make `ActiveDescriptorSet` movable and non-copyable There should only ever be a single instance of a `ActiveDescriptorSet` that tracks the lifetime of a descriptor set as the destructor is responsible for freeing the descriptor set. There are cases where a new object inheriting the descriptor set needs to be created in these cases we need to have move semantics and make the destructor of the prior object inert, this allows for moving to the new object without any side effects. If the copy constructor was used in these cases the older object would free the set on its destruction which would lead to the set being invalid on existing instances which is incorrect behavior and would likely lead to driver crashes.	2022-04-14 14:14:52 +05:30
PixelyIon	bb14af4f7a	Implement Maxwell3D Sampled Textures The descriptor sets should now contain a combined image and sampler handle for any sampled textures in the guest shader from the supplied offset into the texture constant buffer. Note: Games tend to rely on inline constant buffer updates for writing the texture constant buffer and due to it not being implemented, the value will be read as 0 which is incorrect.	2022-04-14 14:14:52 +05:30
PixelyIon	d9a9e52350	Use `ConstantBuffer` instead of `BufferView` for Shader Constant Buffers We want read semantics inside the constant buffer object via the mappings to avoid a pointless GPU VMM mapping lookup. It is a fairly frequent operation so this is necessary, the ability to write directly will be added in the future as well.	2022-04-14 14:14:52 +05:30
PixelyIon	adb0a16873	Implement Maxwell 3D Textures Implements parsing for the Maxwell 3D TIC pool and conversion of a TIC into a `GuestTexture`, support is limited to pitch-linear RGB565/A8R8G8B8 textures at the moment but will be extended as games utilize more formats and layouts. Support for 1D buffers is also omitted at the moment since they need special handling with them effectively being treated as buffers in Vulkan rather than images.	2022-04-14 14:14:52 +05:30
PixelyIon	a7b90e7825	Change Texture Pitch Unit to Bytes from Pixels The pitch of the texture should always be supplied in terms of bytes as it denotes alignment on a byte boundary rather than a pixel one, it is also always utilized in terms of bytes rather than pixels so this avoids an unnecessary conversion. Note: GBP stride unit was assumed to be pixels earlier but is likely bytes which is why there are no changes to the supplied value there, if this is not the case it'll be fixed in the future	2022-04-14 14:14:52 +05:30
PixelyIon	a9aa16798f	Add `-fsigned-bitfields` for defined bitfield `int` behavior We want consistent behavior between signed `int`s in bitfields and outside of bitfields, the `-fsigned-bitfields` flag enforces this behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	87c8dc94d2	Implement Maxwell3D Samplers Maxwell3D `TextureSamplerControl` (TSC) are fully converted into Vulkan samplers with extension backing for all aspects that require them (border color/reduction mode) and approximations where Vulkan doesn't support certain functionality (sampler address mode) alongside cases where extensions may not be present (border color).	2022-04-14 14:14:52 +05:30
PixelyIon	e48a7d7009	Fix Mapping Caching For Maxwell 3D Buffers Code involving caching of mappings was copied from `RenderTarget` without much consideration for applicability in buffers, the reason for caching mappings in RTs was that the view may be invalidated by more than the IOVA/Size being changed but this doesn't hold true for buffers generally so invalidation can only be on the view level with the mappings being looked up every time since the invalidation would likely change them.	2022-04-14 14:14:52 +05:30
PixelyIon	ff27dce24c	Implement `ObjectHash` for hashing trivial objects in maps `std::hash` doesn't have a generic template where it can be utilized for arbitrary trivial objects and implementing this might result in conflicts with other types. To fix this a generic templated hash is now provided as a utility structure, that can be utilized directly in hash-based containers such as `unordered_map`.	2022-04-14 14:14:52 +05:30
PixelyIon	97cfcba0da	Add Nullability for Optional Semantics to `span` Nullability allow for optional semantics where a span may be explicitly invalidated with `nullptr` being used as a sentinel value for it and a boolean operator that allows trivial checking for if the span is valid or not.	2022-04-14 14:14:52 +05:30
PixelyIon	c11962e8e4	Implement Maxwell3D Bindless Texture Constant Buffer Index The index of the constant buffer with bindless texture descriptors is now retrieved from Maxwell3D register state and passed to the shader compiler.	2022-04-14 14:14:52 +05:30
PixelyIon	1c3f62b7b4	Implement Maxwell3D Indexed Drawing	2022-04-14 14:14:52 +05:30
PixelyIon	23cdfe2139	Implement Maxwell3D Index Buffers Adds support for index buffers including U8 index buffers via the `VK_EXT_index_type_uint8` extension which has been added as an optional quirk but an exception will be thrown if the guest utilizes it but the host doesn't support it.	2022-04-14 14:14:52 +05:30
PixelyIon	a4041364e1	Address CR comments Note: CR comments regarding `ShaderSet` and `PipelineStages` will be addressed at a later date with a common class for associative enum arrays.	2022-04-14 14:14:52 +05:30
PixelyIon	e1e14e781f	Support Dual Vertex Shader Programs Add support for parsing and combining `VertexA` and `VertexB` programs into a single vertex pipeline program prior to compilation, atomic reparsing and combining is supported to only reparse the stage that was modified and recombine once at most within a single pipeline compilation.	2022-04-14 14:14:52 +05:30
PixelyIon	974cf03c18	Add Atomic Pipeline Stage Invalidation Atomically invalidate pipeline stages as runtime information that pertains to them changes rather than never recompiling pipelines on runtime information being updated resulting in out of date pipelines or recompiling all pipelines on any runtime information updates.	2022-04-14 14:14:52 +05:30
PixelyIon	5414db8411	Rework Maxwell3D Shader/Pipeline Stages Compilation with UBO support Shader compilation is now broken into shader program parsing and pipeline shader compilation which will allow for supporting dual vertex shaders and more atomic invalidation depending on runtime state limiting the amount of work that is redone. Bindings are now properly handled allowing for bound UBOs to be converted to the appropriate host UBO as designated by the shader compiler by creating Vulkan Descriptor Sets that match it.	2022-04-14 14:14:52 +05:30
PixelyIon	055d315048	Seperate Maxwell3D Stages into Shader/Pipeline We need this to make the distinction between a shader and pipeline stage in as shader programs are bound at a different rate than that of pipeline stage resources such as UBO.	2022-04-14 14:14:52 +05:30
PixelyIon	492dd47218	Implement Vulkan Descriptor Set Allocator A fixed descriptor set allocator which manages the size of the pool with automatic reallocations when any allocations run out of descriptors.	2022-04-14 14:14:52 +05:30
PixelyIon	9af9f1d41a	Implement Maxwell3D Constant Buffer Selector The Constant Buffer Selector is used to point to a constant buffer that will be bound to a shader stage or updated with inline data.	2022-04-14 14:14:52 +05:30
PixelyIon	afa34e320a	Retain Shader Binding State Across Stages An instance of `Shader::Backend::Bindings` must be retained across all stages for correct emission of bindings, which is now done inside `GraphicsContext::GetShaderStages`.	2022-04-14 14:14:52 +05:30
PixelyIon	550d12b7fa	Set Shader Runtime Generic Vertex Attribute Types Correctly The vertex attribute types supplied prior were just the default which is `Float`, this works for some cases but will entirely break if the attribute type isn't a float. The attribute types are now set correctly.	2022-04-14 14:14:52 +05:30
PixelyIon	a2de6b9255	Fix Maxwell3D `vertexEndGl` Register Offset The offset was set to 0x586 which is the location of `vertexBeginGl`, it's been corrected now and set to 0x585.	2022-04-14 14:14:52 +05:30
Billy Laws	5815cda7a7	Update Vulkan-Hpp to v1.2.202	2022-04-14 14:14:52 +05:30
PixelyIon	bd6cd0056c	Support Multi-Aspect Copy in `Texture::CopyIntoStagingBuffer` Only copying a single aspect was supported by `CopyIntoStagingBuffer` earlier due to not supplying a `VkBufferImageCopy` for each aspect separately, this has now been done with Color/Depth/Stencil aspects having their own `VkBufferImageCopy` for the `VkCmdCopyImageToBuffer` command.	2022-04-14 14:14:52 +05:30
PixelyIon	daff17c776	Order `TextureView` Definition Correctly The definition of the `TextureView` class was spread across `texture.cpp` and has now been moved to the top of the file above the other half of the definition.	2022-04-14 14:14:52 +05:30
PixelyIon	189b9533f2	Disable Vertex Buffers With 0 as IOVAs A buffer with 0 as the start/end IOVA should be invalid as there shouldn't be any mappings at 0 in GPU VA, titles such as Puyo Puyo Tetris configure the Vertex Buffer with 0 IOVAs which leads to a segmentation fault without this exception.	2022-04-14 14:14:52 +05:30
PixelyIon	cfeb8098db	Attach `TextureView`/`BufferView` Lifetime to `FenceCycle` The lifetime of a texture and buffer view is now bound by the `FenceCycle` in `CommandExecutor`, this ensures that a `VkImageView` isn't destroyed prior to usage leading to UB.	2022-04-14 14:14:52 +05:30
PixelyIon	34fc1e32b8	Remove `Texture`s from `RenderPassNode::Storage` The lifetime of all textures bound to a RenderPass alongside syncing of textures is already handled by `CommandExecutor` and doesn't need to be redundantly handled by `RenderPassNode`. It's been removed as a result of this.	2022-04-14 14:14:52 +05:30
PixelyIon	45c7a89fc3	Cleanup `BufferView`/`TextureView` Locking Code Renames the variable to be neater and less confusing alongside adding comments for `try_lock()` to make the goal of the function more apparent.	2022-04-14 14:14:52 +05:30
PixelyIon	7776ef2cd0	Support Depth/Stencil RT in Draw Adds the depth/stencil RT as an attachment for the draw but with `VkPipelineDepthStencilStateCreateInfo` stubbed out, it'll not function correctly and the contents will not be what the guest expects them to be.	2022-04-14 14:14:52 +05:30
PixelyIon	525850ae09	Stub `VkPipelineDepthStencilStateCreateInfo` Maxwell3D Depth State is composed of several registers and will be implemented at a later date, for the time being it's been stubbed.	2022-04-14 14:14:52 +05:30
PixelyIon	9e63ecf05d	Implement Maxwell3D Depth/Stencil Clears Support for clearing the depth/stencil RT has been added as its own function via either optimized `VkAttachmentLoadOp`-based clears or `vkCmdClearAttachments`. A bit of cleanup has also been done for color RT clears with the lambda for the slow-path purely calling the command rather than creating the parameter structures.	2022-04-14 14:14:52 +05:30
PixelyIon	bf89f96bf5	Implement Optimized LoadOp Clears for Depth/Stencil Attachments Implements `AddClearDepthStencilSubpass` in `CommandExecutor` which is similar to `ClearColorAttachment` in that it uses `VK_ATTACHMENT_LOAD_OP_CLEAR` for the clear which is far more efficient than using `VK_ATTACHMENT_LOAD_OP_LOAD` then doing the clear.	2022-04-14 14:14:52 +05:30
PixelyIon	6f6413f02d	Fix `VkSubpassDependency` for Depth/Stencil Attachments The stage/access mask for `VkSubpassDependency` were hardcoded to only be valid for color attachments earlier, this has now been fixed by branching based on the format aspect.	2022-04-14 14:14:52 +05:30
PixelyIon	aa32f6b017	Add Depth/Stencil Format Support to `Texture` Sets `VkImageUsageFlags` correctly rather than hardcoding it for color attachments and adds multiple `VkBufferImageCopy` to `VkCmdCopyBufferToImage` for Color/Depth/Stencil aspects of an image.	2022-04-14 14:14:52 +05:30
PixelyIon	68c990c041	Implement Maxwell3D Depth/Stencil Render Target Support the Maxwell3D Depth RT for Z-buffering, this just creates an equivalent `RenderTarget` object with no support on the API-user side (IE: `Draw` and `ClearBuffers`).	2022-04-14 14:14:52 +05:30
PixelyIon	2a8bcc60c7	Make Render Targets Abstract for Color/Depth RTs This prefixes all RT functions that deal with color RTs with `Color` and abstracts out common functions that will be used for both color and depth RTs. All common Maxwell3D structures are also moved out of the `ColorRenderTarget` (`RenderTarget` previously) structure.	2022-04-14 14:14:52 +05:30
PixelyIon	b0f084ae32	Implement Shader Compiler Input Topology Sets the input toplogy in the runtime information for the shader compiler correctly based on the Maxwell3D input topology.	2022-04-14 14:14:52 +05:30
PixelyIon	7a63ad7d3d	Implement `VkPipelineCache` for host pipeline caching To allow for caching of pipelines on the host a `VkPipelineCache` has been added, it is entirely in-memory and is not flushed to the disk which'll be done in the future alongside caching guest shaders to further avoid translation where possible.	2022-04-14 14:14:52 +05:30
PixelyIon	4dcf12c4c0	Implement Maxwell3D Draws Uses all Maxwell3D state converted into Vulkan state to do an equivalent draw on the host GPU, it sets up RT/Vertex Buffer/Vertex Attribute/Shader state and creates a stubbed out `VkPipelineLayout` for the draw. Any descriptor state isn't currently handled and is yet to be implemented, currently there's no Vulkan pipeline cache supplied which will be implemented subsequently.	2022-04-14 14:14:52 +05:30
PixelyIon	57b0d6a2fb	Stub `VkPipelineMultisampleStateCreateInfo` Multisampling will be worked on later and for the time being is being safely stubbed by setting the sample count to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	56b3a01a59	Track `VkRenderPass` and Subpass Index for Subpass Function Nodes We require a handle to the current renderpass and the index of the subpass in certain cases, this is now tracked by the `CommandExecutor` and passed in as a parameter to `NextSubpassFunctionNode` and the newly-introduced `SubpassFunctionNode`.	2022-04-14 14:14:52 +05:30
PixelyIon	cb7f68b98d	Allow Attaching Texture/Buffers to `CommandExecutor` Switch from `SubmitWithCycle` to manually allocating the active command buffer to tag dependencies with the `FenceCycle` that prevents them from being mutated prior to execution. This new paradigm could also allow eager recording of commands with only submission being deferred.	2022-04-14 14:14:52 +05:30
PixelyIon	aeea3e6f66	Allow manual allocation of `ActiveCommandBuffer` `CommandScheduler` API users can now directly allocate an active command buffer that they need to manage alongside its fence, this can allow for more efficient recording as it doesn't need to be immediately submitted after, it can also allow attaching objects to a `FenceCycle` prior to submission that can be useful for locking resources.	2022-04-14 14:14:52 +05:30
PixelyIon	8989305637	Implement Host Vertex Buffer Translation Uses the buffer cache to retrieve an equivalent host vertex buffer for a corresponding guest vertex buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	b6ba770a27	Implement Maxwell3D Shader Compilation Compiles shaders supplied by the guest with caching and automatic invalidation, the size of the shader is also automatically determined by looking for `BRA $` instructions which cause an infloop, it should be noted that we have a maximum shader bytecode size, any shader above this size will not be supported.	2022-04-14 14:14:52 +05:30
PixelyIon	08afda6ac4	Implement Graphics Shader Compilation in `ShaderManager` Graphics shaders can now be compiled using the shader compiler and emit SPIR-V that can be used on the host. The binding state isn't currently handled alongside constant buffers and textures support in `GraphicsEnvironment` yet.	2022-04-14 14:14:52 +05:30
PixelyIon	353ca8ec84	Fix Viewport X/Y Translation The operands of the subtraction in the X/Y translation calculation were the wrong way around which led to negative translations that would translate the viewport off the screen.	2022-04-14 14:14:52 +05:30
PixelyIon	f06a12170f	Set Default Color Write Mask to RGBA The default color write mask should mask no channels and write all of them and should be mutated to mask out certain channels as required by the guest.	2022-04-14 14:14:52 +05:30
PixelyIon	23faf1370c	Use Static Arrays for Vertex Buffer Bindings & Attributes We cannot statically construct the vertex buffer/attribute arrays for Vulkan due to inactive attributes or buffers which isn't possible on Vulkan, we also cannot just change the count dynamically as there might be disabled buffers or attributes in the middle. We just have a `static_array` which should dynamically be filled in with buffer binding/attribute Vulkan structures before submission.	2022-04-14 14:14:52 +05:30
PixelyIon	8652edb07b	Make `GuestBuffer` format-less Buffers generally don't have formats that are fundamentally associated with them unless they're texel buffers, if that is the case it can be manually set in `BufferView`.	2022-04-14 14:14:52 +05:30
PixelyIon	03314ec7d2	Introduce `BufferManager` The Buffer Manager handles mapping of guest buffers to host buffer views with automatic handling of sub-buffers and eventually supporting recreation of overlapping buffers to create a single larger buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	bde61d72cc	Introduce `Buffer` and `BufferView` Implements infrastructure for using guest buffers on the host for rendering, a `BufferManager` is still missing which'd handle mapping from guest buffers to host buffers and will be subsequently committed. It should be noted that `BufferView` is also disconnected from `Buffer` and shared for every instance with the same properties like `TextureView` is now.	2022-04-14 14:14:52 +05:30
PixelyIon	6eda1777c5	Rework `TextureView` to be disconnected from `Texture` We want `TextureView`(s) to be disconnected from the backing on the host and instead represent a specific texture on the guest with a backing that can change depending on mapping of new textures which'd invalidate the backing but should now be automatically repointed to an appropriate new backing. This approach also requires locking of the backing to function as it is mutable till it has been locked or the backing has an attached `FenceCycle` that hasn't been signaled which will be added for `CommandExecutor` in a subsequent commit.	2022-04-14 14:14:52 +05:30
PixelyIon	82916657fb	Only Enable Shader Compiler Debug Mode in Debug Builds Sets properties that relate to debugging in `Shader::Settings` to `true` only for debug builds while leaving them disabled for release builds.	2022-04-14 14:14:52 +05:30
PixelyIon	b09f28c0ba	Implement Missing Shader Compiler Quirks Introduces the `supportsShaderViewportIndexLayer` quirk and sets `Shader::Profile::support_int64_atomics` depending on if the `supportsAtomicInt64` quirk is available.	2022-04-14 14:14:52 +05:30
PixelyIon	f3e81094a2	Implement Shader Compiler Property Quirks Introduces the `floatControls`, `supportsSubgroupVote` and `subgroupSize` quirks for the shader compiler which are based on Vulkan `PhysicalDevice` properties.	2022-04-14 14:14:52 +05:30
PixelyIon	51c4df24b5	Switch from `VK_VERSION_` to `VK_API_VERSION_` macros Vulkan has officially deprecated `VK_VERSION_*` macros for versioning as it has introduced the variant into the version. It should however be `0` for the Vulkan APi and doesn't need to be printed.	2022-04-14 14:14:52 +05:30
PixelyIon	0588a525b4	Implement Shader Compiler Extension/Feature Quirks Introduces several quirks for optional features used by the shader compiler which are now reported in the `Shader::HostTranslateInfo` and `Shader::Profile` structure. There are still property-related quirks for the shader compiler which haven't been implemented in this commit.	2022-04-14 14:14:52 +05:30
PixelyIon	8f3887c56a	Create `memory::Buffer` & Implement `StagingBuffer` as derivative A `Buffer` class was created to hold any generic Vulkan buffer object with `span` semantics, `StagingBuffer` was implemented atop it as a wrapper for `Buffer` that inherits from `FenceCycleDependency` and can be used as such.	2022-04-14 14:14:52 +05:30
PixelyIon	a55aca76c6	Rename `TextureView::backing` to `TextureView::texture` It was determined that `backing` wasn't a very descriptive name and that it conflicted with the texture's own backing, the name was changed to `texture` to make it more apparent that it was specifically the `Texture` object backing the view.	2022-04-14 14:14:52 +05:30
PixelyIon	482c573b81	Introduce `FlatMemoryManager::ReadTill` for scanning semantics A memory manager function to read into a vector till it satisfies the supplied function or hits an early stop condition like hitting the end of vector or reaching an unmapped region. This can be used to efficiently scan for values in GPU VA.	2022-04-14 14:14:52 +05:30
PixelyIon	31c4f1ca4e	Unlink `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when disabled When `VK_EXT_vertex_attribute_divisor` is not available, `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` is unlinked from the device enabled feature list as it is undefined behavior to link a structure provided by an extension without enabling that extension.	2022-04-14 14:14:52 +05:30
PixelyIon	7df2670ece	Fix `QuirkManager`'s `EXT_SET_V` macro bug `EXT_SET_V` would enable the extension regardless of if it was actually the correct extension or if the version was high enough as long as the hash matched. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-04-14 14:14:52 +05:30
PixelyIon	e9ed771b48	Check for `supportsMultipleViewports` feature before usage If the host only supports a single viewport then we set `viewportCount` and `scissorCount` in `VkPipelineViewportStateCreateInfo` to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	3e45006d14	Make `shaderImageGatherExtended` a required `VkDevice` feature `shaderImageGatherExtended` is required by the shader compiler, to avoid complications associated with making it optional and considering that it's supported by the vast majority of Vulkan mobile devices, it was made a mandatory feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ece2785582	Introduce `ShaderManager` with Proxy Shader Compiler Logger/Settings This class will be entirely responsible for any interop with the shader compiler, it is also responsible for caching and compilation of shaders in itself.	2022-04-14 14:14:52 +05:30
PixelyIon	89e9a41a86	Implement `VkPipelineViewportStateCreateInfo` "Viewport Transforms" and "Viewport Scissors" were combined into one section to reflect their state in Vulkan correctly like all other sections.	2022-04-14 14:14:52 +05:30
PixelyIon	38119e21d4	Implement Vulkan-Supported Maxwell3D Primitive Topologies Any primitive topologies that are directly supported by Vulkan were implemented but the rest were not and will be implemented with conversions as they are used by applications, they are: * LineLoop * QuadList * QuadStrip * Polygon	2022-04-14 14:14:52 +05:30
PixelyIon	138f884159	Implement Maxwell3D Vertex Attributes Translates all Maxwell3D vertex attributes to Vulkan with the exception of `isConstant` which causes the vertex attribute to return a constant value `(0,0,0,X)` which was trivial in OpenGL with `glDisableVertexAttribArray` and `glVertexAttrib4(..., 0, 0, 0, 1)` but we don't have access to this in Vulkan and might need to depend on undefined behavior or manually emulate it in a shader. This'll be revisited in the future after checking host GPU behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	4b9f99bb27	Make `ENUM_STRING` function `static` `ENUM_STRING` can be used inside a `class`/`struct`/`union` for `enum`s contained within them. Making the function `static` allows doing this and doesn't require supplying a `this` pointer of the enclosing class for usage.	2022-04-14 14:14:52 +05:30
PixelyIon	c2a6da6431	Implement Maxwell3D Vertex Buffer Limit Sets the end of VBOs based on the `vertexArrayLimits` register array which provides an IOVA to the end of the VBO.	2022-04-14 14:14:52 +05:30
PixelyIon	d8890f13e1	Explicitly make `default` case `break` for `Maxwell3D::HandleMethod` This being made implicit removes any confusion that all cases would need to be implemented and explicitly define that the CF should continue onto the 2nd switch-case when it cannot find any matches in the first one.	2022-04-14 14:14:52 +05:30
PixelyIon	612f324e78	Implement Maxwell3D Vertex Buffer Instance Rate Implements the `isVertexInputRatePerInstance` register array which controls if the vertex input rate is either per-vertex or per-instance. This works in conjunction with the vertex attribute divisor for per-instance attribute repetition of attributes.	2022-04-14 14:14:52 +05:30
PixelyIon	476c070c7a	Fix Minor Maxwell3D Register Ordering Issues We order all registers in ascending order, a few registers namely `colorLogicOp`, `colorWriteMask`, `clearBuffers` and `depthBiasClamp` were erroneously not following this order which has now been fixed.	2022-04-14 14:14:52 +05:30

1 2 3 4 5 ...

757 Commits