skyline

mirror of https://github.com/skyline-emu/skyline.git synced 2024-11-27 10:04:17 +01:00

Author	SHA1	Message	Date
PixelyIon	a947933bf0	Fix `Buffer` cycle check being inverted The check for the fence cycle being the same as the current cycle was incorrectly inverted to be the opposite of what it should have been, leading to bugs.	2022-04-27 13:07:36 +05:30
PixelyIon	54794f4b71	Move `Texture` locking and synchronization to `PresentationEngine` The responsibility for synchronizing a texture and locking it is now on the `PresentationEngine` rather than the API-user as this'll allow more fine grained locking and delay waiting until necessary.	2022-04-25 21:01:16 +05:30
Billy Laws	1dd230afde	Refactor all std::lock_guard usages to std::scoped_lock	2022-04-25 15:00:30 +01:00
PixelyIon	94e6f3cfa0	Add quirk for relaxed render pass compatibility As we require a relaxed version of the Vulkan render pass compatibility clause for caching multi-subpass render passes, we now utilize a quirk to determine if this is supported which it is on Nvidia/Adreno while AMD/Mali where it isn't supported we force single-subpass render passes.	2022-04-24 16:18:36 +05:30
PixelyIon	44615c8dd2	Implement per-vendor `VkQueue` maximum global priority We found out that certain vendors such as Nvidia had a limitation on the global priority of a queue and requesting `VK_QUEUE_GLOBAL_PRIORITY_HIGH_EXT` would result in `VK_ERROR_NOT_PERMITTED_EXT`. A quirk has been introduced to supply the maximum supported global priority which is currently set on a per-vendor basis to avoid future crashes.	2022-04-24 16:15:01 +05:30
PixelyIon	7ef4959060	Implement Graphics Pipeline Cache Implements a cache for storing `VkPipeline` objects which are fairly expensive to create and doing so on a per-frame basis was rather wasteful and consumed a significant part of frametime. It should be noted that this is not compliant with the Vulkan specification and will break unless the driver supports a relaxed version of the Vulkan specification's Render Pass Compatibility clause.	2022-04-24 14:31:00 +05:30
PixelyIon	50a8b69f7b	Optimize descriptor set writes using push descriptors We can use inline push descriptors for writing to descriptor rather than allocating a descriptor set for a one time write and freeing it as this is rather inefficient while an inline push descriptor generally ends up being a direct `memcpy` on the driver side designed for this use-case.	2022-04-24 13:45:09 +05:30
PixelyIon	5adafbff04	Set `VkQueue`'s global priority to high We want Skyline to have the most favorable GPU scheduling possible due to low latency and high throughput requirements, we request high priority scheduling due to this reason.	2022-04-24 13:34:09 +05:30
PixelyIon	f9c052d1b7	Implement Maxwell3D Tessellation State This implements all Maxwell3D registers and HLE Vulkan state for Tessellation including invalidation of the TCS (Tessellation Control Shader) state during state changes.	2022-04-24 13:23:00 +05:30
Billy Laws	de796cd2cd	Implement overhead-free sequenced buffer updates with megabuffers Previously constant buffer updates would be handled on the CPU and only the end result would be synced to the GPU before execute. This caused issues as if the constant buffer contents were changed between each draw in a renderpass (e.g. text rendering) the draws themselves would only see the final resulting constant buffer. We had earlier tried to fix this by using vkCmdUpdateBuffer however this caused significant performance loss due to an oversight in Adreno drivers. We could have worked around this simply by using vkCmdCopyBuffer however there would still be a performance loss due to renderpasses being split up with copies in between. To avoid this we introduce 'megabuffers', a brand new technique not done before in any other Switch emulators. Rather than replaying the copies in sequence on the GPU, we take advantage of the fact that buffers are generally small in order to replay buffers on the GPU instead. Each write and subsequent usage of a buffer will cause a copy of the buffer with that write, and all prior applied to be pushed into the megabuffer, this way at the start of execute the megabuffer will hold all used states of the buffer simultaneously. Draws then reference these individual states in sequence to allow everything to work without any copies. In order to support this buffers have been moved to an immediate sync model, with synchronisation being done at usage-time rather than execute (in order to keep contents properly sequenced) and GPU-side writes now need to be explicitly marked (since they prevent megabuffering). It should also be noted that a fallback path using cmdCopyBuffer exists for the cases where buffers are too large or GPU dirty.	2022-04-23 22:48:28 +01:00
lynxnb	0d9992cb8e	Implement `QuadList` support for non-indexed draws	2022-04-20 18:17:10 +02:00
lynxnb	bcaf7dfe1c	Make `GetVertexBuffer` return a pointer to the requested buffer This avoids a redundancy in the `Draw` function and makes code easier to read	2022-04-20 18:16:45 +02:00
Billy Laws	5c3559e888	Revert "Implement support for GPU-side constant buffer updating" This reverts commit `d79635772f`.	2022-04-18 13:28:58 +01:00
Billy Laws	7bf3580031	Revert "Allow external synchronization for buffers" This reverts commit `372ab8befa`.	2022-04-18 13:28:58 +01:00
PixelyIon	ddc9622b90	Fix Shader Module Cache As bindings weren't correctly handled due to the fact that `EmitSPIRV` would change the bindings, the shader module cache would not correctly function and have no cache hits in `find` and rather have them in `try_emplace` which negated any performance benefit of it. This has now been fixed by retaining the initial cache key for insertion into the cache while also storing the post-emit bindings and restoring them during a cache hit.	2022-04-18 12:18:15 +05:30
Billy Laws	32fe01e145	Implement batch constant buffer updates Avoids spamming the driver with hundreds of cbuf updates per frame by batching all consecutive updates into one.	2022-04-17 00:35:00 +01:00
PixelyIon	02f99273ac	Implement Shader Module Cache Implements caching of the compiled shader module (`VkShaderModule`) in an associative map based on the supplied IR, bindings and runtime state to avoid constant recompilation of shaders. This doesn't entirely address shader compilation as an issue since host shader compilation is tied to Vulkan pipeline objects rather than Vulkan shader modules, they need to be cached to prevent costly host shader recompilation.	2022-04-16 18:45:56 +05:30
PixelyIon	76d8172a35	Implement Shader IR Cache This implements the first step of a full shader cache with caching any IR by treating the shared pointer as a handle and key for an associative map alongside hashing the Maxwell shader bytecode, it supports both single shader program and dual vertex program caching.	2022-04-16 18:45:56 +05:30
PixelyIon	0baa90d641	Implement `SpanEqual` and `SpanHash` We desire the ability to hash and check equality of data across spans to use associative containers such as `std::unordered_map` with spans. The implemented functions provide an easy way to do that.	2022-04-16 18:45:56 +05:30
Billy Laws	df5d1256c2	Implement an object backed IStorage backing This is more convenient and efficient to use when passing structured data out of applets	2022-04-16 18:45:56 +05:30
Billy Laws	d115ce3c05	Stub the controller applet Mostly based off of yuzu's implementation, this will need to be extended in the future to open up a UI for configuring controllers according to the application's requirements.	2022-04-16 18:45:56 +05:30
Billy Laws	9a8e39cba1	Slightly refactor controller code in HID Now uses ranges where possible and a function to get the number of connected controllers has been added.	2022-04-16 18:45:56 +05:30
Billy Laws	2873f11baa	Pass shared pointers by value in applet infrastructure This is more optimal than crefs when used together with std::move	2022-04-16 18:45:56 +05:30
PixelyIon	8ccef733ff	Fix UB with guest-less Texture/Buffers in `MarkGpuDirty` As there was no check for the lack of a `GuestTexture`/`GuestBuffer`, it would lead to UB when a texture/buffer that had no guest such as the `zeroTexture` from `GraphicsContext` would be marked as dirty they would cause a call to `NCE::RetrapRegions` with a `nullptr` handle that would be dereferenced and cause a segmentation fault.	2022-04-16 18:45:56 +05:30
PixelyIon	372ab8befa	Allow external synchronization for buffers In certain situations such as constant buffer updates, we desire to use the guest buffer as a shadow buffer forwarding all writes directly to it while we update the host using inline buffer updates so they happen in-sequence. This requires special behavior as we cannot let any synchronization operations take place as they would break the shadow buffer, as a result, an external synchronization flag has been added to prevent this from happening. It should be noted that this flag is not respected for buffer recreation which will lead to UB, this can and will break updates in certain cases and this change isn't complete without buffer manager support.	2022-04-16 18:44:53 +05:30
PixelyIon	c0c4db68a8	Fix `BufferView` offset not being added in `vkCmdUpdateBuffer` The offset of the view wasn't added to the `vkCmdUpdateBuffer`, this would cause the offset to be incorrect given the buffer was a view of a larger buffer that wasn't the start of it. This commit fixes that by adding the offset of the view to the buffer update.	2022-04-14 18:06:15 +05:30
PixelyIon	a1c06e0401	Mark GPU resources as dirty before GPU usage We didn't call `MarkGpuDirty` on textures/buffers prior to GPU usage, this would cause them to not be R/W protected when they should be and provide outdated copies if there were any read accesses from the CPU (which are not possible at the moment since we assume all accesses are writes at the moment). This has now been fixed by calling it after synchronizing the resource.	2022-04-14 17:20:05 +05:30
PixelyIon	41a6afed01	Fix `GraphicsContext` code formatting for auto formatter	2022-04-14 15:27:22 +05:30
PixelyIon	624df92616	Change `AddNonGraphicsPass` to `AddOutsideRpCommand` The terminology "Non-Graphics pass" was deemed to be fairly inaccurate since it simply covered all Vulkan commands (not "passes") outside the render-pass scope, these may be graphical operations such as blits and therefore it is more accurate to use the new terminology of "Outside-RenderPass command" due to the lack of such an implication while being consistent with the Vulkan specification.	2022-04-14 15:20:22 +05:30
Billy Laws	a31332e35f	Align Maxwell 3D macro newline slashes	2022-04-14 14:14:52 +05:30
Billy Laws	d79635772f	Implement support for GPU-side constant buffer updating Previously constant buffer updates would be handled on the CPU and only the end result would be synced to the GPU before execute. This caused issues as if the constant buffer contents were changed between each draw in a renderpass (e.g. text rendering) the draws themselves would only see the final resulting constant buffer. Fix this by updating cbufs on the GPU/CPU separately, only ever syncing them back at the start or after a guest side CPU write, at the moment only a single word is updated at a time however this can be optimised in the future to batch all consecutive updates into one large one.	2022-04-14 14:14:52 +05:30
Robin Kertels	036faedabd	Implement a way to run non-graphics passes with command executor These commands will end the current renderpass and run on their own, this is useful for compute, blits etc.	2022-04-14 14:14:52 +05:30
Billy Laws	feb179fcff	Implement primitive restart support Maxwell3D also supports using an arbitrary restart index value however no games are known to use this so leave it for now.	2022-04-14 14:14:52 +05:30
Billy Laws	3f3acc31d8	Rework swizzle infrastructure to support arbitrary format swizzles This is required to support R4G4B4A4 which has no directly corresponding Vulkan format. Co-authored-by: Lunar-Pixel <lunarn452@gmail.com>	2022-04-14 14:14:52 +05:30
PixelyIon	6f85a66151	Implement host-only `Buffer`s We require certain buffers to only be on the host while being accessible through the same abstractions as a guest buffer as they must be interchangeable in usage.	2022-04-14 14:14:52 +05:30
Billy Laws	2c697ec36a	Determine depth/stencil texture aspect based off of image swizzle Required since we can't have a non-rt image with both a depth/stencil aspect at the same time according to vk spec.	2022-04-14 14:14:52 +05:30
PixelyIon	1878e582ad	Add `ScopedStackBlocker` to `RomFile.populate` We needed to block stack frame lookups past JNI code as Java doesn't follow the ARMv8 frame pointer ABI which leads to invalid pointer dereferences. Any JNI function that throws or handles exceptions must do this now or it may lead to a `SIGSEGV`.	2022-04-14 14:14:52 +05:30
Billy Laws	68e693d9f4	Fix DMA Engine debug logs to not crash emu Address causes some type issues when printing directly so explicitly cast to u64 first to prevent them.	2022-04-14 14:14:52 +05:30
Billy Laws	8eaca87de8	Use an empty host texture in place of invalid TIC entries on guest Some games may pass empty TICs as inputs to shaders while not actually using them within the shader. Create an empty texture and pass this in instead when we hit this case, the nullDescriptor feature could be used but it's not supported by all devices so we chose to do it this way instead.	2022-04-14 14:14:52 +05:30
PixelyIon	41b98c7daa	Add stack tracing to `skyline::exception` Skyline's `exception` class now stores a list of all stack frames during the invocation of the exception. These can later be parsed by the exception handler to generate a human-readable stack trace. To assist with more complete stack traces, `-fno-omit-frame-pointer` is now passed on debug builds which forces the inclusion of frames on function calls.	2022-04-14 14:14:52 +05:30
PixelyIon	cd8fa66326	Fix NCE Destruction NCE is implicitly depended on by the `GPU` class due to the NCE Memory Trapping API so the destruction of it must take place after the destruction of the `GPU` class. Additionally, to prevent bugs the NCE destructor must set `staticNce` to `nullptr` as the signal handler will potentially access a destroyed instance of NCE otherwise.	2022-04-14 14:14:52 +05:30
Billy Laws	815f1f4067	Add support for sRGB TIC textures Without this sRGB textures would be interpreted as RGB leading to colours being slightly off. The sRGB flag isn't stored as part of format word so we reuse the _pad_ field of it to store the flag for the switch case.	2022-04-14 14:14:52 +05:30
Billy Laws	1ba4abf950	Add Astc{6x6,8x8} and R4G4B4A4 image formats	2022-04-14 14:14:52 +05:30
MCredstoner2004	dec0571eee	Infrastructure for applets to be implemented This removes a stub for an applet and implements several applet related service calls.	2022-04-14 14:14:52 +05:30
PixelyIon	164d4852fa	Sleep-loop rather than abort during termination We don't want to actually exit the process as it'll automatically be restarted gracefully due to a timeout after being unable to exit within a fixed duration so we just want to infinite sleep during termination. This should fix issues where exiting any game would cause the app to force close after some time as exception signal handling would fail in the background, the app should stay open now and automatically restart itself when another game is loaded in.	2022-04-14 14:14:52 +05:30
PixelyIon	ea00f1bb82	Flush emulation logs after exceptions A lot of logs are incomplete due to being unable to flush inside the signal handler, now we flush after any exceptions so that there is a guarantee of any exceptions being logged as this is crucial for proper debugging.	2022-04-14 14:14:52 +05:30
PixelyIon	62ba180550	Use R5G6B5 as Vulkan swapchain format rather than B5G6R5 B5G6R5 isn't generally supported by the swapchain and the format is used for R5G6B5 with swapped R/B channels to avoid aliasing so we reverse that by using R5G6B5 as the underlying Vulkan format for the swapchain which should be automatically handled by the driver for any copies from B5G6R5 textures and the data representation should be the same as B5G6R5 with swapped R/B channels so not reporting the correct texture::Format should be fine.	2022-04-14 14:14:52 +05:30
MK73DS	e54f86e923	Fix IApplicationFunctions::GetDisplayVersion id (https://switchbrew.org/wiki/Applet_Manager_services#IApplicationFunctions)	2022-04-14 14:14:52 +05:30
Billy Laws	77cf33b643	Trigger command executor before DMA copies DMA copies can use textures currently in active use on the GPU as dst/src so Execute before to prevent a deadlock	2022-04-14 14:14:52 +05:30
Billy Laws	dbbc5704d2	Implement DMA engine Block Linear->Linear copies	2022-04-14 14:14:52 +05:30
Billy Laws	3e4e8de1d2	Implement primitive Linear->Block Linear DMA engine copies Slightly inaccurate and misses some features but good enough for most games, should be revisited later.	2022-04-14 14:14:52 +05:30
Billy Laws	3c26921d54	Implement the Maxwell DMA engine The DMA engine is used to perform DMA buffer/texture copies directly on the GPU. It can deswizzle arbitrary regions of input textures, perform component remapping and swizzle into output textures. This impl only supports 1D buffer copies, 2D ones will come later.	2022-04-14 14:14:52 +05:30
Billy Laws	3df76e84c3	Stub IRequest::GetAppletInfo in nifm	2022-04-14 14:14:52 +05:30
Billy Laws	6c5f9941ad	Stub additional IAddOnContentManager functions Used mainly by UE4 games	2022-04-14 14:14:52 +05:30
Billy Laws	486a835d0a	Use guest texture view type to determine the underlying image type If we have a Nx1x1 image then determining the type from dimensions will result in a 1D image being created thus preventing us from creating a 2D view. By using the image view type we can avoid this for textures from TICs since we know in advance how they will be used	2022-04-14 14:14:52 +05:30
Billy Laws	05966f34e5	Stub a pair of ISelfController functions Both used by SMO, SetScreenShotPermission and SetAlbumImageOrientation	2022-04-14 14:14:52 +05:30
Billy Laws	fe37d7c9be	Implement ICommonStateGetter::SetRequestExitToLibraryAppletAtExecuteNextProgramEnabled	2022-04-14 14:14:52 +05:30
Billy Laws	9813f9f8dc	Implement ICommonStateGetter::GetDefaultDisplayResolutionChangeEvent	2022-04-14 14:14:52 +05:30
Billy Laws	7e7c0252ca	Implement IApplicationFunctions::GetDisplayVersion	2022-04-14 14:14:52 +05:30
Billy Laws	b1f10865a0	Attach depth RT to command executor before draws This enforces that the depth RT outlives the draw, without this the depth RT could be freed while in active use by command executor leading to UAFs and crashes.	2022-04-14 14:14:52 +05:30
Billy Laws	0182fabc50	Stub {Set,Get}NpadHandheldActivationMode in HID	2022-04-14 14:14:52 +05:30
Billy Laws	2e197cead5	Support D32S8_Float_Uint_Unorm_Unorm depth/stencil format	2022-04-14 14:14:52 +05:30
Billy Laws	7717a86fb1	Implement VMM region->region copies Required by the DMA engine, a simple memcpy doesn't work since the buffers could span multiple blocks.	2022-04-14 14:14:52 +05:30
Billy Laws	af90d4f977	Implement audren Surround->Stereo downmixing	2022-04-14 14:14:52 +05:30
PixelyIon	ad0005f398	Remove guard-page from main thread stack This was erroneously included while migrating from older code where stack creation was entirely handled with host constructs such as `mmap` directly to using `KPrivateMemory` to manage it, we would create a guard page with `mprotect` that the guest was unaware about and would cause a segfault when a guest accessed the extents of the stack as reported to the guest.	2022-04-14 14:14:52 +05:30
PixelyIon	de81d28b1d	Implement SVC `GetThreadContext3` A partial implementation of the `GetThreadContext3` SVC, we cannot return the whole thread context as the kernel only stores the registers we need according to the ARMv8 ABI convention and so far usages of this SVC do not require the unavailable registers but all future usage must be monitored and potentially require extending the amount of saved registers.	2022-04-14 14:14:52 +05:30
PixelyIon	b706aa3463	Implement SVC `SetThreadActivity` This SVC can pause/resume a thread, it is used by engines like Unity to pause a thread during a GC world stop.	2022-04-14 14:14:52 +05:30
PixelyIon	36a7ad06bd	Use built-in vibrator by default for controller #0 The vibration device had to be set manually prior which led to it generally not being set at all even though a user might want vibration, this commit fixes that by making controller #0 use the built-in vibrator by default.	2022-04-14 14:14:52 +05:30
lynxnb	69ba4f8abb	Swap out boostorg/boost for skyline-emu/boost	2022-04-14 14:14:52 +05:30
PixelyIon	b45437b78b	Move Skyline internal files to external directory Any Skyline files that should have been user-accessible were moved from `/data/data/skyline.emu/files` to `/sdcard/Android/data/skyline.emu/files` as the former directory is entirely private and cannot be accessed without either adb or root. This made retrieving certain data such as saves or loading custom driver shared objects extremely hard to do while this can be trivially done now.	2022-04-14 14:14:52 +05:30
Billy Laws	e5e20f39c9	Implement a simple constant buffer cache In some games such as SMO thousands of constant buffers are bound per frame which was causing an unreasonable number of lookups in both vmm and the buffer manager. Work around this by introducing a simple hashmap based cache, eviction is currently unsupported but not really necessary yet due to the small size of the buffers in the cache.	2022-04-14 14:14:52 +05:30
PixelyIon	cb2614f80e	Handle host accesses for NCE Memory Trapping API We cannot ignore accesses from the host to a region protected by the NCE Memory Trapping API, there's often access to regions which have overlap with a protected region unintentionally and those accesses need to be handled correctly rather than leading to a crash. This is done by implementing an additional signal handler `NCE::HostSignalHandler` to lookup any potential traps on a `SIGSEGV` and handle them correctly or when there isn't a corresponding trap raise a `SIGTRAP` when debugger is connected or delegate to `signal::ExceptionalSignalHandler` when it isn't.	2022-04-14 14:14:52 +05:30
PixelyIon	b04a0c386a	Page out RW-trapped memory in NCE Memory Trapping To cut down memory usage we now page out memory that is RW trapped via the NCE memory trapping API, the callbacks are supposed to page in the memory. This behavior is backed up by Texture/Buffer syncing which would read the host copies of data and write it to the guest, by paging the corresponding data on the guest we're avoiding redundant memory usage.	2022-04-14 14:14:52 +05:30
PixelyIon	344c5f2a62	Implement RAII wrapper over file descriptors The `FileDescriptor` class is a RAII wrapper over FDs which handles their lifetimes alongside other C++ semantics such as moving and copying. It has been used in `skyline::kernel::MemoryManager` to handle the lifetime of the ashmem FD correctly, it wasn't being destroyed earlier which can result in leaking FDs across runs.	2022-04-14 14:14:52 +05:30
PixelyIon	7ce2a903a1	Update LLVM + Oboe Initially this commit was only intended to update LLVM but due to a compilation error on latest LLVM libcxx due to the C++ stdlib header `<algorithm>` being a transitive dependency that is no longer transitively included on the latest LLVM libcxx (as of https://reviews.llvm.org/D119667), this required changes in Skyline and Oboe which were done in https://github.com/google/oboe/pull/1521 and the submodule has been updated to include those changes.	2022-04-14 14:14:52 +05:30
Billy Laws	c549788377	Update shader compiler	2022-04-14 14:14:52 +05:30
Billy Laws	01c027b9f6	Fix GetBlockLinearLayerSize to avoid incorrectly calculating a zero size	2022-04-14 14:14:52 +05:30
PixelyIon	c84badb498	Update NDK (25.0.8221429) + Gradle (7.4.1) + Build Tools (33.0.0)	2022-04-14 14:14:52 +05:30
lynxnb	08e24915d8	Add support for drawing inside the display cutout areas	2022-04-14 14:14:52 +05:30
MK73DS	6e929e6f6a	Stub ICommonStateGetter::SetCpuBoostMode This makes Metroid Dread boot	2022-04-14 14:14:52 +05:30
Billy Laws	d033ff2478	Fix draws when no colour RTs and only depth is bound	2022-04-14 14:14:52 +05:30
Billy Laws	d137051833	Add basic support for 3d/cubemap textures These are mostly used in 3D games like SMO, support is still quite basic and synchronising block linear 3D texture will crash in most cases due to them being unimplemented.	2022-04-14 14:14:52 +05:30
Billy Laws	bcc00216b7	Fix incorrect Bc2/3 block sizes	2022-04-14 14:14:52 +05:30
PixelyIon	7e9b0fec77	Increase reported `audren` revision to 11 Some games crash due to requiring an `audren` version greater than 7. The `audren` version can be increased without any issues as `audren` is stubbed and therefore the reported version doesn't matter.	2022-04-14 14:14:52 +05:30
PixelyIon	e294fa8c91	Add subpass limit quirk to fix Adreno driver bug Older Adreno proprietary drivers (5xx and below) will segfault while destroying the renderpass and associated objects if more than 64 subpasses are within a renderpass due to internal driver implementation details. This commit introduces checks to automatically break up a renderpass when that limit is hit.	2022-04-14 14:14:52 +05:30
PixelyIon	65d5a3bce5	Align all `Buffer`s to page boundary We have support for overlapping buffers which allows us to merge a lot of smaller buffers located on a single page into a single larger buffer which allows for better performance. It additionally ensures that all host buffers match the alignment guarantees of the guest and adequately fulfill host alignment requirements.	2022-04-14 14:14:52 +05:30
PixelyIon	cb1ec9a7f4	Rework `BufferManager`, `Buffer` and `BufferView` This commit encapsulates a complex sequence of cascading changes in the process of supporting overlaps for buffers: * We determined that it is impossible to resolve overlaps with multiple intervals per buffer within the constraints of each overlap being a contiguous view, support for multiple intervals was therefore dropped. The older buffer manager code was entirely reworked to be simpler due to only handling one interval per buffer with code now being based off `IntervalMap` but tailored specifically for buffers. * During overlap resolution, the problem of how existing views into the buffer being recreated would be updated, it had to be replaced with a larger buffer that could contain all overlaps and all existing views would need to be repointed to it. This was addressed by a buffer owning all views to itself, we could automatically recalculate the offset of all views and update the buffers with it. * We still needed to update usage of existing views which was done by handling all access (such as inside a recorded draw) to buffer view properties via `BufferView::RegisterUsage` which dispatches a callback with the view and the corresponding backing buffer. This callback can be stored and called during overlap resolution with the new buffer. * We had issues with lifetime of the buffer with the handle-like semantics of `BufferView` introduced in the last buffer-related commit, if we updated the view to be owned by a new buffer we'd need to extend the lifetime of the new buffer not the older one and the only way to do this was a proxy owner object `BufferDelegate` which holds a shared pointer to the real `Buffer` which in-turn holds a pointer to all `BufferDelegate` objects to update on repointing. A `BufferView` is effectively just a wrapper around `std::shared_ptr<BufferDelegate>` with more favorable semantics but generally just forwarding calls. 
It should be additionally noted that to support usage of `RegisterUsage` the code around buffers in `GraphicsContext` was refactored to defer truly binding till the recording phase.	2022-04-14 14:14:52 +05:30
PixelyIon	a6781b38f4	Clear `syncBuffers` after `CommandExecutor` execution Due to an oversight, we weren't clearing the list of buffers that needed to be synced after every execution which led to them building up. Due to the relatively cheap synchronization of buffers and only doing so on faults this wasn't caught until now, it does depress the framerate significantly over time due to the size of the list growing to be in the range of 100k buffer views depending on the title.	2022-04-14 14:14:52 +05:30
kaikecarlos	49c0ba1207	Implement `IAccountServiceForApplication::IsUserRegistrationRequestPermitted`	2022-04-14 14:14:52 +05:30
kaikecarlos	e8cc760b10	Implement IHidServer Functions Add GetVibrationDeviceInfo and StartSixAxisSensor	2022-04-14 14:14:52 +05:30
kaikecarlos	9f51664b1d	Stub IRS Service	2022-04-14 14:14:52 +05:30
lynxnb	707c0cc0af	Stub `aocsrv::IAddOnContentManager::ListAddOnContent`	2022-04-14 14:14:52 +05:30
lynxnb	873ed641ea	Stub `nfp::IUser::ListDevices` and `nfp::IUser::GetState`	2022-04-14 14:14:52 +05:30
lynxnb	7d518cba2b	Stub `am::ICommonStateGetter::IsVrModeEnabled`	2022-04-14 14:14:52 +05:30
Billy Laws	c55e1a135e	Update adrenotools	2022-04-14 14:14:52 +05:30
Robin Kertels	594f061b21	Implement SSBOs Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-04-14 14:14:52 +05:30
Billy Laws	82d2a9ab56	Unify engine related macros to avoid excessive code duplication	2022-04-14 14:14:52 +05:30
Billy Laws	ae41ddf4f0	Implement a skeleton compute engine The Kepler compute engine is used to run compute jobs encapsulated in to QMDs on the GPU, this commit doesn't implement compute itself but adds the register and QMD structs that will be needed for it in the future.	2022-04-14 14:14:52 +05:30
Billy Laws	0298a7b1f6	Implement the actual inline to memory engine on subch 2 Used mostly by OGL games for copying stuff around.	2022-04-14 14:14:52 +05:30
Billy Laws	ba7111d33a	Add maxwell3d I2M support	2022-04-14 14:14:52 +05:30
Billy Laws	8c73b62b2c	Implement basic inline2memory engine support Not currently used by anything but will be used by both compute, 3D and its own engine in the future. Block linear copies are currently unsupported.	2022-04-14 14:14:52 +05:30
Billy Laws	5c387f5c5a	Fixup depth mode init value to allow ignoring redundant calls	2022-04-14 14:14:52 +05:30
PixelyIon	7a5c771f44	Rework GPU BufferView to have handle-like semantics We wanted views to extend the lifetime of the underlying buffers and at the same time preserve all views until the destruction of the buffer to prevent recreation which might be costly in the future when we need `VkBufferView`s of the buffer but also require a centralized list of all views for recreation of the buffer. It also removes the inconsistency between `BufferView*` being returned in `GetXView` in `GraphicsContext`.	2022-04-14 14:14:52 +05:30
Billy Laws	fae5332f20	Disable descriptor aliasing on Adreno to workaround shader compiler bug Aliased descriptor sets are incorrectly interpreted by the shader compiler causing it to bugger up LLVM function argument types and crash Co-authored-by: PixelyIon <pixelyion@protonmail.com>	2022-04-14 14:14:52 +05:30
Billy Laws	fc2c123ae2	Implement GPU depthMode register This controls the depth range used by the shader, hades already has support for the necessary patching so we only need to pass the current mode over to it and it'll do the necessary work.	2022-04-14 14:14:52 +05:30
Billy Laws	7e088ca465	Fix constbuf updates to actually increment the write offset Uses the register directly now as when we modify it we want the changes to be visible from macros too.	2022-04-14 14:14:52 +05:30
PixelyIon	d2f3479610	Use `eB5G6R5UnormPack16` VkFormat for `B5G6R5Unorm` and `R5G6B5Unorm` Using `eB5G6R5UnormPack16` (with a swizzle for `R5G6B5Unorm`) removes the need for `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` when those formats are aliased which happens in Sonic Mania among other titles.	2022-04-14 14:14:52 +05:30
PixelyIon	24d7066d8b	Add quirk to avoid `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` on Adreno GPUs Adreno GPUs have significant performance penalties from usage of `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` which require disabling UBWC and on Turnip, forces linear tiling. As a result, it's been made an optional quirk which doesn't supply the flag in `VkImageCreateInfo` and logs a warning if a view with a different Vulkan format from the original image is created.	2022-04-14 14:14:52 +05:30
PixelyIon	731d06010d	Set `eMutableFormat` in Texture Image Creation We often need to alias the underlying data as multiple Vulkan formats which requires the `eMutableFormat` bit to be set in `VkImageCreateInfo`, without doing this there'll be validation layer errors and potentially GPU bugs.	2022-04-14 14:14:52 +05:30
PixelyIon	dafcfa68ca	Transition texture layout to `eGeneral` after creation We no longer set the layout to general inside the Texture constructor, yet we need it to be set prior to the image being used as an attachment. We now transition the layout to `eGeneral` after creation of the texture object.	2022-04-14 14:14:52 +05:30
PixelyIon	5dea15632c	Add Controller Setup Guide A setup guide for controllers that goes through every available button/stick sequentially and opens up a corresponding dialog to map them.	2022-04-14 14:14:52 +05:30
PixelyIon	e2cae74425	Fix `RecyclerView` height in `CoordinatorLayout` for non touch-mode Any `RecyclerView`s with an app bar in a `CoordinatorLayout` would end up going off-screen due to the layout behavior implementing an offset by using a transform which would not correctly handle focusing on off-screen objects. This has now been fixed by manually adjusting height to be clipped to what is visible on the screen.	2022-04-14 14:14:52 +05:30
PixelyIon	3ae62c7fcc	Collapse `appBarLayout` on `appList` focus We collapse the app bar when the focus is on the app list which only occurs while using a controller, this is required as the app bar will never be collapsed otherwise. It also removes the older code to work around the limitation on `View.FOCUS_DOWN` by collapsing only when the end of the list was reached.	2022-04-14 14:14:52 +05:30
PixelyIon	3e4ec7323b	Tweak grid compact items Removes card elevation as it visually conflicts with the scrim, this also makes the scrim a bit darker to emphasize the text and slightly reduces the border radius.	2022-04-14 14:14:52 +05:30
PixelyIon	1d984b6de3	Add padding to end of `app_list` A small amount of padding at the end of `app_list` to signify that the end of the list has been reached was added.	2022-04-14 14:14:52 +05:30
PixelyIon	bac7c526ef	Make layout selectable for grid items The entire layout is now selectable for grid items rather than just the card, this greatly increases the visibility of the selection when not in touch mode as the contrast of a darken effect on the icon can be minimal depending on how dark the icon already is.	2022-04-14 14:14:52 +05:30
PixelyIon	1d070e6332	Close `InputStream` after usage in `KeyReader` The `InputStream` would not be closed after reading the key file in `KeyReader#import`, it's now wrapped with `use{ }` which handles closing the stream after usage.	2022-04-14 14:14:52 +05:30
MK73DS	647cb07dc8	Stub functions in IAccountServiceForApplication: - GetUserCount - InitializeApplicationInfo - IsUserAccountSwitchLocked	2022-04-14 14:14:52 +05:30
PixelyIon	b41d8b7997	Use `Surface#setFrameRate` for suggesting display refresh rate Setting the refresh rate via the Display API's `preferredDisplayModeId` is an outdated method on Android 11 and above, we now use `Surface#setFrameRate` alongside it to suggest a refresh rate for the display.	2022-04-14 14:14:52 +05:30
PixelyIon	730bf504f8	Correct Adreno texture binding quirk We incorrectly determined an Adreno driver bug to require padding between binding slots but the real issue was not supporting consecutive binding writes for `VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER` and was fixed by the padding slot unintentionally requiring individual writes. The quirk has now been corrected to explicitly specify this as the bug and the solution is more apt.	2022-04-14 14:14:52 +05:30
PixelyIon	da8cb48933	Fix Interval Map `GetAlignedRecursiveRange` lookup bug Any lookups done using `GetAlignedRecursiveRange` incorrectly added intervals in the exclusive interval entry lookups as the condition for adding them was the reverse of what it should've been due to a last minute refactor, it led to graphical glitches and crashes. This has been fixed and the lookups should return the correct results.	2022-04-14 14:14:52 +05:30
PixelyIon	f2faa74707	Fix crashes due to `SEGV_ACCERR` check On certain devices, accesses to a protected memory region can return `si_code` as non-`SEGV_ACCERR` values, this leads to a crash as we only pass access violations to the trap handler and would lead to not doing so on those devices which would then result in going to the crash handler.	2022-04-14 14:14:52 +05:30
PixelyIon	62c16fa73e	Upgrade Gradle (7.4), AGP (7.1.2) and Kotlin Dependencies	2022-04-14 14:14:52 +05:30
PixelyIon	77e2797219	Delete expired `weak_ptr`s for Texture/Buffer views A large amount of Texture/Buffer views would expire before reuse could occur in `Texture::GetView`/`Buffer::GetView`. These can lead to a substantial memory allocation given enough time and they are now deleted during the lookup while iterating on all entries. It should be noted that there are a lot of duplicate views that don't live long enough to be reused and the ultimate solution here is to make those views live long enough to be reused.	2022-04-14 14:14:52 +05:30
PixelyIon	881bb969c4	Implement access-driven Buffer synchronization Similar to constant redundant synchronization for textures, there is a lot of redundant synchronization of buffers. Although buffer synchronization is far cheaper than texture synchronization, it still has associated costs which have now been reduced by only synchronizing on access.	2022-04-14 14:14:52 +05:30
PixelyIon	7532eaf050	Attach Texture to Cycle in `Texture::TransitionLayout` Not doing so could result in the texture being destroyed before the completion of a transition and lead to undefined behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	3268b3779a	Implement access-driven Texture synchronization There was a lot of redundant synchronization of textures to and from host constantly as we were not aware of guest memory access, this has now been averted by tracking any memory accesses to the texture memory using the NCE Memory Trapping API and synchronizing only when required.	2022-04-14 14:14:52 +05:30
PixelyIon	3e33d49faf	Implement NCE Memory Trapping API An API for trapping accesses to guest memory and performing callbacks based on those accesses alongside managing protection of the memory. This is a fundamental building block for avoiding redundant synchronization of resources from the guest and host. Note: All accesses are treated as write accesses at the moment, support for picking up read accesses will be implemented later	2022-04-14 14:14:52 +05:30
PixelyIon	08510d75b0	Implement Interval Map An interval map is a crucial piece of infrastructure required for memory faulting to track any regions that have an associated callback and their protection. Additionally, efficient page-aligned lookups with semantics optimal for memory faulting are also a requirement and the ability to associate multiple regions with a single callback/protection entry rather than doing so on a per-region basis as we deal with split-mapping resources.	2022-04-14 14:14:52 +05:30
PixelyIon	5c9e42e384	Use mirror mappings for Textures and Buffers This is a prerequisite to memory trapping as we need to write to the mirror to avoid a race condition with external threads writing to a texture/buffer while we do so ourselves for the sync on a read/write, it also avoids an additional `mprotect` to `-WX`/`RWX` on a read access. An additional advantage for textures especially is that we now support split-mapping textures due to laying them out in a contiguous mirror and they will not require costly algorithmic changes. Buffers should also benefit from not needing to iterate over every region when they are split into multiple mappings.	2022-04-14 14:14:52 +05:30
PixelyIon	577a67babd	Support mirrors of multiple non-contiguous memory regions `CreateMirror` is limited to creating a mirror of a single contiguous region which does not work when creating a contiguous mirror of multiple non-contiguous regions. To support this functionality, `CreateMirrors` has been added, which expects a list of page-aligned regions and maps them into a contiguous mirror.	2022-04-14 14:14:52 +05:30
PixelyIon	e35ab6d1e0	Move to mapping guest AS as shared memory We want to create arbitrary mirrors in the guest address space and to make this possible, we map the entire address space as a shared memory file. A mirror is mapped by using `mmap` with the offset into the guest address space.	2022-04-14 14:14:52 +05:30
Billy Laws	a5dd961f01	Add support for batched method sending Important for constbuf updates which would be very slow if done one at a time.	2022-04-14 14:14:52 +05:30
Robin Kertels	43879e2476	Round up when calculating size of compressed texture in bytes	2022-04-14 14:14:52 +05:30
Robin Kertels	d889550e84	Don't set COLOR_ATTACHMENT_BIT for compressed formats. The better solution would be to only set this for formats that support it on original HW but this will get rid of the validation errors for now.	2022-04-14 14:14:52 +05:30
Robin Kertels	82296ac5b8	Use buffer size instead of allocation size for Buffer constructor Fixes a validation error.	2022-04-14 14:14:52 +05:30
Robin Kertels	752245c3c8	Enable provoking vertex feature	2022-04-14 14:14:52 +05:30
Robin Kertels	dd45d054e7	Enable shaderDrawParameters	2022-04-14 14:14:52 +05:30
Billy Laws	737ff840a5	Update adrenotools for BTI	2022-04-14 14:14:52 +05:30
Billy Laws	7e16c1f989	Heavily optimise GPFIFO command dispatch to reduce redundant checks Previously for methods with count > 1 the subchannel and engine would be looked up for each part of the method rather than only doing so at the start. Each call also needed to be looked up to see if it touched a macro or GPFIFO method. Fix this by doing checks outside of the main dispatch loop with templated helper lambdas to avoid needing to repeat lots of code. Maxwell3D is the only subchannel with a fast path for now but more can be added later if needed.	2022-04-14 14:14:52 +05:30
Billy Laws	b4927d0138	Add support for turnip and driver file redirection via libadrenotools	2022-04-14 14:14:52 +05:30
Billy Laws	dd91d063a5	Pass native library dir to OS + reorder OS init order so paths are first This is required for integrating libadrenotools, which needs access to library and app directories in the GPU class constructor.	2022-04-14 14:14:52 +05:30
Billy Laws	900d00a876	Update tzcode with missed bugfix	2022-04-14 14:14:52 +05:30
Billy Laws	011de98940	Rework formats to support passing through guest swizzle values Almost every Maxwell format now directly corresponds to a Vulkan format. This allows formats to be passed through and the swizzle used directly from guest (with some extra swizzle handling for edge cases) thus saving the need to explicitly support each swizzle combination which adds a lot of code bloat. The format header is additionally reordered with line breaks to separate formats by their bits-per-block.	2022-04-14 14:14:52 +05:30
Billy Laws	6f17d1351f	Fixup ordering for B10G11R11Float texture format	2022-04-14 14:14:52 +05:30
Billy Laws	78238d550a	Add 6 channel downmixing support for Audout The specific attenuations used for each channel are taken from Ryujinx.	2022-04-14 14:14:52 +05:30
Billy Laws	2e1a1a965d	Fixup AudioTrack locking	2022-04-14 14:14:52 +05:30
PixelyIon	727f83e969	Fix Incorrect Vertex Binding Divisor State Submission We always submit pipeline divisor descriptions regardless of binding input rate being vertex rather than instance. This is invalid behavior and has been fixed by only submitting binding descriptors when the input rate is per-instance.	2022-04-14 14:14:52 +05:30
PixelyIon	9f7e80cf8f	Fix Adreno Texture Sampler Binding Bug Adreno proprietary drivers suffer from a bug where `VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER` requires 2 descriptor slots rather than one, we add a padding slot to fix this issue. `QuirkManager` was introduced to handle per-vendor/per-device errata and allow enabling this on Adreno proprietary drivers specifically as to not affect the performance of other devices.	2022-04-14 14:14:52 +05:30
PixelyIon	ddb2ba8a1b	Rename `QuirkManager` to `TraitManager` Quirk terminology was deemed to be inappropriate for describing the features/extensions of a device. It has been replaced with traits which is far more fitting but quirks will be used as a terminology for errata in devices.	2022-04-14 14:14:52 +05:30
PixelyIon	0b2ce6a8f3	Fix Texture Handle Offset Calculation The texture handle offset calculation involved an incorrect shift by descriptor size which was found to be unnecessary and would result in an invalid handle that had the wrong TIC/TSC index and caused broken rendering.	2022-04-14 14:14:52 +05:30
PixelyIon	aa57ec6d55	Destroy `CommandExecutor` Nodes Before Waiting on Execution `nodes` and `syncTextures` were cleared after waiting on the `CommandExecutor` fence rather than before, this wasted execution time after the wait for something that could be performed prior to the wait.	2022-04-14 14:14:52 +05:30
PixelyIon	90a1b3348c	Implement D24S8 + R11G11B10 Formats	2022-04-14 14:14:52 +05:30
PixelyIon	bd718175ce	Enable `VK_KHR_uniform_buffer_standard_layout` when available We now attempt to enable `VK_KHR_uniform_buffer_standard_layout` when present as lax UBO layout significantly reduces complexity. If a device doesn't support this extension, we still assume that the device supports it implicitly as this has proven to be true across all major mobile GPU vendors regardless of the driver version but enabling this prevents validation layer errors.	2022-04-14 14:14:52 +05:30
PixelyIon	22ce531e6f	Force Memory Barrier at `VkRenderPass` Start We depend on past commands to have completed execution in a renderpass, a subpass dependency on all graphics stages from `VK_SUBPASS_EXTERNAL` to subpass #0 is used to enforce this. Nvidia and Adreno proprietary drivers implicitly do this but Turnip or Mali drivers require this or they execute out of order.	2022-04-14 14:14:52 +05:30
PixelyIon	35fde2cd0b	Rework Blocklinear Texture Deswizzling Blocklinear texture decoding was broken for padding blocks and would incorrectly decode them resulting in major texture corruption for any textures with their widths not aligned to 64 bytes. This has now been fixed with neater code which avoids redundant repetition of any code using lambdas and functions where necessary.	2022-04-14 14:14:52 +05:30
PixelyIon	043be4d8f7	Implement Maxwell3D Two-Side Stencil Toggle Stencil operations are configurable to be the same for both sides or have independent stencil state for both sides. It is controlled via the previously unimplemented `stencilTwoSideEnable`.	2022-04-14 14:14:52 +05:30
PixelyIon	80ae7b255a	Implement Maxwell3D Front Face Flip	2022-04-14 14:14:52 +05:30
PixelyIon	40a3887695	Implement Maxwell3D Viewport Y Swizzle & Lower-Left Origin	2022-04-14 14:14:52 +05:30
Billy Laws	3be30e68c3	Add D16 depth format and ZF32 TIC format Used by One Piece Unlimited World Red	2022-04-14 14:14:52 +05:30
Billy Laws	be007c4ccc	Fixup texture swizzling to actually function Before this we were not applying the supplied swizzles, will be superseded in the future by using guest swizzle values.	2022-04-14 14:14:52 +05:30
Billy Laws	6e48460c0d	Add BC2/3 format support	2022-04-14 14:14:52 +05:30
Billy Laws	2253bc3151	Reorder GPU quirks member to prevent it constructing after device init	2022-04-14 14:14:52 +05:30
Billy Laws	62db21fb78	Rework GPFIFO method distribution and macros to support multiple engines Fermi2D supports macros in addition to Maxwell3D, these both share code memory. To support this we rework the macro interpreter to support passing in a target engine and abstract the communications out into an interface that can be implemented by applicable engines. ``` GPFIFO <-> MME <-> Maxwell3D ^ ^---> Fermi2D X------------> I2M X------------> MaxwellComputeB X--Flush-----> MaxwellDMA ```	2022-04-14 14:14:52 +05:30
Billy Laws	8d5463ef28	Drop engine base class usage from GPFIFO This class does nothing since we stopped GPFIFO submits from using virtual functions so it can be dropped.	2022-04-14 14:14:52 +05:30
Billy Laws	4378658cbc	Update BCeNabler to support ---X .text devices	2022-04-14 14:14:52 +05:30
PixelyIon	41aad83c33	Tie Shader `ObjectPool` Lifetime to Shader `Program` Shader programs allocate instructions and blocks within an `ObjectPool`, there was a global pool prior that was never reaped aside from on destruction. This led to a leak where the pool would contain resources from shader programs that had been deleted, to avert this the pools are now tied to shader programs.	2022-04-14 14:14:52 +05:30
PixelyIon	e747de37cf	Implement Blocklinear TIC Type	2022-04-14 14:14:52 +05:30
PixelyIon	723189a948	Calculate Blocklinear Texture Aligned Size Correctly The size of blocklinear textures did not consider alignment to Block/ROB boundaries before, it is aligned to them now. Incorrect sizes led to textures not being aliased correctly due to different size calculations for GraphicBufferProducer surfaces and Maxwell3D color RTs.	2022-04-14 14:14:52 +05:30
Billy Laws	95685b8207	Avoid iterator invalidation segfault when unregistering a syncpt waiter erase invalidated `it` leading to a potential segfault if the GPU was very far behind, bail out early to avoid that since there can only be one occurrence at most in the buffer anyway.	2022-04-14 14:14:52 +05:30
Billy Laws	e7bfd93541	Implement BC7 format support Used by ARMS	2022-04-14 14:14:52 +05:30
Billy Laws	99652c5eda	Support partially mapped cbufs Buggy games sometimes supply an incorrect cbuf size so limit buffers to the first unmapped region.	2022-04-14 14:14:52 +05:30
PixelyIon	6a6f51ea84	Implement Maxwell3D Depth/Stencil State Implements the entirety of Maxwell3D Depth/Stencil state for both faces including compare/write masks and reference value. Maxwell3D register `stencilTwoSideEnable` is ignored as its behavior is unknown and could mean the same behavior for both stencils or the back facing stencil being disabled as a result of this it is unimplemented.	2022-04-14 14:14:52 +05:30
PixelyIon	9f5c3d8ecd	Force Textures to be Optimal on Host GPU We don't respect the host subresource layout in synchronizing linear textures from the guest to the host when mapped to memory directly, this leads to texture corruption and while the real fix would involve respecting the host subresource layout, this has been deferred for later as real world performance advantages/disadvantages associated with this change can be observed more carefully to determine if it's worth it.	2022-04-14 14:14:52 +05:30
Billy Laws	ab4962c4e4	Implement additional texture formats, including BCn BCeNabler is required for BCn textures, the pre-swizzled formats will be removed when arbitrary swizzle support is added later.	2022-04-14 14:14:52 +05:30
Billy Laws	600b94505c	Fix A2R10G10B10 render target format This was wrongly described as R10G10B10A2 in the enum when it's actually A2R10G10B10, a format natively supported in Vulkan with just a swizzle.	2022-04-14 14:14:52 +05:30
Billy Laws	175ba11f07	Integrate BCeNabler support into QuirkManager Allows using BCn format textures on devices where they are unsupported by the driver.	2022-04-14 14:14:52 +05:30
Billy Laws	47d920d91e	Make GPU private static functions file-local	2022-04-14 14:14:52 +05:30
PixelyIon	edd51c3dfa	Fix Color RT Disabling Bug Color RTs are disabled by setting their format as `None`, it was removed while transitioning to macros and resulted in a missing format exception. It has been readded as several applications depend on this behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	a2285669b3	Use static vector for shader bytecode to prevent constant reallocation Using `std::vector` for shader bytecode led to a lot of reallocation due to constant resizing, switching over the static vector allows for a single static allocation of the maximum possible guest shader size (1 MiB) to be done for every stage resulting in a 6 MiB preallocation which is unnoticeable given the total memory overhead of running a Switch application.	2022-04-14 14:14:52 +05:30
PixelyIon	21a6866def	Fix Maxwell3D Blend Enum Conversion Bugs The `OneMinusSourceAlpha` blending factor was converted to `eOneMinusSrcColor` rather than `eOneMinusSrcAlpha` leading to incorrect blending behavior in certain titles. There was a similar issue with the order of `MinimumGL`/`MaximumGL` and `SubtractGL`/`ReverseSubtractGL` being the opposite of what it should've been; both of these issues have been fixed.	2022-04-14 14:14:52 +05:30
PixelyIon	0a506088f4	Fix `NextSubpassNode` Subpass Index Bug `NextSubpassNode` didn't increment `subpassIndex` which runs commands with the wrong subpass index resulting in them accessing invalid attachments or other bugs that may arise from using the wrong subpass.	2022-04-14 14:14:52 +05:30
PixelyIon	defbfe8f78	Serialize Maxwell3D Draw State for Subpass All Maxwell3D state was passed by reference to the draw command lambda, this would break if there was more than one pass or the state was changed in any way before execution. All state has now been serialized by value into the draw command lambda capture, retaining state regardless of mutations of the class state.	2022-04-14 14:14:52 +05:30
PixelyIon	934130b3e6	Remove Implicit Command Executor Resource Attachment Any usage of a resource in a command now requires attaching that resource externally and will not be implicitly attached on usage, this makes attaching of resources consistent and allows for more lax locking requirements on resources as they can be locked while attaching and don't need to be for any commands, it also avoids redundantly attaching a resource in certain cases.	2022-04-14 14:14:52 +05:30
PixelyIon	f0e9c42097	Fix Fence Cycle Double Insertion Lifetime Bug If an object is attached to a `FenceCycle` twice then it would cause `FenceCycleDependency::next` to be overwritten and lead to destruction of dependencies prior to the fence being signaled causing usage of deleted resources. This commit fixes this by tracking what fence cycle a dependency is currently attached to and doesn't reattach if it's already attached to the current fence cycle.	2022-04-14 14:14:52 +05:30
PixelyIon	6a831f6ed7	Add `VK_EXT_shader_demote_to_helper_invocation` Quirk An assumption was hardcoded into `Shader::Profile` regarding devices supporting demotion of shader invocations to helpers. This assumption wasn't backed by enabling the `VK_EXT_shader_demote_to_helper_invocation` extension via a quirk leading to assertions when it was used by the shader compiler, a quirk has now been added for the extension and is supplied to the shader compiler accordingly.	2022-04-14 14:14:52 +05:30
lynxnb	58c871ed9a	Correctly hide system bars in `EmulationActivity` on Android >= 11	2022-04-14 14:14:52 +05:30
Billy Laws	3ff8075151	Move vertex and RT format conv to macros and fill them fully in Makes the format conversions easier to read and shorter, and adds in some new formats needed to complete the RT table properly.	2022-04-14 14:14:52 +05:30
PixelyIon	8f0db18624	Fix `ControllerActivity` Controller Type Change Crash If the controller type was changed from a type with a larger amount of buttons/axes to one with a fewer amount, a crash would occur due to the transition animation retaining those elements as children yet returning `NO_POSITION` from `getChildAdapterPosition` in `DividerItemDecoration` which was an unhandled case and led to an OOB array access.	2022-04-14 14:14:52 +05:30
PixelyIon	2c46709064	Fix `ControllerPreference`'s `index` not being passed to Activity A bug caused by not passing the index argument to `ControllerActivity` led to all preferences opening the activity that pertained to Controller #1. This was fixed by passing the `index` argument in the activity launch intent.	2022-04-14 14:14:52 +05:30
PixelyIon	270ee4a7a6	Update Gradle + AGP + Kotlin Dependencies Gradle was updated to `7.3.3` and AGP to `7.1.0-rc01` from `beta04` with all other dependencies being updated to the latest available versions.	2022-04-14 14:14:52 +05:30
PixelyIon	98b366c1f5	Fix Texture Synchronization Bug Fixes texture corruption due to incorrect synchronization, the barrier would not enforce waiting till the texture was entirely rendered causing an incomplete texture to be downloaded which led to rendering bugs for certain GPUs including ARM's Mali GPUs.	2022-04-14 14:14:52 +05:30
PixelyIon	aea40e6496	Fix `enabledFeature2` Unlinking Assertion Bug A bug caused an assertion if both `VK_EXT_custom_border_color` and `VK_EXT_vertex_attribute_divisor` weren't supported due to mistakenly unlinking `PhysicalDeviceVertexAttributeDivisorFeaturesEXT` instead of `PhysicalDeviceCustomBorderColorFeaturesEXT` when `VK_EXT_custom_border_color` isn't supported which would potentially lead to unlinking the same structure twice and cause the assertion.	2022-04-14 14:14:52 +05:30
Billy Laws	68f31c3688	Use macros for defining texture formats and their conversions Avoids the need to repeat all the possible component types for each texture format while also making them simpler to add and easier to read.	2022-04-14 14:14:52 +05:30
lynxnb	a9d4e6bb1a	Add screen orientation setting	2022-04-14 14:14:52 +05:30
PixelyIon	bc29b23972	Implement CPU-only Maxwell3D Inline Constant Buffer Updates Implements inline constant buffer updates that are written to the CPU copy of the buffer rather than generating an actual inline buffer write, this works for TIC/TSC index updates but won't work when the buffer is expected to actually be updated inline with regard to sequence rather than just as a buffer upload prior to rendering. GPU-sided constant buffer updates will be implemented later with optimizations for updating an entire range by handling GPFIFO `Inc`/`NonInc` directly and submitting it as a host inline buffer update.	2022-04-14 14:14:52 +05:30
PixelyIon	08f29f7da4	Make `ActiveDescriptorSet` movable and non-copyable There should only ever be a single instance of an `ActiveDescriptorSet` that tracks the lifetime of a descriptor set as the destructor is responsible for freeing the descriptor set. There are cases where a new object inheriting the descriptor set needs to be created; in these cases we need to have move semantics and make the destructor of the prior object inert, this allows for moving to the new object without any side effects. If the copy constructor was used in these cases the older object would free the set on its destruction which would lead to the set being invalid on existing instances which is incorrect behavior and would likely lead to driver crashes.	2022-04-14 14:14:52 +05:30
PixelyIon	bb14af4f7a	Implement Maxwell3D Sampled Textures The descriptor sets should now contain a combined image and sampler handle for any sampled textures in the guest shader from the supplied offset into the texture constant buffer. Note: Games tend to rely on inline constant buffer updates for writing the texture constant buffer and due to it not being implemented, the value will be read as 0 which is incorrect.	2022-04-14 14:14:52 +05:30
PixelyIon	d9a9e52350	Use `ConstantBuffer` instead of `BufferView` for Shader Constant Buffers We want read semantics inside the constant buffer object via the mappings to avoid a pointless GPU VMM mapping lookup. It is a fairly frequent operation so this is necessary, the ability to write directly will be added in the future as well.	2022-04-14 14:14:52 +05:30
PixelyIon	adb0a16873	Implement Maxwell 3D Textures Implements parsing for the Maxwell 3D TIC pool and conversion of a TIC into a `GuestTexture`, support is limited to pitch-linear RGB565/A8R8G8B8 textures at the moment but will be extended as games utilize more formats and layouts. Support for 1D buffers is also omitted at the moment since they need special handling with them effectively being treated as buffers in Vulkan rather than images.	2022-04-14 14:14:52 +05:30
PixelyIon	a7b90e7825	Change Texture Pitch Unit to Bytes from Pixels The pitch of the texture should always be supplied in terms of bytes as it denotes alignment on a byte boundary rather than a pixel one, it is also always utilized in terms of bytes rather than pixels so this avoids an unnecessary conversion. Note: GBP stride unit was assumed to be pixels earlier but is likely bytes which is why there are no changes to the supplied value there, if this is not the case it'll be fixed in the future	2022-04-14 14:14:52 +05:30
PixelyIon	a9aa16798f	Add `-fsigned-bitfields` for defined bitfield `int` behavior We want consistent behavior between signed `int`s in bitfields and outside of bitfields, the `-fsigned-bitfields` flag enforces this behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	87c8dc94d2	Implement Maxwell3D Samplers Maxwell3D `TextureSamplerControl` (TSC) are fully converted into Vulkan samplers with extension backing for all aspects that require them (border color/reduction mode) and approximations where Vulkan doesn't support certain functionality (sampler address mode) alongside cases where extensions may not be present (border color).	2022-04-14 14:14:52 +05:30
PixelyIon	e48a7d7009	Fix Mapping Caching For Maxwell 3D Buffers Code involving caching of mappings was copied from `RenderTarget` without much consideration for applicability in buffers, the reason for caching mappings in RTs was that the view may be invalidated by more than the IOVA/Size being changed but this doesn't hold true for buffers generally so invalidation can only be on the view level with the mappings being looked up every time since the invalidation would likely change them.	2022-04-14 14:14:52 +05:30
PixelyIon	ff27dce24c	Implement `ObjectHash` for hashing trivial objects in maps `std::hash` doesn't have a generic template where it can be utilized for arbitrary trivial objects and implementing this might result in conflicts with other types. To fix this a generic templated hash is now provided as a utility structure, that can be utilized directly in hash-based containers such as `unordered_map`.	2022-04-14 14:14:52 +05:30
PixelyIon	97cfcba0da	Add Nullability for Optional Semantics to `span` Nullability allows for optional semantics where a span may be explicitly invalidated with `nullptr` being used as a sentinel value for it and a boolean operator that allows trivial checking for if the span is valid or not.	2022-04-14 14:14:52 +05:30
PixelyIon	c11962e8e4	Implement Maxwell3D Bindless Texture Constant Buffer Index The index of the constant buffer with bindless texture descriptors is now retrieved from Maxwell3D register state and passed to the shader compiler.	2022-04-14 14:14:52 +05:30
PixelyIon	1c3f62b7b4	Implement Maxwell3D Indexed Drawing	2022-04-14 14:14:52 +05:30
PixelyIon	23cdfe2139	Implement Maxwell3D Index Buffers Adds support for index buffers including U8 index buffers via the `VK_EXT_index_type_uint8` extension which has been added as an optional quirk but an exception will be thrown if the guest utilizes it but the host doesn't support it.	2022-04-14 14:14:52 +05:30
PixelyIon	a4041364e1	Address CR comments Note: CR comments regarding `ShaderSet` and `PipelineStages` will be addressed at a later date with a common class for associative enum arrays.	2022-04-14 14:14:52 +05:30
PixelyIon	e1e14e781f	Support Dual Vertex Shader Programs Add support for parsing and combining `VertexA` and `VertexB` programs into a single vertex pipeline program prior to compilation, atomic reparsing and combining is supported to only reparse the stage that was modified and recombine once at most within a single pipeline compilation.	2022-04-14 14:14:52 +05:30
PixelyIon	974cf03c18	Add Atomic Pipeline Stage Invalidation Atomically invalidate pipeline stages as runtime information that pertains to them changes rather than never recompiling pipelines on runtime information being updated resulting in out of date pipelines or recompiling all pipelines on any runtime information updates.	2022-04-14 14:14:52 +05:30
PixelyIon	5414db8411	Rework Maxwell3D Shader/Pipeline Stages Compilation with UBO support Shader compilation is now broken into shader program parsing and pipeline shader compilation which will allow for supporting dual vertex shaders and more atomic invalidation depending on runtime state limiting the amount of work that is redone. Bindings are now properly handled allowing for bound UBOs to be converted to the appropriate host UBO as designated by the shader compiler by creating Vulkan Descriptor Sets that match it.	2022-04-14 14:14:52 +05:30
PixelyIon	055d315048	Separate Maxwell3D Stages into Shader/Pipeline We need this to make the distinction between a shader and pipeline stage, as shader programs are bound at a different rate than that of pipeline stage resources such as UBOs.	2022-04-14 14:14:52 +05:30
PixelyIon	492dd47218	Implement Vulkan Descriptor Set Allocator A fixed descriptor set allocator which manages the size of the pool with automatic reallocations when any allocations run out of descriptors.	2022-04-14 14:14:52 +05:30
PixelyIon	9af9f1d41a	Implement Maxwell3D Constant Buffer Selector The Constant Buffer Selector is used to point to a constant buffer that will be bound to a shader stage or updated with inline data.	2022-04-14 14:14:52 +05:30
PixelyIon	afa34e320a	Retain Shader Binding State Across Stages An instance of `Shader::Backend::Bindings` must be retained across all stages for correct emission of bindings, which is now done inside `GraphicsContext::GetShaderStages`.	2022-04-14 14:14:52 +05:30
PixelyIon	550d12b7fa	Set Shader Runtime Generic Vertex Attribute Types Correctly The vertex attribute types supplied prior were just the default which is `Float`, this works for some cases but will entirely break if the attribute type isn't a float. The attribute types are now set correctly.	2022-04-14 14:14:52 +05:30
PixelyIon	a2de6b9255	Fix Maxwell3D `vertexEndGl` Register Offset The offset was set to 0x586 which is the location of `vertexBeginGl`, it's been corrected now and set to 0x585.	2022-04-14 14:14:52 +05:30
Billy Laws	5815cda7a7	Update Vulkan-Hpp to v1.2.202	2022-04-14 14:14:52 +05:30
PixelyIon	bd6cd0056c	Support Multi-Aspect Copy in `Texture::CopyIntoStagingBuffer` Only copying a single aspect was supported by `CopyIntoStagingBuffer` earlier due to not supplying a `VkBufferImageCopy` for each aspect separately, this has now been done with Color/Depth/Stencil aspects having their own `VkBufferImageCopy` for the `VkCmdCopyImageToBuffer` command.	2022-04-14 14:14:52 +05:30
PixelyIon	daff17c776	Order `TextureView` Definition Correctly The definition of the `TextureView` class was spread across `texture.cpp` and has now been moved to the top of the file above the other half of the definition.	2022-04-14 14:14:52 +05:30
PixelyIon	189b9533f2	Disable Vertex Buffers With 0 as IOVAs A buffer with 0 as the start/end IOVA should be invalid as there shouldn't be any mappings at 0 in GPU VA, titles such as Puyo Puyo Tetris configure the Vertex Buffer with 0 IOVAs which leads to a segmentation fault without this exception.	2022-04-14 14:14:52 +05:30
PixelyIon	cfeb8098db	Attach `TextureView`/`BufferView` Lifetime to `FenceCycle` The lifetime of a texture and buffer view is now bound by the `FenceCycle` in `CommandExecutor`, this ensures that a `VkImageView` isn't destroyed prior to usage leading to UB.	2022-04-14 14:14:52 +05:30
PixelyIon	34fc1e32b8	Remove `Texture`s from `RenderPassNode::Storage` The lifetime of all textures bound to a RenderPass alongside syncing of textures is already handled by `CommandExecutor` and doesn't need to be redundantly handled by `RenderPassNode`. It's been removed as a result of this.	2022-04-14 14:14:52 +05:30
PixelyIon	45c7a89fc3	Cleanup `BufferView`/`TextureView` Locking Code Renames the variable to be neater and less confusing alongside adding comments for `try_lock()` to make the goal of the function more apparent.	2022-04-14 14:14:52 +05:30
PixelyIon	7776ef2cd0	Support Depth/Stencil RT in Draw Adds the depth/stencil RT as an attachment for the draw but with `VkPipelineDepthStencilStateCreateInfo` stubbed out, it'll not function correctly and the contents will not be what the guest expects them to be.	2022-04-14 14:14:52 +05:30
PixelyIon	525850ae09	Stub `VkPipelineDepthStencilStateCreateInfo` Maxwell3D Depth State is composed of several registers and will be implemented at a later date, for the time being it's been stubbed.	2022-04-14 14:14:52 +05:30
PixelyIon	9e63ecf05d	Implement Maxwell3D Depth/Stencil Clears Support for clearing the depth/stencil RT has been added as its own function via either optimized `VkAttachmentLoadOp`-based clears or `vkCmdClearAttachments`. A bit of cleanup has also been done for color RT clears with the lambda for the slow-path purely calling the command rather than creating the parameter structures.	2022-04-14 14:14:52 +05:30
PixelyIon	bf89f96bf5	Implement Optimized LoadOp Clears for Depth/Stencil Attachments Implements `AddClearDepthStencilSubpass` in `CommandExecutor` which is similar to `ClearColorAttachment` in that it uses `VK_ATTACHMENT_LOAD_OP_CLEAR` for the clear which is far more efficient than using `VK_ATTACHMENT_LOAD_OP_LOAD` then doing the clear.	2022-04-14 14:14:52 +05:30
PixelyIon	6f6413f02d	Fix `VkSubpassDependency` for Depth/Stencil Attachments The stage/access mask for `VkSubpassDependency` were hardcoded to only be valid for color attachments earlier, this has now been fixed by branching based on the format aspect.	2022-04-14 14:14:52 +05:30
PixelyIon	aa32f6b017	Add Depth/Stencil Format Support to `Texture` Sets `VkImageUsageFlags` correctly rather than hardcoding it for color attachments and adds multiple `VkBufferImageCopy` to `VkCmdCopyBufferToImage` for Color/Depth/Stencil aspects of an image.	2022-04-14 14:14:52 +05:30
PixelyIon	68c990c041	Implement Maxwell3D Depth/Stencil Render Target Support the Maxwell3D Depth RT for Z-buffering, this just creates an equivalent `RenderTarget` object with no support on the API-user side (IE: `Draw` and `ClearBuffers`).	2022-04-14 14:14:52 +05:30
PixelyIon	2a8bcc60c7	Make Render Targets Abstract for Color/Depth RTs This prefixes all RT functions that deal with color RTs with `Color` and abstracts out common functions that will be used for both color and depth RTs. All common Maxwell3D structures are also moved out of the `ColorRenderTarget` (`RenderTarget` previously) structure.	2022-04-14 14:14:52 +05:30
PixelyIon	b0f084ae32	Implement Shader Compiler Input Topology Sets the input topology in the runtime information for the shader compiler correctly based on the Maxwell3D input topology.	2022-04-14 14:14:52 +05:30
PixelyIon	7a63ad7d3d	Implement `VkPipelineCache` for host pipeline caching To allow for caching of pipelines on the host a `VkPipelineCache` has been added, it is entirely in-memory and is not flushed to the disk which'll be done in the future alongside caching guest shaders to further avoid translation where possible.	2022-04-14 14:14:52 +05:30
PixelyIon	4dcf12c4c0	Implement Maxwell3D Draws Uses all Maxwell3D state converted into Vulkan state to do an equivalent draw on the host GPU, it sets up RT/Vertex Buffer/Vertex Attribute/Shader state and creates a stubbed out `VkPipelineLayout` for the draw. Any descriptor state isn't currently handled and is yet to be implemented, currently there's no Vulkan pipeline cache supplied which will be implemented subsequently.	2022-04-14 14:14:52 +05:30
PixelyIon	57b0d6a2fb	Stub `VkPipelineMultisampleStateCreateInfo` Multisampling will be worked on later and for the time being is being safely stubbed by setting the sample count to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	56b3a01a59	Track `VkRenderPass` and Subpass Index for Subpass Function Nodes We require a handle to the current renderpass and the index of the subpass in certain cases, this is now tracked by the `CommandExecutor` and passed in as a parameter to `NextSubpassFunctionNode` and the newly-introduced `SubpassFunctionNode`.	2022-04-14 14:14:52 +05:30
PixelyIon	cb7f68b98d	Allow Attaching Texture/Buffers to `CommandExecutor` Switch from `SubmitWithCycle` to manually allocating the active command buffer to tag dependencies with the `FenceCycle` that prevents them from being mutated prior to execution. This new paradigm could also allow eager recording of commands with only submission being deferred.	2022-04-14 14:14:52 +05:30
PixelyIon	aeea3e6f66	Allow manual allocation of `ActiveCommandBuffer` `CommandScheduler` API users can now directly allocate an active command buffer that they need to manage alongside its fence, this can allow for more efficient recording as it doesn't need to be immediately submitted after, it can also allow attaching objects to a `FenceCycle` prior to submission that can be useful for locking resources.	2022-04-14 14:14:52 +05:30
PixelyIon	8989305637	Implement Host Vertex Buffer Translation Uses the buffer cache to retrieve an equivalent host vertex buffer for a corresponding guest vertex buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	b6ba770a27	Implement Maxwell3D Shader Compilation Compiles shaders supplied by the guest with caching and automatic invalidation, the size of the shader is also automatically determined by looking for `BRA $` instructions which cause an infloop, it should be noted that we have a maximum shader bytecode size, any shader above this size will not be supported.	2022-04-14 14:14:52 +05:30
PixelyIon	08afda6ac4	Implement Graphics Shader Compilation in `ShaderManager` Graphics shaders can now be compiled using the shader compiler and emit SPIR-V that can be used on the host. The binding state isn't currently handled alongside constant buffers and textures support in `GraphicsEnvironment` yet.	2022-04-14 14:14:52 +05:30
PixelyIon	353ca8ec84	Fix Viewport X/Y Translation The operands of the subtraction in the X/Y translation calculation were the wrong way around which led to negative translations that would translate the viewport off the screen.	2022-04-14 14:14:52 +05:30
PixelyIon	f06a12170f	Set Default Color Write Mask to RGBA The default color write mask should mask no channels and write all of them and should be mutated to mask out certain channels as required by the guest.	2022-04-14 14:14:52 +05:30
PixelyIon	23faf1370c	Use Static Arrays for Vertex Buffer Bindings & Attributes We cannot statically construct the vertex buffer/attribute arrays for Vulkan due to inactive attributes or buffers which isn't possible on Vulkan, we also cannot just change the count dynamically as there might be disabled buffers or attributes in the middle. We just have a `static_array` which should dynamically be filled in with buffer binding/attribute Vulkan structures before submission.	2022-04-14 14:14:52 +05:30
PixelyIon	8652edb07b	Make `GuestBuffer` format-less Buffers generally don't have formats that are fundamentally associated with them unless they're texel buffers, if that is the case it can be manually set in `BufferView`.	2022-04-14 14:14:52 +05:30
PixelyIon	03314ec7d2	Introduce `BufferManager` The Buffer Manager handles mapping of guest buffers to host buffer views with automatic handling of sub-buffers and eventually supporting recreation of overlapping buffers to create a single larger buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	bde61d72cc	Introduce `Buffer` and `BufferView` Implements infrastructure for using guest buffers on the host for rendering, a `BufferManager` is still missing which'd handle mapping from guest buffers to host buffers and will be subsequently committed. It should be noted that `BufferView` is also disconnected from `Buffer` and shared for every instance with the same properties like `TextureView` is now.	2022-04-14 14:14:52 +05:30
PixelyIon	6eda1777c5	Rework `TextureView` to be disconnected from `Texture` We want `TextureView`(s) to be disconnected from the backing on the host and instead represent a specific texture on the guest with a backing that can change depending on mapping of new textures which'd invalidate the backing but should now be automatically repointed to an appropriate new backing. This approach also requires locking of the backing to function as it is mutable till it has been locked or the backing has an attached `FenceCycle` that hasn't been signaled which will be added for `CommandExecutor` in a subsequent commit.	2022-04-14 14:14:52 +05:30
PixelyIon	82916657fb	Only Enable Shader Compiler Debug Mode in Debug Builds Sets properties that relate to debugging in `Shader::Settings` to `true` only for debug builds while leaving them disabled for release builds.	2022-04-14 14:14:52 +05:30
PixelyIon	b09f28c0ba	Implement Missing Shader Compiler Quirks Introduces the `supportsShaderViewportIndexLayer` quirk and sets `Shader::Profile::support_int64_atomics` depending on if the `supportsAtomicInt64` quirk is available.	2022-04-14 14:14:52 +05:30
PixelyIon	f3e81094a2	Implement Shader Compiler Property Quirks Introduces the `floatControls`, `supportsSubgroupVote` and `subgroupSize` quirks for the shader compiler which are based on Vulkan `PhysicalDevice` properties.	2022-04-14 14:14:52 +05:30
PixelyIon	51c4df24b5	Switch from `VK_VERSION_` to `VK_API_VERSION_` macros Vulkan has officially deprecated `VK_VERSION_*` macros for versioning as it has introduced the variant into the version. It should however be `0` for the Vulkan API and doesn't need to be printed.	2022-04-14 14:14:52 +05:30
PixelyIon	0588a525b4	Implement Shader Compiler Extension/Feature Quirks Introduces several quirks for optional features used by the shader compiler which are now reported in the `Shader::HostTranslateInfo` and `Shader::Profile` structure. There are still property-related quirks for the shader compiler which haven't been implemented in this commit.	2022-04-14 14:14:52 +05:30
PixelyIon	8f3887c56a	Create `memory::Buffer` & Implement `StagingBuffer` as derivative A `Buffer` class was created to hold any generic Vulkan buffer object with `span` semantics, `StagingBuffer` was implemented atop it as a wrapper for `Buffer` that inherits from `FenceCycleDependency` and can be used as such.	2022-04-14 14:14:52 +05:30
PixelyIon	a55aca76c6	Rename `TextureView::backing` to `TextureView::texture` It was determined that `backing` wasn't a very descriptive name and that it conflicted with the texture's own backing, the name was changed to `texture` to make it more apparent that it was specifically the `Texture` object backing the view.	2022-04-14 14:14:52 +05:30
PixelyIon	482c573b81	Introduce `FlatMemoryManager::ReadTill` for scanning semantics A memory manager function to read into a vector till it satisfies the supplied function or hits an early stop condition like hitting the end of vector or reaching an unmapped region. This can be used to efficiently scan for values in GPU VA.	2022-04-14 14:14:52 +05:30
PixelyIon	31c4f1ca4e	Unlink `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when disabled When `VK_EXT_vertex_attribute_divisor` is not available, `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` is unlinked from the device enabled feature list as it is undefined behavior to link a structure provided by an extension without enabling that extension.	2022-04-14 14:14:52 +05:30
PixelyIon	032866c9b1	Allow Injecting External Vulkan Layers Set `com.android.graphics.injectLayers.enable` to allow injection of external Vulkan layers which is done by GPU debuggers such as RenderDoc.	2022-04-14 14:14:52 +05:30
PixelyIon	b97e06f617	Update Vulkan Validation Layer to 1.2.198.0 SDK release	2022-04-14 14:14:52 +05:30
PixelyIon	e8d92a6858	Update Shader Compiler Update to `2c295a067d`	2022-04-14 14:14:52 +05:30
PixelyIon	7df2670ece	Fix `QuirkManager`'s `EXT_SET_V` macro bug `EXT_SET_V` would enable the extension regardless of if it was actually the correct extension or if the version was high enough as long as the hash matched. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-04-14 14:14:52 +05:30
PixelyIon	e9ed771b48	Check for `supportsMultipleViewports` feature before usage If the host only supports a single viewport then we set `viewportCount` and `scissorCount` in `VkPipelineViewportStateCreateInfo` to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	3e45006d14	Make `shaderImageGatherExtended` a required `VkDevice` feature `shaderImageGatherExtended` is required by the shader compiler, to avoid complications associated with making it optional and considering that it's supported by the vast majority of Vulkan mobile devices, it was made a mandatory feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ece2785582	Introduce `ShaderManager` with Proxy Shader Compiler Logger/Settings This class will be entirely responsible for any interop with the shader compiler, it is also responsible for caching and compilation of shaders in itself.	2022-04-14 14:14:52 +05:30
PixelyIon	def9cedbee	Add yuzu Shader Compiler as a submodule We plan to use our fork of yuzu's shader compiler for GPU shader compilation so it's been added as a submodule.	2022-04-14 14:14:52 +05:30
PixelyIon	746af4cb4c	Add Sirit as a submodule We require Sirit as it is a dependency for yuzu's shader compiler where it uses it to emit SPIR-V in an easy and efficient manner.	2022-04-14 14:14:52 +05:30
PixelyIon	dbc94f36d3	Add Range v3 as a submodule We want to utilize features from C++ 20 ranges but they haven't been entirely implemented in libc++ so in the meantime we use the reference implementation for it which is Ranges v3.	2022-04-14 14:14:52 +05:30
PixelyIon	89e9a41a86	Implement `VkPipelineViewportStateCreateInfo` "Viewport Transforms" and "Viewport Scissors" were combined into one section to reflect their state in Vulkan correctly like all other sections.	2022-04-14 14:14:52 +05:30
PixelyIon	38119e21d4	Implement Vulkan-Supported Maxwell3D Primitive Topologies Any primitive topologies that are directly supported by Vulkan were implemented but the rest were not and will be implemented with conversions as they are used by applications, they are: * LineLoop * QuadList * QuadStrip * Polygon	2022-04-14 14:14:52 +05:30
PixelyIon	138f884159	Implement Maxwell3D Vertex Attributes Translates all Maxwell3D vertex attributes to Vulkan with the exception of `isConstant` which causes the vertex attribute to return a constant value `(0,0,0,X)` which was trivial in OpenGL with `glDisableVertexAttribArray` and `glVertexAttrib4(..., 0, 0, 0, 1)` but we don't have access to this in Vulkan and might need to depend on undefined behavior or manually emulate it in a shader. This'll be revisited in the future after checking host GPU behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	4b9f99bb27	Make `ENUM_STRING` function `static` `ENUM_STRING` can be used inside a `class`/`struct`/`union` for `enum`s contained within them. Making the function `static` allows doing this and doesn't require supplying a `this` pointer of the enclosing class for usage.	2022-04-14 14:14:52 +05:30
PixelyIon	c2a6da6431	Implement Maxwell3D Vertex Buffer Limit Sets the end of VBOs based on the `vertexArrayLimits` register array which provides an IOVA to the end of the VBO.	2022-04-14 14:14:52 +05:30
PixelyIon	d8890f13e1	Explicitly make `default` case `break` for `Maxwell3D::HandleMethod` Making this explicit removes any confusion that all cases would need to be implemented and explicitly defines that the CF should continue onto the 2nd switch-case when it cannot find any matches in the first one.	2022-04-14 14:14:52 +05:30
PixelyIon	612f324e78	Implement Maxwell3D Vertex Buffer Instance Rate Implements the `isVertexInputRatePerInstance` register array which controls if the vertex input rate is either per-vertex or per-instance. This works in conjunction with the vertex attribute divisor for per-instance repetition of attributes.	2022-04-14 14:14:52 +05:30
PixelyIon	476c070c7a	Fix Minor Maxwell3D Register Ordering Issues We order all registers in ascending order, a few registers namely `colorLogicOp`, `colorWriteMask`, `clearBuffers` and `depthBiasClamp` were erroneously not following this order which has now been fixed.	2022-04-14 14:14:52 +05:30
PixelyIon	32de7e5150	Use `decltype` over `typeof` globally We inconsistently utilized `typeof` and `decltype` all over the codebase, this has now been fixed by uniformly using `decltype` as `typeof` is a GCC extension and not in the C++ standard alongside having the hidden side effect of removing references from the determined type.	2022-04-14 14:14:52 +05:30
PixelyIon	841ee9fc15	Check for `vertexAttributeInstanceRateZeroDivisor` feature before usage Check for `vertexAttributeInstanceRateZeroDivisor` in `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when the Maxwell3D register corresponding to the vertex attribute divisor is set to 0. If it isn't then it logs a warning and sets the value anyway which could result in UB since the only alternative is an exception that stops emulation which might not be optimal if the game mostly works fine without this, we will add a user-facing warning when we intentionally allow UB like this in the future.	2022-04-14 14:14:52 +05:30
PixelyIon	c3895a8197	Support `VkPhysicalDeviceFeatures2` Extensions Implement the infrastructure to depend on `VkPhysicalDeviceFeatures2` extended feature structures which can be utilized to retrieve the specifics of features from extensions. It is implemented in the form of `vk::StructureChain` with `vk::PhysicalDeviceFeatures2` that can be extended with any extension feature structures.	2022-04-14 14:14:52 +05:30
PixelyIon	ff5515d4d1	Implement Maxwell3D Vertex Buffer Bindings This implements everything in Maxwell3D vertex buffer bindings including vertex attribute divisors which require the extension `VK_EXT_vertex_attribute_divisor` to emulate them correctly, this has been implemented in the form of a quirk. It is dynamically enabled/disabled based on if the host GPU supports it and a warning is provided when it is used by the guest but the host GPU doesn't support it.	2022-04-14 14:14:52 +05:30
PixelyIon	d163e4ffa6	Introduce `IOVA` union for flipped words of IOVAs in `GraphicsContext` The Maxwell3D `Address` class follows the big-endian register ordering for addresses while on the host we consume them in little-endian, the `IOVA` class is the host equivalent to the `Address` class with implicitly flipped 32-bit register ordering. It shares implicit decomposition semantics from `Address` due to similar requirements with a minor difference of being returned by reference rather than value as we want to have value setting semantics with implicit decomposition while we don't for `Address`.	2022-04-14 14:14:52 +05:30
PixelyIon	73646c4da8	Implicitly decompose `Address` into `u64` The semantics of implicitly decomposing the `Address` class into a `u64` were determined to be appropriate for the class. As it is an integer type this effectively retains all semantics from using an integer directly for the most part.	2022-04-14 14:14:52 +05:30
PixelyIon	48d0b41f16	Implement Maxwell3D Common/Independent Color Write Mask Maxwell3D supports both independent and common color write masks like color blending but for common color write masks rather than having register state specifically for it, the state from RT 0 is extended to all RTs. It should be noted that color write masks are included in blending state for Vulkan while being entirely independent from each other for Maxwell, it forces us to use the `independentBlend` feature even when we are doing common blending unless the color write mask is common as well but to simplify all this logic the feature was made required as it is supported by effectively all targeted devices.	2022-04-14 14:14:52 +05:30
PixelyIon	92809f8a78	Implement Maxwell3D Independent/Common Color Blending Maxwell3D supports independent blending which has different blending per-RT and common blending which has the same blending for all RTs. There is a register determining which mode to utilize and we simply have two arrays of `VkPipelineColorBlendAttachmentState` for the RTs that we toggle between to make the transition between them extremely cheap.	2022-04-14 14:14:52 +05:30
PixelyIon	2ceb6465e8	Make `independentBlend` a required `VkDevice` feature Independent blending is supported by effectively every Vulkan 1.1 Android GPU, it gives us the ability to architect Maxwell3D blending emulation better as we can avoid additional checks for independent blending state and having a fallback path for when the host doesn't support the feature.	2022-04-14 14:14:52 +05:30
PixelyIon	cd737fbdd8	Add Required `VkDevice` Features A prior commit added the ability to utilize features with quirks but this implements the ability to require a feature be present on the host or an exception will be thrown. It allows us to make useful assumptions that result in a better architecture in certain cases.	2022-04-14 14:14:52 +05:30
PixelyIon	081d3277c1	Enable Quirk + Required `VkDevice` Extensions Implements the infrastructure required to enable optional extensions set in `QuirkManager` alongside the required extensions in the `GPU` class. All extensions should be correctly resolved now and according to what the device supports.	2022-04-14 14:14:52 +05:30
PixelyIon	6099b1ead5	Fix Maxwell3D Register `lineWidthAliased` Offset The offset was incorrectly set to `0x4D` rather than `0x4ED` which is what it should be. This would've led to bugs in line width determination and likely broken any aliased line rendering entirely.	2022-04-14 14:14:52 +05:30
PixelyIon	51659e1329	Enable `VkDevice` Features Selectively We selectively enable GPU features that we require as enabling all of them might result in extra driver overhead in certain circumstances. Setting them is handled by `QuirkManager` with the new `FEAT_SET` function that ties a quirk with a feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ec378814aa	Stub Maxwell3D Alpha Testing We stub alpha testing as it doesn't exist in Vulkan and few titles use it, it can be emulated in the future using a shader patch with manually discarding fragments failing the alpha test function but this'll be added in later as it isn't high priority at the moment and has associated overhead with it so other options might be explored at the time.	2022-04-14 14:14:52 +05:30
PixelyIon	83ec99fa48	Print GPU Quirks At Startup It is essential to know what quirks a certain GPU may have to debug an issue, these are now printed at startup into the log alongside all other GPU information. A new `QuirkManager::Summary` function was implemented to provide this functionality.	2022-04-14 14:14:52 +05:30
PixelyIon	01eb16e59a	Implement Maxwell3D Color Logic Operations Implements a basic part of Vulkan blending state which are color logic operations applied on the framebuffer after running the fragment shader. It is an optional feature in Vulkan and not supported on any mobile GPU vendor aside from ImgTec/NVIDIA by default.	2022-04-14 14:14:52 +05:30
PixelyIon	662935c35d	Attempt Flushing `Logger` During Fatal Signals Any signals that lead to exception handling being triggered now attempt to flush all logs given that the log mutex is unoccupied, this is to mostly help logs be more complete when exiting isn't graceful.	2022-04-14 14:14:52 +05:30
PixelyIon	586bee2c59	Remove Maxwell3D Zero Initialization Calls A lot of calls in Maxwell3D register initialization ended up setting the register to 0 which should be implicit behavior and most calls would be eliminated by the redundancy check which had to be manually disabled. It was determined to be better to move this responsibility to the called function to initialize to state equivalent to the corresponding register being 0. All initialization calls with the argument as 0 have been removed now due to this, it was the vast majority of calls.	2022-04-14 14:14:52 +05:30
PixelyIon	49cc0964e2	Initialize Maxwell3D Registers Correctly Maxwell3D Registers weren't initialized to the correct values prior, this commit fixes that by doing `HandleMethod` calls with all the register values being initialized. This is in contrast to the registers being set without calling the methods in `GraphicsContext` or otherwise resulting in bugs.	2022-04-14 14:14:52 +05:30
PixelyIon	ea87b8878c	Implement `B8G8R8A8{Unorm/Srgb}` RT Format Needed by Maxwell3D Register Initialization	2022-04-14 14:14:52 +05:30
PixelyIon	69e7d8b574	Remove Vulkan to Skyline Format Conversion Function The function `GetFormat` was seemingly no longer required due to us never converting from a Vulkan format to a Skyline format, most conversions only went from Skyline to Vulkan and were generally lossy due to certain formats being missing in Vulkan and approximated using channel swizzles. As a result of this, it was pointless to maintain and has now been removed.	2022-04-14 14:14:52 +05:30
PixelyIon	5ed26fef23	Implement Maxwell3D Rasterizer State Maxwell3D registers relevant to the Vulkan Rasterizer state have been implemented aside from certain features such as per-face polygon modes that cannot be implemented due to Vulkan limitations. A quirk was utilized to dynamically support the provoking vertex being the last vertex as opposed to the first as well.	2022-04-14 14:14:52 +05:30
PixelyIon	8ef225a37d	Introduce `QuirkManager` for runtime GPU quirk tracking We require a way to track certain host GPU features that are optional such as Vulkan extensions, this is what the `QuirkManager` class does as it checks for all quirks and holds them allowing other components to branch based off these quirks.	2022-04-14 14:14:52 +05:30
PixelyIon	8803616673	Reorder `GraphicsContext` Members All members are now placed at the start of sections they are relevant to rather than at the start of the class.	2022-04-14 14:14:52 +05:30
PixelyIon	107d337d77	Fix `MacroInterpreter::MethodAddress` Bitfield Padding Due to compiler alignment issues, the bitfield member `increment` of `MacroInterpreter::MethodAddress` was mistakenly padded and moved to the next byte. This has now been fixed by making its type `u16` like the member prior to it to prevent natural alignment from kicking in.	2022-04-14 14:14:52 +05:30
PixelyIon	26966287c7	Implement Maxwell3D Shader Program Registers This commit added basic shader program registers, they simply track the address a shader is pointed to at the moment. No parsing of the shader program is done within them.	2022-04-14 14:14:52 +05:30
PixelyIon	93ea919c8f	Fix warnings from `NVRESULT` due to unused lambda capture A previously used `this` capture is no longer used since the introduction of the static logger.	2022-04-14 14:14:52 +05:30
lynxnb	882b939335	Automatically import key files from search location	2022-03-25 09:40:21 +05:30
Robin Kertels	6cf2ef8fb9	Grant Uri permission when sharing logs	2022-03-24 03:39:41 +05:30
lynxnb	092dcb18c8	Stub `ectx:w` and `ectx:aw` Glue services	2022-02-06 21:57:38 +05:30
Robin Kertels	2993f65a1c	Ignore empty lines in key files	2022-02-04 23:51:46 +05:30
MCredstoner2004	0ceecbba4f	Implement fixed aspect ratio surface Adds a setting to letterbox the surface with black bars to 16:9 or 21:9 aspect ratio to avoid stretching out the rendered image	2022-01-12 22:09:39 +05:30
lynxnb	6913a90361	Use the new log file name & ext for every logger context	2021-11-11 16:32:19 +01:00
lynxnb	5cd1f01690	Refactor all logger calls	2021-11-11 16:13:24 +01:00
lynxnb	769e6c933d	Make `Logger` class static and introduce `LoggerContext` A thread local LoggerContext is now used to hold the output file stream instead of the `Logger` class. Before doing any logging operations, a LoggerContext must be initialized. This commit will not build successfully on purpose.	2021-11-11 16:13:24 +01:00
PixelyIon	69ef93bfa8	Add Dividers to `ControllerActivity`'s `RecyclerView` Dividers after titles were missing in `ControllerActivity` which made it look inconsistent with `SettingsActivity` which did have them. They have now been added by extending `DividerItemDecoration` to be drawn before any `ControllerHeaderItem`.	2021-11-11 20:20:19 +05:30
PixelyIon	b230afcd35	Fix FAB Icon color in `OnScreenEditActivity` The icons in these FABs had the same color as the FAB prior which led them to being invisible. This has been fixed by setting a white tint on them which makes the icons clearly visible.	2021-11-11 18:59:09 +05:30
PixelyIon	e4fbee1626	Style Improvements to `LicenseDialog` Additional padding has been added to the text alongside making it be left-aligned rather than center-aligned and justified. A newline has also been added to the copyright notice for Skyline to make it look nicer.	2021-11-11 16:35:30 +05:30
PixelyIon	36a1f2a2ec	Make all `Dialog`s use `@color/backgroundColor` as the background color We wanted the color of the modals used by the dialogs to be the same as our regular background color rather than a lighter grey. This has now been enforced with style attributes in the case of `AlertDialog` and `setBackground` in the case of `BottomSheetDialog`.	2021-11-11 16:34:38 +05:30
PixelyIon	3f3891839e	Make all `AlertDialog`s use `MaterialComponents` theme We inconsistently used `AppCompat`'s `AlertDialog` theme in Settings while using `MaterialComponents`'s theme in Controller Configuration. This has now been fixed by universally using the `MaterialComponents` theme.	2021-11-11 16:28:57 +05:30
PixelyIon	ff5887975b	Remove Logo from `MainActivity` The Skyline logo was added to the title area but it ended up being too distracting with a light theme as the logo was designed purely for a white background. Ultimately, even though it looked good with the dark theme we had to remove it.	2021-11-11 13:58:25 +05:30
PixelyIon	43b9d95dbc	Rework App Dialog Buttons Aligning the buttons to the bottom of the game image was determined to look odd due to the amount of padding between the title and buttons. They are now back to being below the title but the buttons have been resized with "Play" being a wide button while "Pin" has been replaced with Google Material Icons's "Add To Home Screen" icon and sized down to an icon-only button.	2021-11-11 13:47:20 +05:30
lynxnb	6a0ad25c27	Update `BottomSheetDialog` appearance * Fix incorrect dialog surface color * Adjust buttons' position	2021-11-10 17:41:07 +01:00
lynxnb	df120ab76d	Fix unicode characters being turned into emojis for some vendors	2021-11-10 17:41:07 +01:00
lynxnb	b7b0f07ba8	Update application branding - Logo is now displayed next to the app name - Remove search bar animation - New color accent - Improve visibility of controller binding setting's glyphs	2021-11-10 17:41:07 +01:00
lynxnb	827439d2d1	Use the new font for the application name in main activity	2021-11-10 17:41:07 +01:00
lynxnb	07f899e904	Update application icon	2021-11-10 17:41:07 +01:00
Billy Laws	d88b08d986	Address PR feedback	2021-11-10 21:35:36 +05:30
Billy Laws	1b453c04ca	Remove completed nvmap TODO Pins have been implemented so the to-do is no longer needed.	2021-11-10 21:35:36 +05:30
Billy Laws	d2d181725f	Remove unused virtEnd variable in FlatMemoryManager::{Read, Write}	2021-11-10 21:35:36 +05:30
Billy Laws	60fbfad4bc	Add virtual dtors to time service code	2021-11-10 21:35:36 +05:30
Billy Laws	ef10d3d394	Use semantic wrapping where appropriate for class initialiser lists	2021-11-10 21:35:36 +05:30
Billy Laws	6b33268d85	Remove unused gm20b EngineID enum	2021-11-10 21:35:36 +05:30
Billy Laws	73896c2e6b	Fixup nvdrv channel types to follow naming conventions	2021-11-10 21:35:36 +05:30
Billy Laws	ad900aba7a	s/Host1X/Host1x/ as per Nvidia naming	2021-11-10 21:35:36 +05:30
Billy Laws	dbfb1cfe20	Fully implement the nvdrv Host1xChannel::Submit operation This pushes a set of command buffers into the Host1x command FIFO allocated for the channel, returning fence thresholds that can be waited on for completion.	2021-11-10 21:35:36 +05:30
Billy Laws	baefb0fe93	Implement the Host1x command FIFO together with barebones Host1x classes The Host1x block of the TX1 supports 14 separate channels to which commands can be issued; these all run asynchronously so are emulated the same way as GPU channels with one FIFO emulation thread each. The command FIFO itself is very similar to the GPFIFO found in the GPU, however there are some differences, mainly the introduction of classes (similar to engines) and the Mask opcode (which allows writing to a specific set of offsets much more efficiently). There is an internal Host1x class which functions similarly to the GPFIFO class in the GPU, handling general operations such as syncpoint waits; this is accessed via the simple method interface. Other channels such as NVDEC and VIC are behind the 'Tegra Host Interface' (THI) in HW; this abstracts out the classes' internal details and provides a uniform method interface on top of the Host1x method one. We emulate the THI as a templated wrapper for the underlying class. Syncpoint increments in Host1x are different to GPU: the THI allows submitting increment requests that will be queued up and only be applied after a specific condition in the associated engine is met; however the option for immediate increments is also available.	2021-11-10 21:35:36 +05:30
Billy Laws	2494cafee8	Cleanup GPFIFO comments and make Run() private	2021-11-10 21:35:36 +05:30
Billy Laws	2577658fc7	Avoid GetPointer on nvmap handles where they would be accessed via SMMU GetPointer sets the sharedMemMapped flag, which should only be set if other userspace processes have the handle mapped.	2021-11-10 21:35:36 +05:30
Billy Laws	fd0420443c	Add template utils for constructing all elements in an std::array This avoids the excessive repetition needed for the case where array members have no default constructor. eg: ```c++ std::array<Type, 10> channels{util::MakeFilledArray<Type, 10>(typeConstructorArg, <...>)}; ```	2021-11-10 21:35:36 +05:30
Billy Laws	34bf413661	Fix bitmask check for event IDs > 32 in Ctrl::SyncpointFreeEventBatch Doing 1 << 32 would result in an integer overflow rather than the desired behaviour of checking a bit; make the 1 a 64-bit value to prevent that.	2021-11-10 21:34:30 +05:30
Billy Laws	debab7c9c7	Implement nvmap handle pinning/unpinning nvmap allows mapping handles into the SMMU address space through 'pins'. These are refcounted mappings that are lazily freed when required due to AS space running out. Nvidia implements this by using several MRU lists of handles in order to choose which ones to free and which ones to keep, however as we are unlikely to even run into the case of not having enough address space a naive queue approach works fine. This pin infrastructure is used by nvdrv's host1x channel implementation in order to handle mapping of both command and data buffers for submit.	2021-11-10 21:34:30 +05:30
Billy Laws	a0c57256cc	Hookup FlatMemoryManager for SMMU into SoC The SMMU is used to control the mappings of peripherals such as the VIC and NVDEC.	2021-11-10 21:34:30 +05:30
Billy Laws	97dc053ffd	Move FlatAllocator allocation error handling to the caller This is a prerequisite for nvmap SMMU memory management, which only frees the memory that handles refer to if an allocation fails.	2021-11-10 21:34:30 +05:30
Billy Laws	04e5237ec1	Stub host1x channel devices and IOCTLs host1x channels are generally similar to GPU channels however there is only one channel for each specific class (like a GPU engine) and an address space is shared between them all. This PR implements the simple IOCTLs with the larger ones that will depend on changes outside of nvdrv being left for future commits. This is enough to partly run oss-nvjpeg.	2021-11-10 21:34:30 +05:30
Billy Laws	5087d3dc2a	Reserve host1x channel syncpoints	2021-11-10 21:34:30 +05:30
Billy Laws	b0a5dab0f7	Support passing additional constructor arguments to nvdrv devices Needed for host1x channels which use the same class for multiple types.	2021-11-10 21:34:30 +05:30
Billy Laws	eb6f052873	Fixup GpuChannel::SetNvmapFd to take an FD rather than an nvmap handle	2021-11-10 21:34:30 +05:30
Billy Laws	386a3447a8	Introduce variable sized span support to nvdrv deserialisation The element containing the size first needs to be saved to a save slot with Save<T, slotId>, it can then be read back later as the size of a span with SlotSizeSpan<T, slotId>. This is needed to support the Host1XChannel Submit IOCTL.	2021-11-10 21:34:30 +05:30
Billy Laws	6eeaa343f8	Avoid crash when passing unallocated syncpoint IDs to EventWait	2021-11-10 21:34:30 +05:30
PixelyIon	fbfad21f03	Migrate `Maxwell3D::Registers` to `OffsetMember` Maxwell3D registers were primarily written with absolute offsets and ended up being fairly messy due to attempting to emulate this using struct relative positioning resulting in a lot of pointless padding members. This has now been improved by utilizing `OffsetMember` to directly use offsets resulting in much neater code.	2021-11-10 21:31:45 +05:30
PixelyIon	69761389ff	Extend `OffsetMember` with direct `operator=`/`operator[]` It was found to be semantically advantageous to directly pass-through certain operators such as the assignment (`=`) and array index (`[]`) operators. These would require a dereference prior to their usage otherwise but now can be directly used.	2021-11-10 21:30:02 +05:30
Billy Laws	cc7b2a0ab8	Introduce `OffsetMember` for offset-based union members The offset of a member in a structure was determined by its relative position and compiler alignment. This is suboptimal for larger structures such as those found in GPU Engines that are primarily named by offset rather than relative positioning and end up requiring a massive amount of padding to function as is. A solution to this problem was simply to supply member offsets directly which can be done by using `OffsetMember` alongside a `union`.	2021-11-10 21:17:31 +05:30
PixelyIon	fb476567ff	Introduce `JniString` as C++ wrapper over `jstring` We used to manually call JNI UTF-8 string allocation and deallocation functions when utilizing a `jstring` but this could be erroneous and is just inconvenient. All of this has now been consolidated into a class `JniString` which is a wrapper around `std::string` and creates a copy of the contents of the `jstring` in its constructor and passes them into the `std::string` constructor.	2021-11-09 21:18:47 +05:30
PixelyIon	79ceb2cf23	Improve Vulkan `Texture` Synchronization The Vulkan Pipeline Barriers were suboptimal and incorrect to some degree prior as we purely synchronized images and not staging buffers. This has now been fixed and improved in general with more relevant synchronization.	2021-11-09 21:08:03 +05:30
PixelyIon	bed9fbf5e7	Fix `EmulationActivity.vibrateDevice` assert due to `null` Vibrator `EmulationActivity.vibrateDevice` would assert when a `null` Vibrator is provided due to one not being set in the controller configuration. This has now been fixed by the code not playing a vibration when a vibration device isn't selected.	2021-11-01 00:28:11 +05:30
PixelyIon	414c0104c3	Rework Joy-Con Vibration Conversion The guest -> host vibration conversion code was entirely broken as it didn't set the vibration `start`/`end` timestamps correctly for a cycle nor did it subtract from the `totalAmplitude` (`currentAmplitude` now) when a cycle ended due to an incorrect `if` statement and contents. It would just end up saturating the amplitude as much as possible by adding more and more to `totalAmplitude` on every cycle while never subtracting which is entirely wrong and led to a very noticeable drop in amplitude when a vibration was repeated. It's been entirely reworked to fix all the issues listed above and remove a lot of code that had no understandable purpose. More comments have also been added to the code to make it more readable with better variable and function naming alongside it.	2021-11-01 00:28:11 +05:30
PixelyIon	96027f0f09	Build libraries with `-Ofast` for debug builds To offset some of the performance overhead of using debug builds, we now optimize all libraries using `-Ofast` while building Skyline itself with `-O0`.	2021-10-31 16:05:08 +05:30
PixelyIon	4b80e1f91c	Use libcxx from LLVM Project submodule The version of libcxx shipped with Android NDK is fairly outdated and doesn't contain several features we desire such as C++ 20 ranges. This has been fixed by using libcxx directly from the LLVM Project which has been added as a submodule and can be updated independently of NDK.	2021-10-31 16:04:44 +05:30
PixelyIon	82154f3ef6	Upgrade AGP to `7.1.0-beta01` & NDK to `24.0.7856742` We've moved to using a beta AGP as `7.0.2` breaks `clangd` and other C++ features on Beta/Canary Android Studio. NDK was additionally updated with `mbedtls` to fix warnings caused by it alongside some other minor fixes to code for newer versions of libcxx. The new AGP has a bug where it does not look for executables specified in `android_gradle_build.json` in `PATH` that includes `ninja` which is provided by the `ninja-build` package on the system rather than Android SDK's CMake on GitHub Actions (Ubuntu 20.04). This has been fixed by symlinking `/usr/bin/ninja` to the project root which is searched in for the `ninja` executable.	2021-10-31 15:50:15 +05:30
PixelyIon	962d8dc4c8	Return immediately for non-joining `KProcess::Kill`s when already killed Locking `KProcess::threadMutex` when a process is being killed by another thread with `join` can lead to the non-joining killer effectively joining as it's waiting on the joining killer to relinquish the mutex. This has been fixed by having an atomic boolean tracking if the process has already been killed and if it has, immediately returning prior to locking the mutex for any non-joining killers.	2021-10-31 15:45:10 +05:30
Billy Laws	6f59cba68d	Adds bounds checks to resampler to avoid OOB reads Resampling would sometimes perform an OOB read into `inputBuffer` due to it not containing enough data to calculate the corresponding output sample; this has been fixed by introducing bounds checking to ensure that the buffer has enough data.	2021-10-29 21:46:51 +05:30
PixelyIon	9e3b7a75b2	Use `finishAffinity` instead of `finishAndRemoveTask` The method used to finish (`finishAndRemoveTask`) an activity prior to going back to `MainActivity` or restarting the process led to the process prematurely exiting entirely and would result in it not being restarted or another activity not being launched. This has now been fixed by utilizing `finishAffinity` in its place which correctly only ends the activities with the same affinity as the caller.	2021-10-29 21:20:04 +05:30
PixelyIon	9f5ab13858	Implement `R16G16{Unorm/Sint/Uint}` RT Formats Utilized in "The Touryst"	2021-10-29 20:21:58 +05:30
PixelyIon	afebf77544	Zero-Initialize `GuestTexture` Members Members of `GuestTexture` were apparently not being initialized and this led to UB since they would be read as random values. Titles such as Super Mario Odyssey avoided setting `baseArrayLayer` which led to it being left at the default value which was completely random and this would lead to crashes. This commit fixes this by initializing said values correctly.	2021-10-27 22:49:45 +05:30
PixelyIon	a0921f8261	Implement `R16G16B16A16Snorm`/`R16G16B16A16Sint` RT Formats Utilized in "The Touryst"	2021-10-27 22:49:45 +05:30
PixelyIon	df64ff5d14	Zero-Fill `IAudioRenderer::RequestUpdate` Output Buffer Some titles don't clear the output buffer prior to submission as the service is expected to fill all of it in. Our audren implementation is incomplete and doesn't end up doing this, leaving the contents of the buffer undefined and leading to UB in the form of SEGFAULTs or the application throwing a fatal error. This has been patched over by 0-filling the buffer which is a sane default value for the fields that aren't filled in albeit not a replacement for a proper audren implementation.	2021-10-27 16:18:20 +05:30
PixelyIon	1f3519e6e3	Fix Logger Message OOB Access Certain titles can submit logs where the last field is one off by the buffer end, the logger loop now considers this and terminates if there isn't enough data left to read the field type and length.	2021-10-26 21:59:47 +05:30
PixelyIon	645183c903	Fix OOB Vibration Array Access in `VibrateDevice` Access to the `vibrations` field in `vibrations[3].period` could lead to UB, this has been replaced with a proper check which adds up the period over all vibrations instead. A minor cleanup with variable names and explicit types for integer arithmetic has also been done.	2021-10-26 21:57:28 +05:30
PixelyIon	cfdb2abf9e	Fix Non-`builtin` Uncached `Vibrator` Getter If a non-builtin vibrator was attempted to be fetched, it'd insert it in the vibrator cache and return directly as opposed to playing the vibration on it prior to returning. This has now been fixed, the value is both put into the cache and the vibration is played on it.	2021-10-26 21:54:33 +05:30
PixelyIon	dc3f7f1ab4	Fix Incorrect Scissor Extent The decomposition from `texture::Dimension` to `vk::Rect2D` was somehow implicit and completely incorrect resulting in wrong conversion with undefined values. It's now been fixed by explicitly setting `vk::Rect2D::extent` to `scissor` specifically.	2021-10-26 20:08:53 +05:30
PixelyIon	a60f238479	Fix `GPU::DebugCallback` Type String Extraction The second parameter of `std::string_view::substr` was assumed to be an end position (similar to `std::span`) rather than `count` which it is. As a result of this, it was entirely broken but only held together by a constant factor being subtracted from it which was derived by trial and error. It's now been fixed by returning a count rather than the absolute position.	2021-10-26 20:08:35 +05:30
PixelyIon	10ed5bf418	Silence errors from libraries Library headers would produce errors that are out of our control and as a result of that, we just want to ignore this. This is possible by including the offending headers as system headers, compilers don't emit any warnings arising from them. This was extended to all libraries rather than just those which currently emitted warnings for consistency's sake.	2021-10-26 20:08:18 +05:30
Billy Laws	70d1b4994c	Enable Wconversion and fix warnings produced	2021-10-26 11:41:24 +01:00
PixelyIon	315d2dc26c	Update NDK to 23.1.7779620	2021-10-26 10:46:36 +05:30
PixelyIon	661396fc97	Resume `EmulationActivity` when launcher icon is used Fix a bug where attempting to launch Skyline from the launcher while emulation was in progress would result in `MainActivity` launching rather than `EmulationActivity`. We always want `EmulationActivity` to stay on top of the stack and be launched whenever Skyline is resumed, no other activity should be able to run in parallel to `EmulationActivity` in any user-accessible manner.	2021-10-26 10:46:36 +05:30
PixelyIon	39e924aec8	Resume rather than relaunch when same shortcut is used	2021-10-26 10:46:36 +05:30
Billy Laws	1e7347bf72	Use semantic wrapping for nvdrv where appropriate	2021-10-26 10:46:36 +05:30
PixelyIon	830a800d9e	Consolidate `AddAttachment` Loops + Rename `Renderpass` -> `RenderPass`	2021-10-26 10:46:36 +05:30
PixelyIon	92a21ea616	Cleanup & Use C++ Concepts in `utils.h`	2021-10-26 10:46:36 +05:30
PixelyIon	ea2626bcc6	Address CR Comments	2021-10-26 10:46:36 +05:30
PixelyIon	1d532628cb	Null-Check Optional NACP before extracting application title Not doing this could lead to the NACP being filled with invalid data, which led to crashes on homebrew titles like SpaceNX.	2021-10-26 10:46:36 +05:30
PixelyIon	c8821c7313	Update `nvdrv` perms to 11.0.0+ & Implement `nvdrv:a` service `nvdrv:a` (For Applets) is used by some older homebrew such as SpaceNX which don't fall back to `nvdrv` (For Applications).	2021-10-26 10:46:36 +05:30
PixelyIon	3b4bbd2b38	Switch to using exceptions for guest exiting Guest-driven exiting could cause objects to be left on the heap due to a `std::longjmp` from high up in the host call stack; this has been fixed by introducing `ExitException` which implicitly unrolls the stack with the exception handling mechanism.	2021-10-26 10:46:36 +05:30
PixelyIon	eff5711c49	Split monolithic `common.h` into smaller chunks * Resolves dependency cycles in some components * Allows for easier navigation of certain components like `span` which were especially large * Some imports have been moved from `common.h` into their own files due to their infrequency	2021-10-26 10:46:36 +05:30
PixelyIon	1d57bab08f	Revamp `LicenseDialog` + Update Licenses + Stop Bintray Usage * Update licenses for dependent projects * Add copyright notices (as provided) * Revamp styling for `LicenseDialog` * Fix invisible `PreferenceDialog` buttons in Settings * Consolidating color variables into `colorPrimary`, `backgroundColor` and `backgroundColorVariant` * Use `com.google.android.flexbox:flexbox:3.0.0` (Google Maven) rather than `com.google.android:flexbox:2.0.1` (Bintray)	2021-10-26 10:46:36 +05:30
PixelyIon	bbf28d1942	Improve Clean Exit + Audio Pausing + Improve System Language Setting * Clean Exiting was improved by implementing a robust system for when to abandon clean exiting and simply restart the process alongside moving clean exiting to the background when the application is quit by using the back button * Audio is now automatically paused whenever the application is moved to the background and automatically resumed when it's brought to the foreground * The system language setting had several errors and inconsistencies which have now been fixed, it's been brought more in line with HOS language (Albeit not entirely due to no region setting in Skyline) * Fix a bug with `ThreadLocal` where the atomic `list` pointer was uninitialized resulting in a `SEGFAULT` during the destructor	2021-10-26 10:46:36 +05:30
PixelyIon	a7548c79a0	Android 12 Support + Update Libraries + Include Khronos Validation Layer * Fix handling `SA_EXPOSE_TAGBITS` bit being set in Android 12 `sigaction` * Fix CMake bug using `CMAKE_INTERPROCEDURAL_OPTIMIZATION_RELEASE` when not supported causing `-fuse-ld=gold` to be emitted as a linker flag * Support using `VIBRATOR_MANAGER_SERVICE` rather than `VIBRATOR_SERVICE` on Android 12 * Optimize Imports for Kotlin code * Move away from deprecated APIs in Kotlin or explicitly mark where it's not possible * Update SDK, NDK and libraries * Enable Gradle Configuration Cache	2021-10-26 10:46:36 +05:30
Billy Laws	b7d0f2fafa	Implement support for pushbuffer methods split across multiple GpEntries These are used heavily in OpenGL games, which now, together with the previous syncpoint changes, work perfectly. The actual implementation is rather novel as rather than using a per-class state machine for all methods we only use it for those that are known to be split across GpEntry boundaries, as a result only a single bounds check is added to the hot path of contiguous method execution and the performance loss is negligible.	2021-10-16 12:13:30 +01:00
Billy Laws	fc017e1e95	Implement pre-wait and post-increment syncpoint operations in submit These are used by both OpenGL and Vulkan games as opposed to including the operations inside the main commandbuffer.	2021-10-16 12:13:30 +01:00
PixelyIon	9b9bf8d300	Introduce `ThreadLocal` Class + Fix Several GPU Bugs * Fix `AddClearColorSubpass` bug where it would not generate a `VkCmdNextSubpass` when an attachment clear was utilized * Fix `AddSubpass` bug where the Depth Stencil texture would not be synced * Respect `VkCommandPool` external synchronization requirements by making it thread-local with a custom RAII wrapper * Fix linear RT width calculation as it's provided in terms of bytes rather than format units * Fix `AllocateStagingBuffer` bug where it would not supply `eTransferDst` as a usage flag * Fix `AllocateMappedImage` where `VkMemoryPropertyFlags` were not respected resulting in non-`eHostVisible` memory being utilized * Change feature requirement in `AndroidManifest.xml` to Vulkan 1.1 from OGL 3.1 as this was incorrect	2021-10-16 12:13:30 +01:00
Billy Laws	eb25f60033	Implement multichannel support for GPU Allows the execution of multiple channels at the same time, with locking being performed on the host GPU scheduler layer, address spaces can be bound to one or more channels.	2021-10-16 12:13:30 +01:00
PixelyIon	b762d1df23	Introduce Texture Always Sync + Wait on GPU Execution + More RT Formats Infrastructure for always syncing textures has been introduced now, they will be synced prior to and after every execution. This does considerably reduce the performance alongside waiting on GPU execution to finish but it will be partially recouped once conditional syncing is performed.	2021-10-16 12:13:30 +01:00
PixelyIon	f8acc1e131	Improve Shared Fonts + Fix AM `PopLaunchParameter` & Choreographer Bug * Move Shared Font TTFs to AAsset storage + Support external shared font loading from `/data/data/skyline.emu/data/fonts` * Fix bug in `IApplicationFunctions::PopLaunchParameter` caused by ignoring `LaunchParameterKind` * Fix bug with Choreographer causing it to be awoken and exit prior to the destruction of `PresentationEngine` * Fix bug with `IDirectory::Read` where it used `inputBuf` for the output buffer rather than `outputBuf` * Improve `GetFunctionStackTrace` logs when `dli_sname` or `dli_fname` are missing * Support more RT Formats	2021-10-16 12:13:30 +01:00
PixelyIon	95a08627e5	Subpass Support + More RT Formats + Fix `FenceCycle` Cyclic Dependencies Support for subpasses was added by reworking attachment reuse code to account for preserved attachments and subpass dependencies. A lot of RT formats were also added to allow SMO to boot up entirely, it should be noted that it doesn't render anything. `FenceCycle` had a cyclic dependency which broke clean exit, we now utilize `std::weak_ptr<FenceCycle>` inside the `Texture` object. A minor fix for broken stack traces was also made caused by supplying a `nullptr` C-string to libfmt when a symbol was unresolved which caused an `abort` due to invocation of `strlen` with it.	2021-10-16 12:13:30 +01:00
PixelyIon	239d2625e2	Introduce `CommandExecutor` + Implement `ClearBuffers` + More RT Formats This commit introduces the `CommandExecutor` which is responsible for creating and orchestrating a Vulkan command graph comprising `CommandNode`s that construct all the objects required for rendering. As a result of the infrastructure provided by `CommandExecutor`, `ClearBuffers` could be implemented and be appropriately utilized. A bug regarding scissors was also found and fixed in the PR: their extent was previously inaccurate. Note: We don't synchronize any textures from the guest for now as this would override the contents on the host, this'll be fixed with the appropriate write tracking but it also results in a black screen for anything that writes to FB	2021-10-05 01:13:22 +05:30
PixelyIon	3879d573d5	Fix Command Buffer Allocation & `FenceCycle` This commit fixes a major issue with command buffer allocation which would result in only being able to utilize a command buffer slot on the 2nd attempt to use it after it's freed, this would lead to a significantly larger amount of command buffers being created than necessary. It also fixes an issue with the command buffers not being reset after they were utilized which results in UB eventually. Another issue was fixed with `FenceCycle` where all dependencies are only destroyed on destruction of the `FenceCycle` itself rather than the function where the `VkFence` was found to be signalled.	2021-10-05 01:13:22 +05:30
PixelyIon	bee28aaf0d	Validation Layer Filter + Fix `Texture`, GPU & `PresentationEngine` bugs This commit implements a filter by type for any validation layer output, this allows filtering out any logs which may be unnecessary and additionally triggering a breakpoint as required. An issue concerning the `NDEBUG` flag never being set was fixed, it's now supplied as a release compiler flag. The issue can manifest itself by always relying on a validation layer even though it shouldn't on release, this is why the validation layer was mistakenly disabled entirely previously by using `#ifndef` rather than `#ifdef`. An issue with the initial layout of a texture being supplied as neither `VK_IMAGE_LAYOUT_UNDEFINED` nor `VK_IMAGE_LAYOUT_PREINITIALIZED` was fixed, these cases are now handled by transitioning to those layouts after creating the image rather than supplying it within `initialLayout`. Another issue regarding not maintaining a transformation after a surface has been destroyed and recreated was fixed; it manifested itself when the user would go out of the app and come back in, where they would see the surface having an identity transformation rather than the desired one.	2021-10-05 01:13:22 +05:30
PixelyIon	54908afc44	Texture GMMU Address Resolution + Refactor `Maxwell3D::CallMethod` Fixes bugs with the Texture Manager lookup, fixes `RenderTarget` address extraction (`low`/`high` were flipped prior), refactors `Maxwell3D::CallMethod` to utilize a case for the register being modified + preventing redundant method calls when no new value is being written to the register, and fixes the behavior of shadow RAM which was broken previously and would lead to incorrect arguments being utilized for methods.	2021-10-05 01:13:22 +05:30
PixelyIon	270f2db1d2	Initial Texture Manager Implementation + Maxwell3D Render Target Implement the groundwork for the texture manager to be able to report basic overlaps and be extended to support more in the future. The Maxwell3D registers `RenderTargetControl`, `RenderTarget` and a stub for `ClearBuffers` were implemented. A lot of changes were also made to `GuestTexture`/`Texture` for supporting mipmapping and multiple array layers alongside significant architectural changes to `GuestTexture` effectively disconnecting it from `Texture` with it no longer being a parent rather an object that can be used to create a `Texture` object. Note: Support for fragmented CPU mappings hasn't been added for texture synchronization yet	2021-10-05 01:13:22 +05:30
PixelyIon	8cba1edf6d	Introduce Boost as a submodule + Minor Fixes Utilize Boost Container's `small_vector` for optimizing allocations, fix certain implicit casting issues and make `ILogger` not output an additional newline in the log when the application supplies one at the end of the log	2021-10-05 01:13:22 +05:30
PixelyIon	190fde110f	Introduce `GraphicContext` and Implement Viewport Transform + Scissors	2021-10-05 01:13:22 +05:30
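
The bitmask fix in commit 34bf413661 listed above (shifting a plain `1` by 32 or more bits) can be illustrated with a minimal sketch. `IsBitSet` is a hypothetical helper written for this note and is not taken from the Skyline source; it only assumes that the event mask is a 64-bit value.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the Skyline source): reports whether bit
// `index` is set in a 64-bit mask.
inline bool IsBitSet(std::uint64_t mask, unsigned index) {
    assert(index < 64); // Shifting by 64 or more would be undefined as well

    // Writing `1 << index` shifts a 32-bit `int`, which is undefined behaviour
    // once index reaches 32. Widening the literal to 64 bits before shifting
    // keeps the shift well defined for every index in [0, 63].
    return (mask & (std::uint64_t{1} << index)) != 0;
}
```

With the widened literal, a check such as `IsBitSet(mask, 40)` inspects bit 40 of the mask instead of relying on whatever a 32-bit shift happens to produce.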


1262 Commits
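
The `std::string_view::substr` mix-up described in commit a60f238479 above (passing an end position where a count is expected) can be shown with a short sketch. `Slice` is a hypothetical helper written for this note, not the code from `GPU::DebugCallback`; it converts a half-open `[begin, end)` range into the `(pos, count)` pair that `substr` takes.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>

// Hypothetical helper (not from the Skyline source): returns the characters
// of `text` in the half-open range [begin, end).
inline std::string_view Slice(std::string_view text, std::size_t begin, std::size_t end) {
    assert(begin <= end && end <= text.size());

    // substr takes (pos, count) rather than (pos, end), so the end position
    // has to be converted into a length before the call.
    return text.substr(begin, end - begin);
}
```

For instance, `Slice("abcdef", 2, 4)` yields `"cd"`, whereas calling `substr(2, 4)` directly on the same text yields `"cdef"` because the second argument is treated as a length.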