skyline

mirror of https://github.com/skyline-emu/skyline.git synced 2025-01-23 18:31:13 +01:00

Author	SHA1	Message	Date
PixelyIon	4f6a67af36	Fix `Texture` Trap Data Race The trap callbacks did not wait on the `Texture` to complete synchronization to the guest, this resulted in races where the contents written to the texture would be overwritten by the synced content. This commit fixes that by waiting on the fences at the end of the trap callback.	2022-08-06 22:20:54 +05:30
PixelyIon	cb7c3602e7	Attach `TextureView` to `FenceCycle` The lifetime of `TextureView` objects wasn't correctly managed as they weren't being attached the the `FenceCycle` in `AttachTexture`, this led to them getting deleted and causing all sorts of UB.	2022-08-06 22:20:54 +05:30
PixelyIon	ffaefc82d3	Call all flush callbacks prior to `CommandExecutor` submission The flush callbacks inside `CommandExecutor` weren't being called prior to submission as they should've been, this fixes that by calling them. It additionally removes the requirement to manually flush Maxwell3D at the end of `ChannelGpfifo` pushbuffers as it's a flush callback and will automatically be called by `Submit`. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-08-06 22:20:54 +05:30
PixelyIon	e65707cd9d	Handle `CommandExecutor` submission at end of `ChannelGpfifo` PB Any work that was done in a `ChannelGpfifo` pushbuffer needs to be submitted at the end of it, if it isn't done then the work might incorrectly be not done till the next submission. This commit fixes it by calling `CommandExecutor::Submit` at the end of a pushbuffer, submitting any buffers that would've been left over. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-08-06 22:20:54 +05:30
PixelyIon	7b209c54a2	Only reallocate `MegaBuffer` on usage Certain submissions might not utilize megabuffering but reserve a `MegaBuffer` regardless, this is not optimal since it can inflate the allocations and waste memory. This commit addresses the issue by eliding the allocation given the current submission doesn't utilize them.	2022-08-06 22:20:54 +05:30
PixelyIon	2366f81443	Fix `Buffer::PollFence` incorrectly handling null-`FenceCycle` If a `FenceCycle` isn't attached then `PollFence` returned `false` while it should return if the buffer has any concurrent GPU usages in flight, this has now been fixed by returning `true` in those cases.	2022-08-06 22:20:54 +05:30
PixelyIon	34e1e39d1c	Always reset all attached resources on `Submit` Certain resources can be attached to an empty `Submit` with no nodes, this can cause it to become a false dependency and not be removed till the next non-empty submission. This has now been fixed by doing a reset regardless of if any nodes exist.	2022-08-06 22:20:54 +05:30
PixelyIon	47db8e8cbc	Fix GPU inline copy callback for `Buffer::Write` The GPU inline copy callback was broken for `Buffer::Write` as it wasn't always called when it needed to be and didn't handle attaching of the buffer to the executor which would cause it to be unlocked. This commit addresses both of these issues, it introduces a `AttachLockedBuffer` method to attach an already locked buffer to the executor.	2022-08-06 22:20:54 +05:30
PixelyIon	2636a37b31	Introduce alternative FPS measurement for disabled frame throttling The FPS is implicitly bound to the refresh rate due to the timestamp being that of the presentation time, this leads to a misleading FPS figure for disabled frame throttling. It has now been fixed by using the frame submission time rather than the presentation time when frame throttling is disabled and to make this more apparent the color of the OSD FPS has been changed.	2022-08-06 22:20:54 +05:30
PixelyIon	0f56d01e58	Fix `Packed` format component ordering in `IsAdrenoAliasCompatible` All `Packed` formats have their components stored in the opposite ordering to the label, this was not followed for `IsAdrenoAliasCompatible` prior and the ordering has now been flipped.	2022-08-06 22:18:42 +05:30
PixelyIon	3ca56ef578	Fix NCE Trapping API Deadlock A deadlock was caused by holding `trapMutex` while waiting on the lock of a resource inside a callback while another thread holding the resource's mutex waits on `trapMutex`. This has been fixed by no longer allowing blocking locks inside the callbacks and introducing a separate callback for locking the resource which is done after unlocking the `trapMutex` which can then be locked by any contending threads.	2022-08-06 22:18:42 +05:30
PixelyIon	a6599c30b4	Correct `IntervalMap` insertion `end` calculation The `end` pointer for `interval` was incorrectly calculated as `interval.data() + interval.size_bytes()` which would be incorrect when the interval span type is not `u8` as the pointer derived from `interval.data()` would be a pointer to the span type rather than a byte pointer and be subject to arithmetic of that object's size rather than in terms of a byte.	2022-08-06 22:18:42 +05:30
PixelyIon	b0910e7b1a	Avoid locking `Texture`/`Buffer` in trap handler We generally don't need to lock the `Texture`/`Buffer` in the trap handler, this is particularly problematic now as we hold the lock for the duration of a submission of any workloads. This leads to a large amount of contention for the lock and stalling in the signal handler when the resource may be `Clean` and can simply be switched over to `CpuDirty` without locking and utilizing atomics which is what this commit addresses.	2022-08-06 22:18:42 +05:30
PixelyIon	a60d6ec58f	Replace host immutability `FenceCycle` with GPU usage tracking We utilized a `FenceCycle` to keep track of if the buffer was mutable or not and introduced another cycle to track GPU-side requirements only on fulfillment of which could the buffer be utilized on the host but due to the recent change in the behavior this system ended up being unoptimal. This commit replaces the cycle with a boolean tracking if there are any usages of the resource on the GPU within the current context that may prevent it from being mutated on the CPU. The fence of the context is simply attached to the buffer based off this which was allowed as the new behavior of buffer fences matches all the requirements for this.	2022-08-06 22:18:42 +05:30
PixelyIon	217d484cba	Abstract `TextureView`/`BufferDelegate` locking into `LockableSharedPtr` An atomic transactional loop was performed on the backing `std::shared_ptr` inside `BufferView`/`TextureView`'s `lock`/`LockWithTag`/`try_lock` functions, these locks utilized `std::atomic_load` for atomically loading the value from the `shared_ptr` recursively till it was the same value pre/post-locking. This commit abstracts the locking functionality of `TextureView`/`BufferDelegate` into `LockableSharedPtr` to avoid code duplication and removes the usage of `std::atomic_load` in either case as it is not necessary due to the implicit memory barrier provided by locking a mutex.	2022-08-06 22:18:42 +05:30
PixelyIon	2d08886e4e	Utilize `TextureView` rather than `Texture` for presentation `PresentationEngine` and `GraphicBufferProducer` methods that utilized textures for the surface utilized the `Texture` type rather than the `TextureView` type, this was never correct but at the time of authoring this code `TextureView` was not finalized and in a major flux which is why it was not utilized and `Texture` was utilized instead. Now that is is far more stable, it has been replaced with `TextureView`.	2022-08-06 22:18:42 +05:30
PixelyIon	d7399e33c1	Avoid waiting on mutex in `PresentationEngine::Present` We want to block on the host thread during presentation while the host surface isn't present to implicitly pause the game, this can end up being fairly costly as it involves locking the `PresentationEngine` mutex which can lead to a lot of contention with the presentation thread. This fixes the issue by polling if there is a surface and only if there isn't then doing the wait as it isn't mandatory to wait always, we'll eventually run into the guest thread stalling.	2022-08-06 22:18:42 +05:30
PixelyIon	30475ffc43	Fix `queueBuffer` `GraphicBuffer` Compatibility Check Newer versions of the Deko3D homebrew were crashing due to this check and it was discovered that the check was incorrect and rather than comparing the `NvSurface` what had to be compared was the `GraphicBuffer` associated with the slot directly. Co-authored-by: lynxnb <niccolo.betto@gmail.com>	2022-08-06 22:18:42 +05:30
PixelyIon	c2685d5f5c	Fix consistency issues with external project copyright headers The copyright headers for external project such as yuzu/Ryujinx were inconsistent in ordering, Skyline should always be the first item in the list. In addition, they didn't always link to the project's GitHub which has also been fixed.	2022-08-06 22:18:42 +05:30
PixelyIon	0ac5f4ce27	Lock `TextureManager`/`BufferManager` during submission Multiple threads concurrently accessing the `TextureManager`/`BufferManager` (Referred to as "resource managers") has a potential deadlock with a resource being locked while acquiring the resource manager lock while the thread owning it tries to acquire a lock on the resource resulting in a deadlock. This has been fixed with locking of resource manager now being externally handled which ensures it can be locked prior to locking any resources, `CommandExecutor` provides accessors for retrieving the resource manager which automatically handles locking aside doing so on attachment of resources.	2022-08-06 22:18:42 +05:30
PixelyIon	1239907ce8	Rework `Texture` & `Buffer` for `Context` and `FenceCycle` Chaining GPU resources have been designed with locking by fences in mind, fences were treated as implicit locks on a GPU, design paradigms such as `GraphicsContext` simply unlocking the texture mutex after attaching it which would set the fence cycle were considered fine prior but are unoptimal as it enforces that a `FenceCycle` effectively ensures exclusivity. This conflates the function of a mutex which is mutual exclusion and that of the fence which is to track GPU-side completion and led to tying if it was acceptable to use a GPU resource to GPU completion rather than simply if it was not currently being used by the CPU which is the function of the mutex. This rework fixes this with the groundwork that has been laid with previous commits, as `Context` semantics are utilized to move back to using mutexes for locking of resources and tracking the usage on the GPU in a cleaner way rather than arbitrary fence comparisons. This also leads to cleaning up a lot of methods that involved usage of fences that no longer require it and therefore can be entirely removed, further cleaning up the codebase. It also opens the door for future improvements such as the removal of `hostImmutableCycle` and replacing them with better solutions, the implementation of which is broken at the moment regardless. While moving to `Context`-based locking the question of multiple GPU workloads being in-flight while using overlapping resources came up which brought a fundamental limitation of `FenceCycle` to light which was that only one resource could be concurrently attached to a cycle and it could not adequately represent multi-cycle dependencies. `FenceCycle` chaining was designed to fix this inadequacy and allows for several different GPU workloads to be in-flight concurrently while utilizing the same resources as long as they can ensure GPU-GPU synchronization.	2022-08-06 22:18:42 +05:30
PixelyIon	07d45ee504	Introduce `FenceCycle` Chaining If we want to allow submitting multiple pieces of work to the GPU at once while still requiring CPU synchronization, we'll need to track all past fence cycles associated with a resource alongside the current one. To solve this the concept of chaining fences has been introduced, fences from past usages can be chained to the latest fence which'll then recursively forward operations to chained fences. This change also ends up mandating a move away from `FenceCycleDependency` as it would prevent fences from concurrently locking the same resources which is required for chaining to work as two fences being chained fundamentally means they're locking the same resources. The `AtomicForwardList` is therefore used as the new container.	2022-08-06 22:18:42 +05:30
PixelyIon	cf9e31c1eb	Implement Atomic Forward List An implementation of a singly-linked list with atomic access to allow for lock-free access semantics, it eliminates the requirement for a mutex which can introduce additional consideration for synchronization.	2022-08-06 22:18:42 +05:30
PixelyIon	6b9269b88e	Introduce `Context` semantics to GPU resource locking Resources on the GPU can be fairly convoluted and involve overlaps which can lead to the same GPU resources being utilized with different views, we previously utilized fences to lock resources to prevent concurrent access but this was overly harsh as it would block usage of resources till GPU completion of the commands associated with a resource. Fences have now been replaced with locks but locks run into the issue of being per-view and therefore to add a common object for tracking usage the concept of "tags" was introduced to track a single context so locks can be skipped if they're from the same context. This is important to prevent a deadlock when locking a resource which has been already locked from the current context with a different view.	2022-08-06 22:18:42 +05:30
PixelyIon	d913f29662	Only set `hasFragileUserData` for signed builds We do not want to allow saving of user data on unsigned builds as they don't have a stable signature and will not properly handle reinstallation. This can lead to a situation where the user has to resort to complex techniques to completely uninstall the package such as ADB or calling into PM directly.	2022-08-06 22:18:42 +05:30
PixelyIon	3139889a09	Implement Asynchronous Presentation We currently present all frames synchronously on the thread that calls into SurfaceFlinger functions, this is unoptimal as it doesn't match guest behavior which can lead to delaying the guest from working on the next frame. This commit queuing up frames to non-blocking and handles all waiting then presenting the frame on a dedicated thread.	2022-08-06 22:18:42 +05:30
PixelyIon	6e09dc5204	Fix thread name setting We utilize `pthread_setname_np` to set the thread names but didn't check for any errors which resulted in the `Skyline-Choreographer` and `ChannelCmdFifo` not having proper names as they exceeded the 16 character limit on thread names for the pthread function. This has now been fixed by changing the names and introducing error checking to invocations of this function.	2022-08-06 22:18:42 +05:30
PixelyIon	7a0cfb484c	Add NPOT `AlignUp` utility All our normal alignment functions are designed to only handle power of 2 (`POT`) multiples as we only align or check alignment to `POT` multiples but there are cases where this is not possible and we deal with `NPOT` multiples which is why this function is required.	2022-08-06 22:18:42 +05:30
PixelyIon	662ea532d8	Skip waiting on host GPU after command buffer submission We waited on the host GPU after `Execute` but this isn't optimal as it causes a major stall on the CPU which can lead to several adverse effects such as downclocking by the governor and losing the opportunity to work in parallel with the GPU. This has now been fixed by splitting `Execute`'s functionality into two functions: `Submit` and `SubmitWithFlush` which both execute all nodes and submit the resulting command buffer to the GPU but flushing will wait on the GPU to complete while the non-flush variant will not wait and work ahead of the GPU.	2022-08-06 22:18:42 +05:30
PixelyIon	5129d2ae78	Add move-assignment semantics to `ActiveCommandBuffer`/`MegaBuffer` We need move-assignment semantics to viably utilize these objects as class members, they cannot be replaced without move-assign (or copy-assign but that is undesirable here). This commit fixes that by introducing a move assignment operator to them while making the `slot` a pointer which has the necessary nullability semantics.	2022-08-06 22:18:42 +05:30
lynxnb	8991ccac65	Pass `ViewHolder` on bind to RecyclerView items instead of `ViewBinding` This change lets items get the updated position of their view holder in the adapter. Fixes an issue where the position of items was not updated after being removed from a `SelectableGenericAdapter`.	2022-08-06 22:00:19 +05:30
lynxnb	bb922100cb	Improve rendering for Right-To-Left layouts	2022-08-06 22:00:19 +05:30
lynxnb	240e7033d7	Support loading a user-selected driver during vulkan initialization	2022-08-06 22:00:19 +05:30
lynxnb	c812de48ea	Show an undo button after deleting a gpu driver After a driver has been deleted, a snackbar will be shown confirming the deletion, with an button to undo it.	2022-08-06 22:00:19 +05:30
lynxnb	59c60df993	Add `GPU Driver Configuration` preference This preference launches `GpuDriverActivity` for managing custom gpu drivers. When the device has an incompatible GPU, the preference will be disabled and greyed out.	2022-08-06 22:00:19 +05:30
lynxnb	48cf1263bc	Add a custom GPU driver configuration activity The activity adds the following functionalities: * Lists installed drivers * Allows the user to install new drivers, or remove installed ones * Allows the user to select the driver that will be used by the emulator	2022-08-06 22:00:19 +05:30
lynxnb	e9f609b923	Add a `gpuDriver` preference setting This setting represent the GPU driver selected by the user to be used by the emulator.	2022-08-06 22:00:19 +05:30
lynxnb	1815199d2b	Add utilities for reading and installing gpu driver packages	2022-08-06 22:00:19 +05:30
lynxnb	f3dd3e53c1	Miscellaneous imports cleanup in `preference` package	2022-08-06 22:00:19 +05:30
lynxnb	1dfea9ef6f	Create an `ItemDecorations` file for all `RecyclerView` item decorations All item decorations are now placed in one file so that any `RecyclerView` in the app can use the same ones.	2022-08-06 22:00:19 +05:30
lynxnb	a59f2baa3a	Add a `SelectableGenericAdapter` as subclass of `GenericAdapter` `SelectableGenericAdapter` extends `GenericAdapter` with support for marking one item as selected.	2022-08-06 22:00:19 +05:30
lynxnb	e93fdce845	Add support for removal of items from `GenericAdapter`	2022-08-06 22:00:19 +05:30
lynxnb	0d1c7965df	Add a `ZipUtils` class for unpacking zip files	2022-08-06 22:00:19 +05:30
lynxnb	b03f624191	Add `kotlinx.serialization-json` dependencies	2022-08-06 22:00:19 +05:30
Billy Laws	f52ea7bddb	Make deferred draw and constant buffer updates reentrant-safe At some point we will call Submit within draws or constant buffer updates, to avoid any infinite recursion mark draw/cbuf pending as false before performing any operation	2022-07-29 20:07:14 +01:00
Billy Laws	dbb684835f	Fix depthClampDisable register offset in Maxwell 3D	2022-07-29 20:07:14 +01:00
Billy Laws	7fd9d347e3	Use per-RT blend enable registers even when independent blend is disabled The common blend enable register seems to be used for something else. This is required for blending to work correctly in OpenGL games	2022-07-29 20:07:14 +01:00
Billy Laws	048c2fdd29	Fix Vulkan framebuffer dimensions calculations The framebuffer needs to be large enough to contain both the render area extent and offset	2022-07-29 20:07:14 +01:00
Billy Laws	0e1aa765fc	Prevent CNTVCT_EL0 reads from being optimised out by the compiler Without this the compiler will assume the read always produces the same value, causing issues when the register is used to time function execution	2022-07-29 20:07:14 +01:00
Billy Laws	1df98ba57f	Enable fwrapv for defined signed integer overflow behaviour Nintendo enables this for HOS so we should do the same to avoid any cases where it's relied on.	2022-07-29 20:07:14 +01:00

1 2 3 4 5 ...

1116 Commits