skyline

mirror of https://github.com/skyline-emu/skyline.git synced 2024-11-23 06:29:18 +01:00

Author	SHA1	Message	Date
Billy Laws	7f1b6de1fe	Update hades	2023-02-04 23:10:45 +00:00
Billy Laws	94ac457ce0	Ensure mappings are always aligned to big page size when deallocated and mapped Since we align up when allocating, not doing so when deallocating would result in a gradual buildup of boundary pages that eventually fill the whole address space.	2023-02-04 23:10:45 +00:00
Billy Laws	d659b4f55e	Swap min and max depth when negative scale is used Fixes Super Mario 3D All Stars rendering.	2023-02-04 23:10:45 +00:00
Billy Laws	198e9e8e48	Avoid page faults when using the fallback shader size These occured in some homebrew otherwise.	2023-02-04 23:10:45 +00:00
Billy Laws	10e7e6272a	Pass in pipeline tessellation state to Vulkan	2023-02-04 23:10:45 +00:00
Billy Laws	12c88babd0	Fix address space allocator slow path to avoid OOB	2023-02-04 23:10:45 +00:00
Billy Laws	4a4f6df792	Stub GetBufferHistory transaction	2023-02-04 23:10:45 +00:00
Billy Laws	3795ecceff	Stub IdleTickCount GetInfo result	2023-02-04 23:10:45 +00:00
Billy Laws	3ef84b27c3	Avoid pipeline cache warning	2023-02-04 23:10:45 +00:00
Billy Laws	bb3baa888d	Add a hack to disable shader subgroup shuffles These are about 100x as expensive on adreno than nvidia due to the lack of a dedicated instruction, since some games work fine without them add a hack to disable them.	2023-02-04 23:10:45 +00:00
Billy Laws	568306195f	Prevent Vulkan guest crashes by avoiding intermediate syncpt event signal state The vulkan guest driver doesn't expect a 0xB return code from SyncptEventWait, even though this is valid when an event is being signalled. Just ignore the intermediate state instead as doing so avoids races without causing any more.	2023-02-04 23:10:45 +00:00
Billy Laws	fcb8f2a229	Apply texture shader compiler generated descriptor shifts These were missed on a hades version upgrade.	2023-02-04 23:10:45 +00:00
Billy Laws	bbef006051	Simplify free descriptor set accounting and update ratios Slightly reduces descriptor usage in Breath of The Wild	2023-02-04 23:10:45 +00:00
Billy Laws	5e862cf5f7	Bail out early if the new pipeline key matches that of the current one Prevents the transition cache of some pipelines from getting full of copies of itself in cases where an update happens redundantly.	2023-02-04 23:10:45 +00:00
Billy Laws	3e971d4043	Wait for pipeline compilation to finish before loading the guest The excessive blocking caused by initial compilation happening async to the guest caused issues in some cases, now we have a Vulkan pipeline cache to speed it up we can wait for a full compile before launch without too many issues.	2023-02-04 23:10:45 +00:00
Billy Laws	be6f08cd97	Add debug pipeline statistics recording for finding redundant pipelines	2023-02-04 23:10:45 +00:00
Billy Laws	6333a92b53	Only include active RTs in pipeline state key This was causing a buildup of many redundant pipelines in SMO as a depth-only shader was being called without previous RTs being unbound.	2023-02-04 23:10:45 +00:00
Billy Laws	9d3a9f63d5	Move graphics piplines away from storing hades shader info struct By only using what we need, and mirroring the descriptor structs to allow for much tighter packing (while keeping the same member names) we can reduce pipeline memory to about 1/3 of what it was before.	2023-02-04 23:10:45 +00:00
Billy Laws	dd92cb1536	Implement support for (de)serialising VkPipelineCaches to/from storage Significantly improves launch times in games with many shader combinations, giving an 5x speedup in some cases.	2023-02-04 23:10:45 +00:00
Billy Laws	db173083d7	Update edge credits	2023-02-04 23:10:45 +00:00
PabloG02	35617930d5	Fix rebase	2023-01-28 11:57:19 +00:00
PabloG02	8b9d6f79ab	Add option to enable/disable shader cache	2023-01-28 11:57:19 +00:00
PixelyIon	8bfda0d84d	Fix hole punching in mappings with SVC `UnmapPhysicalMemory` Certain titles such as Super Smash Bros Ultimate can use SVC `UnmapPhysicalMemory` to punch holes into physical memory mappings, this wasn't handled correctly as we completely deleted the portion after the hole. It has now been fixed which results in these titles which depend on this behavior to work now.	2023-01-23 21:28:59 +00:00
hacobot.dev	ff1e62df7a	deleted unnecessary convertion	2023-01-23 21:28:49 +00:00
hacobot.dev	75f6f5e31c	pull request requested changes	2023-01-23 21:28:49 +00:00
hacobot.dev	7cd13916a3	Main activity is now refreshing when the group checkbox is changed	2023-01-23 21:28:49 +00:00
hacobot.dev	b67bfe3848	Added functionality to make optional to group games by format and sort	2023-01-23 21:28:49 +00:00
Billy Laws	6b9be2edd4	Add note about circular queue append contiguosity guarantees	2023-01-20 21:19:04 +00:00
PabloG02	535eafb57a	Add Android 13 themed icon	2023-01-20 21:08:33 +00:00
PabloG02	d544ccf5ea	Stub INotificationServicesForApplication	2023-01-20 21:08:12 +00:00
PabloG02	2fa5ea451e	Stub IPrepoService::SaveReportWithUserOld2	2023-01-20 21:08:12 +00:00
PabloG02	7327cdbde9	Stub some functions in IDeliveryCacheStorageService	2023-01-20 21:08:12 +00:00
PabloG02	c53d99d393	Stub IDeliveryCacheFileService and IDeliveryCacheDirectoryService	2023-01-20 21:08:12 +00:00
PabloG02	299d11d86f	Stub IApplicationFunctions::GetNotificationStorageChannelEvent	2023-01-20 21:08:12 +00:00
Billy Laws	7c623f8301	Use a spinlock for thread waiter mutex Since the waitermutex is only ever locked for a short amount of time, spinning in contention-heavy scenarios ends up quite a bit more efficient than a kernel wait.	2023-01-20 21:07:59 +00:00
Billy Laws	e2463b7619	Adjust gpfifo WFI to only do a pipeline barrier	2023-01-20 21:07:59 +00:00
Billy Laws	2b282ece1a	Add more fine-grained buffer recreation locking	2023-01-20 21:07:59 +00:00
Billy Laws	85a23e73ba	Implement a shared spinlock and use it for GPU VMM	2023-01-20 21:07:59 +00:00
Billy Laws	fd5c141dbf	Correct GetNpadIrCameraHandle return value	2023-01-20 21:07:59 +00:00
Billy Laws	a8b32c3cef	Cleanup helper pipeline cache code	2023-01-20 21:07:59 +00:00
Billy Laws	1f99d63a80	Incr transition cache size	2023-01-20 21:07:59 +00:00
Billy Laws	262f92900d	Ensure unmapped VMM ranges return an invalid span	2023-01-20 21:07:59 +00:00
Billy Laws	0a608fb4b2	Update to latest hades	2023-01-20 21:07:59 +00:00
Billy Laws	44f6aada18	Always set blend state for all colour attachments	2023-01-20 21:07:59 +00:00
Billy Laws	177925be93	Avoid OOB memory acceses when trying to read OOB TICs Some games pass in invalid texture handles (0xffff) when they don't need the texture so return the null texture in this case.	2023-01-20 21:07:59 +00:00
Billy Laws	d8a4a2b08d	Use a spinlock for GPU waiter thread	2023-01-20 21:07:59 +00:00
Billy Laws	f1aed86177	Add a workaround for split-mapping shaders Some games split shaders across multiple mappings and also miss the end header, so read a suitably large amount and hope that's enough for now.	2023-01-20 21:07:59 +00:00
Billy Laws	704660bbeb	Store render nodes in a linearly allocated linked list This is much faster in reldebug builds than boost::stable_vector while still providing iterator stability	2023-01-20 21:07:59 +00:00
Billy Laws	326c05a5de	Add guest shader replacement and dumping support	2023-01-20 21:07:59 +00:00
Billy Laws	2f6d27e8d7	Rework circular queue locking Should now be (hopefully) race-free, also switch to a spinlock to avoid any locking overhead.	2023-01-20 21:07:59 +00:00
lynxnb	5d527cb965	Add `CNTFRQ_EL0` workaround value for Exynos 1280	2023-01-15 10:16:01 +00:00
PabloG02	ea0217de47	Add TIC format: 0x78D24952	2023-01-13 18:05:22 +00:00
Abandoned Cart	88b3f371f4	Display a preview of the current profile picture This removes the need to concatenate the variable multiple times, recycles the scaled bitmap after it has been stored, addresses the Android Studio complaint about that method name, and generates a preview of the current profile image as the preference icon.	2023-01-13 14:28:20 +01:00
lynxnb	aa36c591c6	Exclude Home button from controller setup guide	2023-01-11 20:51:18 +00:00
Maccraft123	c3924e0f08	Stub out InlineKeyboard instead of throwing an error	2023-01-11 20:47:39 +00:00
lynxnb	2a421e7146	Run emulation in a separate process for release builds only	2023-01-11 23:38:57 +05:30
lynxnb	950438bf58	Enable `VK_KHR_image_format_list` during device init `VK_KHR_image_format_list` is a requirement for `VK_KHR_imageless_framebuffer`, which we use.	2023-01-11 23:38:57 +05:30
PixelyIon	d39112e9b9	Enable `IApplicationDisplayService::ConvertScalingMode` implementation The implementation for this service function wasn't added to the service function table. Additionally, the type for the output `ScalingMode` was implicitly `int` as it was unspecified in the `enum class` which has now been corrected to `u64` as it should be.	2023-01-11 23:38:57 +05:30
PixelyIon	45d0558d00	Check for no Vulkan physical devices Due to broken drivers, it's possible to find no Vulkan physical devices but this can lead to a cryptic segfault. This explicitly checks for it instead and throws an exception which will be emitted into logcat thus can be easily caught.	2023-01-11 23:38:57 +05:30
PixelyIon	f882b613bc	Fix `.hook` section being allocated without any hooked symbols Due to the trampoline and save/load context functions, `GetHookSectionSize` returned a non-zero size for when there were no hooked symbols supplied to it. This is problematic as it isn't required and hooking is currently not stable so it can lead to crashes or freezes in certain titles.	2023-01-11 00:13:15 +05:30
PixelyIon	3fa314f6cb	Always print thread IDs rather than handles for SVC logs Handles are rather arbitrary and difficult to reference, as a result, we've moved to thread IDs across the board for logs.	2023-01-11 00:13:15 +05:30
PixelyIon	e192d4e5c1	Warn when `RemoveThread` is called on a non-inserted thread	2023-01-11 00:13:15 +05:30
PixelyIon	3a6f205e6f	Clear `insertThreadOnResume` in `RemoveThread` A thread can be paused while it is in a synchronization primitive which will do `RemoveThread`, we need to update the state of `insertThreadOnResume` in this case by clearing it so it isn't incorrectly reinserted on resuming the thread.	2023-01-11 00:13:15 +05:30
PixelyIon	7fef849594	Make `UpdateCore`'s locking `coreMigrationMutex` requirement explicit `Scheduler::UpdateCore` implicitly depended on `KThread::coreMigrationMutex` being locked during calls to it, this requirement has now been made explicit to avoid confusion.	2023-01-11 00:13:15 +05:30
PixelyIon	c4b4532222	Check `waitThread` rather than `waitMutex` during condvar timeouts When a timeout occurs in `ConditionVariableWait`, we used to check `waitMutex` which is cleared by `MutexUnlock` but when we hit the CAS case in `ConditionVariableSignal` then we don't clear `waitMutex`. It's far more reliable to check `waitThread` as an indication for if the thread has already been unlocked as it's cleared at the start of `ConditionVariableWait` and would implicitly stay cleared in the CAS case while being set in `MutexLock` and being unset in `MutexUnlock`.	2023-01-11 00:13:15 +05:30
PixelyIon	2525bafe06	Consolidate thread yielding in `Scheduler` There's multiple locations where a thread is yielded in the scheduler and all of them repeat the code of checking for `pendingYield` and signalling with an optional optimization of checking if the thread being yielded is the calling thread. All this functionality has now been consolidated into `Scheduler::YieldThread` which checks for `pendingYield` and does the calling thread yield optimization. This should lead to better readability and better performance in cases where `UpdatePriority` would signal the calling thread.	2023-01-11 00:13:15 +05:30
PixelyIon	8b973a3de3	Always set `forceYield` for running threads in `PauseThread` `forceYield` was incorrectly not set when pausing running threads if the thread already had `pendingYield` set. This could lead to cases where `Rotate` would later throw an exception due to it being unset.	2023-01-11 00:13:15 +05:30
PixelyIon	6645692288	Don't block while inserting paused threads Blocking while inserting a paused thread can lead to deadlocks where the inserting thread later resumes the paused thread. Co-authored-by: Billy Laws <blaws05@gmail.com>	2023-01-11 00:13:15 +05:30
Billy Laws	643f4cf864	Ensure thread doesn't migrate during `InsertThread` As we didn't hold `coreMigrationMutex`, the thread could simply migrate during `InsertThread` which would lead to the thread potentially never waking up as it's been inserted on a non-resident core. Co-authored-by: PixelyIon <pixelyion@protonmail.com>	2023-01-11 00:13:15 +05:30
PixelyIon	7f7352ed59	Recalculate highest-priority waiters during cvar/address signaling `SignalToAddress`/`ConditionVariableSignal` need to wake waiters in priority order, while threads are inserted in order this doesn't remain the case as priority updates don't reinsert the thread into `syncWaiters`. It was determined that reinsertion into `syncWaiters` would be fairly complex due to locking the `syncWaitersMutex` with the thread's mutexes. To avoid this, this commit instead sorts waiters by priority at signal time to always wake threads in the right order.	2023-01-11 00:13:15 +05:30
PixelyIon	626008d8e2	Fix `WaitForAddress` timeout mutex deadlock Calling `WaitSchedule` inside the block where `syncWaiterMutex` is locked causes a race with other threads which lock the core mutex and `syncWaiterMutex` together. This commit moves the `WaitSchedule` outside the block while simply setting a flag to wait later similar to `ConditionVariableWait`'s timeout case. Co-authored-by: Billy Laws <blaws05@gmail.com>	2023-01-11 00:13:15 +05:30
PixelyIon	4df3c98225	Add double-insertion debug check to `InsertThread` This is a cause for a large amount of scheduler bugs so we should generally check for this on debug builds as it is a fairly easy way to check for issues for some performance cost.	2023-01-11 00:13:15 +05:30
PixelyIon	5694c9b34b	Rename `KThread::waitKey` to `KThread::waitMutex` It was determined that `waitKey` is too ambiguous when waiter members are used for both mutexes and condition variables.	2023-01-11 00:13:15 +05:30
PixelyIon	91bb8d231a	Rename `ConditionalVariable` -> `ConditionVariable` "Conditional Variable" is a typo which was propagated through the codebase, it has been corrected to "Condition Variable".	2023-01-11 00:13:15 +05:30
PixelyIon	f487d81769	Refactor Condition Variable Waiting/Signalling The way we handled waking/timeouts of condition variables was fairly inaccurate to HOS as we moved locking of the mutex to the waker thread which could change the order of operations and would cause what were functionally spurious wakeups for all awoken threads. This commit fixes it by doing all locks on the waker thread and only awakening the waiter thread once the condition variable was signalled and the mutex was unlocked. In addition, this fixes races between a timeout and a signal that could lead to double-insertion as a result of a refactor of how timeouts work in the new system.	2023-01-11 00:13:15 +05:30
PixelyIon	1eb4eec103	Allow locking external thread in `MutexLock` We want the ability to lock mutexes on behalf of other threads to refactor condition variables to match HOS on waking behavior.	2023-01-11 00:13:15 +05:30
PixelyIon	6bbe9de881	Fix result returned by `MutexLock` `MutexLock` incorrectly returned `InvalidCurrentMemory` for cases where the userspace value didn't match the expected value. It's been corrected to return no error in those cases while preserving the error code for usage in `ConditionalVariableWait`.	2023-01-11 00:13:15 +05:30
PixelyIon	08ef88b156	Add early-timeout path for `WaitForAddress`	2023-01-11 00:13:15 +05:30
PixelyIon	d0c56235f4	Read `address` atomically in `WaitForAddress` We didn't read the values for arbitration atomically in all cases as we should have, this consolidates the reading of the value and uses the value across all cases.	2023-01-11 00:13:15 +05:30
PixelyIon	e8a1bd1aad	Fix `WaitForAddress` timeout signal race A race could occur from the timeout path in `WaitForAddress` taking place at the same time as `SignalToAddress` has been caused, this causes a deadlock due to double-insertion.	2023-01-11 00:13:15 +05:30
Billy Laws	0f1d97fe2c	Update edge supporter names	2023-01-08 21:35:14 +00:00
Billy Laws	31fb6d30eb	Fake maxwell occlusion query results	2023-01-08 19:30:52 +00:00
Billy Laws	a92c26531e	Keep holes in descriptors for unsupported bindings	2023-01-08 19:30:52 +00:00
Billy Laws	81d82008c7	Pre-signal suspend ticks event	2023-01-08 19:30:52 +00:00
Billy Laws	3e5992e366	Update hades	2023-01-08 19:30:52 +00:00
Billy Laws	45bbf3bb2a	Fix indirect draws with direct buffers We need to wait on the GPFIFO manually as we won't hit the traps when accesing the indirect params with direct as we usually would.	2023-01-08 19:30:52 +00:00
Billy Laws	68ad052cb1	Add geometry passthrough shader support for vertex layer writes	2023-01-08 19:30:52 +00:00
Billy Laws	ec519a7d52	Return null texture on encountering unmapped textures	2023-01-08 19:30:52 +00:00
Billy Laws	97e127153b	Make shader trap mutex recursive There are cases there we hit a shader trap within the GPU, by making it recursive we avoid deadlocking on reads within the GPU.	2023-01-08 19:30:52 +00:00
Billy Laws	1a6165f74d	Fix GetReadOnlyBackingSpan for non-direct buffers This was missed in the initial implementation	2023-01-08 19:30:52 +00:00
Billy Laws	4e5141f879	Fix missed attempt increment in spinlock Should hog CPU slightly less and correctly yield now	2023-01-08 19:30:52 +00:00
Billy Laws	35a46acbb1	Determine storage buffer alignment dynamically	2023-01-08 19:30:52 +00:00
Billy Laws	12d80fe6c2	Use a shared mutex for GPU VMM to avoid deadlocks Two reads need to be able to occur simultanously or deadlocks ccan occur (e.g read traps to wait on GPU but GPU needs to read).	2023-01-08 19:30:52 +00:00
Billy Laws	28b2a7a8a1	Dynamically apply GPU turbo clocks only when GPU submissions are queued Allows for the GPU to clock down in cases where it's idle for most of the time, while still forcing maximum clocks when we care.	2023-01-08 19:30:52 +00:00
Billy Laws	81f3ff348c	Transition memory handling from memfd to anonymous shared mappings Memfd mappings are incompatible with KGSL user memory importing on older kernels, transition to shared anon mappings to avoid this.	2023-01-08 19:30:52 +00:00
Billy Laws	cc3c869b9f	Attempt to signal the vsync event at present time if possible Some games rely on the vsync event to schedule frames, by matching its timing with presentation we can reduce needless waiting as the game will immediely be able to queue the next frame after presentation.	2023-01-08 19:30:52 +00:00
Billy Laws	918a493a45	Implement wfi and setReference GPFIFO barriers	2023-01-08 19:30:52 +00:00
Billy Laws	7315ba04e6	Fixup optional flattenable binder obj structure	2023-01-08 19:30:52 +00:00
Billy Laws	90e21b0ca1	Split syncpoints into host-guest pairs This allows for the presentation engine to grab the presentation image early when direct buffers are in use, since it'll handle sync on its own using semaphores it doesn't need to wait for GPU execution.	2023-01-08 19:30:52 +00:00
Billy Laws	966c31810a	Return appropriate fences in surfaceflinger queue buffer	2023-01-08 19:30:52 +00:00
Billy Laws	afef6c5123	Always populate all colour attachments This better follow the Vulkan spec, which doesn't mention anything about writes to OOB attachments, only those marked as unused.	2023-01-08 19:30:52 +00:00
Billy Laws	3571737392	Reset maxwell3d quick bind state before adding subpasses to executor If a submission happens during the call to addsubpass we could end up with invalid quick bind state, move this to to before to prevent that.	2023-01-08 19:30:52 +00:00
Billy Laws	3d31ade35f	Implement an alternative buffer path using direct memory importing By importing guest memory directly onto the host GPU we can avoid many of the complexities that occur with memory tracking as well as the heavy performance overhead in some situations. Since it's still desired to support the traditional buffer method, as it's faster in some cases and more widely supported, most of the exposed buffer methods have been split into two variants with just a small amount of shared code. While in most cases the code is simpler, one area with more complexity is handling CPU accesses that need to be sequenced, since we don't have any place we can easily apply writes to on the GPFIFO thread that wont also impact the buffer on the GPU, to solve this, when the GPU is actively using a buffer's contents, an interval list is used to keep track of any GPFIO-written regions on the CPU and any CPU reads to them will instead be directed to a shadow of the buffer with just those writes applied. Once the GPU has finished using buffer contents the shadow can then be removed as all writes will have been done by the GPU. The main caveat of this is that it requires tying host sync to guest sync, this can reduce performance in games which double buffer command buffers as it prevents us from fully saturating the CPU with the GPFIFO thread.	2023-01-08 19:30:52 +00:00
Billy Laws	b3f7e990cc	Allow for tying guest GPU sync operations to host GPU sync This is necessary for the upcoming direct buffer support, as in order to use guest buffers directly without trapping we need to recreate any guest GPU sync on the host GPU. This avoids the guest thinking work is done that isn't and overwriting in-use buffer contents.	2023-01-08 19:30:52 +00:00
Billy Laws	89c6fab1cb	Implement a way to check if the command record thread is idle Useful for debugging and testing	2023-01-08 19:30:52 +00:00
Billy Laws	c67f27e914	Add a setting to control the maximum number of accumulated GPU cmds This helps to keep the GPU fed when processing large command buffers which don't have any syncpoints to force a flush inbetween.	2023-01-08 19:30:52 +00:00
Billy Laws	77214a98dd	Add a setting to force maximum GPU clocks on KGSL devices	2023-01-08 19:30:52 +00:00
Billy Laws	83ecc33a77	Update adrenotools	2023-01-08 19:30:52 +00:00
Billy Laws	3ecaedd71e	Add adrenotools direct mapping support	2023-01-08 19:30:52 +00:00
Pablo	8846a85d3a	Stub some IPurchaseEventManager functions	2022-12-31 10:45:18 +00:00
PabloG02	80c0f8f04d	Implement full profile picture support Extends the profile picture stub into a full-fledged implementation with the ability for users to set their profile picture in settings while having the Skyline icon as the default profile picture.	2022-12-27 22:53:41 +05:30
PixelyIon	7a3d2e4a26	Start `KThread` TID from 1 rather than 0 HOS's TIDs are one-based rather than zero-based, certain titles such as Pokémon Arceus, Naruto Shippuden: Ultimate Ninja Storm 3, Splatoon 3, etc. use the TID being zero as a sentinel value but as we assigned this ID to our first thread prior it broke this logic which has now been fixed by this commit as it now matches HOS behavior.	2022-12-27 22:36:06 +05:30
Billy Laws	bab659587f	Use e1 sample count for blits	2022-12-22 18:05:45 +00:00
Billy Laws	516ece6b04	Calculate renderarea from attachment min size	2022-12-22 18:05:45 +00:00
Billy Laws	4a3cd69257	Populate graphics pipeline manager from cache at launch-time	2022-12-22 18:05:45 +00:00
Billy Laws	e9bcdd06eb	Introduce a pipeline cache manager for simple read/write cache accesses All writes are done async into a staging file, which is then merged into the main pipeline cache file at the time of the next launch. Upon encountering file corruption the cache can be trimmed up to the last-known-good entry to avoid any excessive loss of data from just one error.	2022-12-22 18:05:45 +00:00
Billy Laws	06bf1b38af	Introduce a pipeline state accessor that reads from a bundle	2022-12-22 18:05:45 +00:00
Billy Laws	7dd3a1db0f	Avoid InterconnectContext use in graphics PipelineManager We will soon move to a global pipeline manager instance, so it wont be possible to use InterconnectContext at pipeline-creation time anymore	2022-12-22 18:05:45 +00:00
Billy Laws	ffe7263848	Add quirk for 615 drivers with broken multithreaded compilation	2022-12-22 18:05:45 +00:00
Billy Laws	755f7c75af	Add pipeline (de)serialisation support to bundle See comments in code for details on the on-disk format.	2022-12-22 18:05:45 +00:00
Billy Laws	937eff392f	Switch execution-numbers to be globally unique tags This is required for making pipelines usable across channels without introducing caching bugs.	2022-12-22 18:05:45 +00:00
Billy Laws	072b8193a1	Implement thread pool based async pipeline compilation with futures By distributing the load of shader compiling onto multiple threads and then only waiting for completion until absolutely neccessary we can reduce compilation stutters significantly.	2022-12-22 18:05:45 +00:00
Billy Laws	186549748d	Implement HelperShader-local pipeline cache and use dynamic state Avoids the heavy overhead of the VK pipeline cache when we really only have a few bits of non-dynamic state	2022-12-22 18:05:45 +00:00
Billy Laws	9115b8cae8	Properly hash dynamic states in pipeline cache	2022-12-22 18:05:45 +00:00
Billy Laws	7c4b4765bf	Reduce thresholds for slot increase and buffer/texture fast readback	2022-12-22 18:05:45 +00:00
Billy Laws	f32ab1feff	Include BS thread pool library	2022-12-22 18:05:45 +00:00
Billy Laws	ce428af2e6	Use attachment formats rather than views in VK pipeline cache	2022-12-22 18:05:45 +00:00
Billy Laws	e849264028	Abstract out pipeline-compile-time GPU state accesses Introduces the base abstractions that will be used for pipeline caching, with a 'PipelineStateBundle' that can be (de)serialised to/from disk and an abstract accessor class to allow switching between creating disk-cached pipelines and fresh ones.	2022-12-22 18:05:45 +00:00
Billy Laws	2e96248fb6	Track RT format info in PackedPipelineState and move VK conv code there When caching pipelines we can't cache whole images, only their formats so refactor PackedPipelineState so that it can be used for pipeline creation, as opposed to passing in a list of attachments.	2022-12-22 18:05:45 +00:00
Billy Laws	bc7e1eb380	Split-out hash from ShaderBinary struct This isn't necessary for pipeline creation and creates some difficulty with pipeline caching.	2022-12-22 18:05:45 +00:00
Dima	de10ab1219	Stub SetConnectionConfirmationOption	2022-12-18 20:34:55 +00:00
Dima	f3b2b4317e	Stub some IPrepoService calls	2022-12-18 20:34:55 +00:00
Dima	efef67b92b	Stub some IAudioDevice calls	2022-12-18 20:34:55 +00:00
Dima	3a94bcf692	Fix ListOpenContextStoredUsers stub The problem is in StoreOpenContext wasn't storing any user, but ListOpenContextStoredUsers was writing default user (when it's not stored by StoreOpenContext)	2022-12-18 20:34:55 +00:00
TheASVigilante	3c5f8dd876	Fix small typo	2022-12-18 14:49:54 +00:00
lynxnb	6599c1dccf	Stub `GyroscopeZeroDriftMode` Related service calls are called in a loop by SM3DW. A variable tracking zero drift mode has been added to `npad_device`, but it's unused at the moment.	2022-12-10 14:59:44 +00:00
Dima	dcc3047ba8	Stub ErrorCommonArg	2022-12-10 14:58:20 +00:00
Dima	68253fe995	Stub mii:e/mii:u Needed for SSBU	2022-12-10 14:58:20 +00:00
Dima	69ee3cfc66	Stub DeleteDirectory Should allow deleting/rewriting saves in some games	2022-12-10 14:58:20 +00:00
Dima	bbd34ae7e7	Validate if entries are not empty before using Should fix saving problem in Baldur's Gate: Dark Alliance II at least	2022-12-10 14:58:20 +00:00
Dima	5f510d84d7	Stub IsVibrationPermitted	2022-12-10 14:58:20 +00:00
Dima	51d1f519af	Stub ListDisplays	2022-12-10 14:58:20 +00:00
Dima	a3866a3129	Stub LibraryAppletShop	2022-12-10 14:58:20 +00:00
Dima	1ebec7db82	Stub GetImageSize and LoadImage	2022-12-10 14:58:20 +00:00
Dima	52c4228ecf	Stub some friends service calls Needed for Diablo 3	2022-12-10 14:58:20 +00:00
Dima	ebcbc5b05b	Validate NpadId for ActivateVibrationDevice	2022-12-10 14:58:20 +00:00
Dima	4bdd033354	Stub SetRecordVolumeMuted	2022-12-10 14:58:20 +00:00
Dima	f6d95aae01	Stub GetCacheStorageSize	2022-12-10 14:58:20 +00:00
Dima	4ab8699cd4	Stub ImportServerPki	2022-12-10 14:58:20 +00:00
Dima	41cf4bb12d	Stub GetLanguageCode	2022-12-10 14:58:20 +00:00
Dima	3e078d54b6	Stub GetIdleTimeDetectionExtension	2022-12-10 14:58:20 +00:00
Dima	2311f777fc	Stub IsCpuOverclockEnabled	2022-12-10 14:58:20 +00:00
Dima	4601c28c28	Stub GetCurrentIpAddress	2022-12-10 14:58:20 +00:00
Dima	18e6a6c53c	Stub DeclareOpenOnlinePlaySession and DeclareCloseOnlinePlaySession	2022-12-10 14:58:20 +00:00
Dima	150c1370c2	Stub some IApplicationFunctions funcs	2022-12-10 14:58:20 +00:00
Dima	a6f3aa3062	Stub TrySelectUserWithoutInteraction and ListQualifiedUsers	2022-12-10 14:58:20 +00:00
Dima	5a9a2861df	Add TitleId TextView in App Dialog	2022-12-10 14:57:46 +00:00
Abandoned Cart	b08fcd7027	Favor a predefined "click" over system vibration	2022-12-10 14:57:33 +00:00
Abandoned Cart	cfd3bfecba	Add a rudimentary OSC button vibration setting	2022-12-10 14:57:33 +00:00
Billy Laws	7c802aea46	Mark vertex buffers as dirty on limit changes	2022-12-03 22:50:56 +00:00
Billy Laws	df19810c6c	Always set vertex stride for unbound buffers	2022-12-03 22:50:56 +00:00
Billy Laws	f4f658e3b7	Fix typo	2022-12-03 22:50:56 +00:00
Billy Laws	45b10ef776	Return whole mapping for shader code when end instrs aren't found	2022-12-03 22:50:56 +00:00
Billy Laws	d849875656	Only unlock GPU channel state on queue wait if it was previously locked	2022-12-03 22:50:56 +00:00
Billy Laws	a5e0a64adc	Switch patch error logs to debug	2022-12-03 22:50:56 +00:00
Billy Laws	af7c54297f	Cache staging buffer used for texture download	2022-12-03 22:50:56 +00:00
Billy Laws	8c5e6d2bb4	Update VKMA	2022-12-03 22:50:56 +00:00
Billy Laws	bba07fb101	Update for new hades	2022-12-03 22:50:56 +00:00
Billy Laws	a16383fd4b	Disable compute shaders on mali This will need to be debugged properly at some point but its fine for now.	2022-12-03 22:50:56 +00:00
Billy Laws	d69c6851f3	Update hades	2022-12-03 22:50:56 +00:00
Billy Laws	137d801843	Skip host1x HW emulation and effectively stub submission This was causing a bunch of logspam and isn't really needed as we will be using a HLE approach.	2022-12-03 22:50:56 +00:00
Billy Laws	579a2d9337	Add dynamic executor slot growth	2022-12-03 22:50:56 +00:00
Billy Laws	60169fce4c	Support 0-sized constant buffers	2022-12-03 22:50:56 +00:00
Billy Laws	b86dd99e1a	Align all SSBOs to 0x40 bytes Required by Adreno GPUs	2022-12-03 22:50:56 +00:00
Billy Laws	bfae292fb0	Make executor slot count setting exponential	2022-12-03 22:50:56 +00:00
Billy Laws	e0ae94be9d	Enable robustness1 Vulkan feature	2022-12-03 22:50:56 +00:00
Billy Laws	e8ef2d80af	CMake build file updates	2022-12-03 22:50:56 +00:00
Billy Laws	bf03f945ee	Implement the Kepler compute engine This can reuse a fair bit of the now-commonised Maxwell 3D code and mostly consists of compute-specific pipeline code which was deemed not suitable for being commonised (e.g. descriptor update code is somewhat duplicated). Of note is how compute lacks any active state at all de to its use of QMDs which bundle up all state into a single object in memory.	2022-12-03 22:50:56 +00:00
Billy Laws	4bc81f007f	Add some convinience helpers to compute engine regs	2022-12-03 22:50:56 +00:00
Billy Laws	4267a6af36	Add support for parsing and compiling compute shaders to the shader manager	2022-12-03 22:50:56 +00:00
Billy Laws	86dab65af4	Commonise maxwell3d state updater	2022-12-03 22:50:56 +00:00
Billy Laws	a0b81d54d6	Use pitch layout for linear RTs More likely to match in the texture cache when being sampled.	2022-12-03 22:50:56 +00:00
Billy Laws	ac85df7b7a	Start transition cache lookup with most recent one	2022-12-03 22:50:56 +00:00
Billy Laws	62c86b7690	Move maxwell3d to common constant buffer code	2022-12-03 22:50:56 +00:00
Billy Laws	8f0a6e78c5	Add Vulkan stride dynamic state and robustness support Fixes the waterfall in SMO by specifying vertex buffer bounds.	2022-12-03 22:50:56 +00:00
Billy Laws	23a7f70a8e	Commonise maxwell3d guest shader caching code	2022-12-03 22:50:56 +00:00
Billy Laws	6f6a312692	Commonise maxwell3d pipeline binding handling code A lot of pipeline code is difficult to commonise due to the inherent difference between compute and graphics pipelines, however the binding layout is shared so we can at least commonise that	2022-12-03 22:50:56 +00:00
Billy Laws	be8cbabd97	Commonise maxwell3d texture code This will be shared with the compute engine implementation.	2022-12-03 22:50:56 +00:00
Billy Laws	61e95c4b2c	Commonise maxwell3d sampler code This will be shared with the compute engine implementation, the only thing of note with this is that the binding register is now passed as a param since it is part of the compute QMD which can't be dirty tracked.	2022-12-03 22:50:56 +00:00
Billy Laws	7f93ec3df6	Commonise maxwell3d interconnect common code for use by other engines The compute engine will require most of this for basic functionality.	2022-12-03 22:50:56 +00:00
Billy Laws	281838fde1	Apply GPU readback hack to both buffers and textures And rename as appropriate.	2022-12-03 22:50:56 +00:00
Billy Laws	f358c4517e	Update edge credits	2022-12-03 22:50:56 +00:00
Billy Laws	eb00dc62f8	Implement support for 36 bit games by using split code/heap mappings Although rtld and IPC prevent TLS/IO and code from being above the 36-bit AS limit, nothing depends the heap being below it. We can take advantage of this by stealing as much AS as possible for code in the lower 36-bits.	2022-12-02 22:10:03 +00:00
Dima	e8e1b910c3	Add possibility to disable audio output	2022-12-02 00:33:28 +01:00
lynxnb	70109f8fbd	Work around invalid values in `CNTFRQ_EL0` register Exynos SoCs have a bug where the `CNTFRQ_EL0` register is either set to 0 or contain incoherent values. With this patch, the frequency value is loaded into a static variable and used instead of reading the register. The value will be initialised to the correct value for affected SoCs, while unaffected ones will use the value from the register.	2022-12-02 00:23:28 +01:00
lynxnb	54d0246ca6	Tweak `GpuDriverActivity` FAB padding	2022-11-28 00:06:07 +01:00
lynxnb	2e8d7b559c	Use the original view padding/margin when applying window insets Adding to the current view padding/margin values results in applying the insets over and over again as insets listeners can be called multiple times.	2022-11-28 00:04:39 +01:00
Billy Laws	b2384e83f5	Add prepo:a service	2022-11-25 16:26:00 +00:00
Billy Laws	736216a6f4	Stub OpenPatchDataStorageByCurrentProcess	2022-11-25 16:26:00 +00:00
Billy Laws	44033d7f8d	Adjust CalendarTime year to be relative to 0AD	2022-11-25 16:26:00 +00:00
Billy Laws	2ce2604421	Implement VFS file deletion	2022-11-25 16:26:00 +00:00
Billy Laws	6c968e0357	Fix GetEntryType IPC return type	2022-11-25 16:26:00 +00:00
lynxnb	ec220c8ea9	Use an extended FAB in `GpuDriverActivity`	2022-11-23 19:49:42 +05:30
lynxnb	163f4f2014	Fix window insets handling when in landscape mode To avoid code duplication, insets handling has been moved to a separate interface.	2022-11-23 19:49:42 +05:30
lynxnb	ab6c5f4c50	Improve robustness of `KeyReader.import` * Close the input and output file streams before moving the output file to the final destination * Clean up the destination path before moving the new file * Introduce a `ImportResult` return value to differentiate between the possible causes of import errors * Display more meaningful error messages in the UI	2022-11-23 19:49:42 +05:30
lynxnb	38129d9dc3	Mark some strings as non-translatable	2022-11-23 19:49:42 +05:30
lynxnb	ee8c055641	Make `GpuDriverInstallResult` PascalCase	2022-11-23 19:49:42 +05:30
Billy Laws	7f1667de82	Avoid using trapping for frequently trapped shaders Fall back to hashing for every shader access as that ends up being faster than applying traps for every execution.	2022-11-19 12:49:05 +00:00
Billy Laws	06095918a9	Introduce per-channel sequence number for invalidation tracking For cases like shaders, which may be uploaded through I2M (which no longer causes an execution) we need a way to cause an invalidation on all writes	2022-11-19 12:49:05 +00:00
Billy Laws	97e3f7fd34	Increase max swapchain image count	2022-11-19 12:49:05 +00:00
Billy Laws	c49119f5ef	Fixup depth bounds register arguments	2022-11-19 12:49:05 +00:00
Billy Laws	db3c5c33c4	Clamp depth bounds into 0-1 range	2022-11-19 12:49:05 +00:00
Billy Laws	e1bbd521d9	Fix potential circular queue submission race If a producer thread was waiting for the queue to have free space and the consumer thread hadn't yet acquired the production mutex a deadlock could occur	2022-11-19 12:49:05 +00:00
Billy Laws	13baf2312f	Add a workaround for sampling BGRA textures with a swizzle	2022-11-19 12:49:05 +00:00
Billy Laws	13a96c5aba	Implement a helper shader for partial clears These are not natively supported by Vulkan, so use a helper shader and colorWriteMask for the same behaviour.	2022-11-19 12:49:05 +00:00
Billy Laws	ac0e225114	Use vkCmdBlit for texture copies when formats dont match	2022-11-19 12:49:05 +00:00
Billy Laws	c8fc8f84ec	Fallback to RGBA888 for unsupported swapchain formats as opposed to swizzle	2022-11-19 12:49:05 +00:00
Billy Laws	e0bc0d3a97	Avoid megabuffering buffers larger than the chunk size	2022-11-19 12:49:05 +00:00
Billy Laws	b6f49884b3	Use lower_bound to speedup texture hostMapping lookup	2022-11-19 12:49:05 +00:00
Billy Laws	e7fda28ac6	Skip over textures in cache which have been replaced with a layer/mip match	2022-11-19 12:49:05 +00:00
Billy Laws	88cc696c7f	Only use 2D array depth targets when depth > 1	2022-11-19 12:49:05 +00:00
Billy Laws	7fed971b2d	Take firstIndex into account when calculating index (quad) buffer size Without this we would miss any elements beyond indexCount in the index buffer and they would be filled with random garbage causing vertex bombs	2022-11-19 12:49:05 +00:00
Billy Laws	1f9de17e98	Begin command buffers asynchronously in command executor vkBeginCommandBuffer can take quite some time on adreno, move it to the cycle waiter thread where it won't block GPFIFO.	2022-11-19 12:49:05 +00:00
Billy Laws	4b3e906c22	Update cached buffer execution number when megabuffering	2022-11-19 12:49:05 +00:00
Billy Laws	3ae1e78544	Match mip layers and array layers in texture manager	2022-11-19 12:49:05 +00:00
Billy Laws	d502adb309	Avoid WRW hazard in subpass deps	2022-11-19 12:49:05 +00:00
Billy Laws	e9313cc291	Use view layer count over texture for attachments	2022-11-19 12:49:05 +00:00
Billy Laws	e65ca52d91	Avoid potential buffer copy race	2022-11-19 12:49:05 +00:00
Dima	720cfaafb6	Stub caps:su	2022-11-18 15:35:03 +00:00
Dima	74afca4aab	Stub caps:u	2022-11-18 15:35:03 +00:00
Dima	27ff1ae19b	Stub caps:c	2022-11-18 15:35:03 +00:00
Dima	ffb0546609	Stub caps:a	2022-11-18 15:35:03 +00:00
Dima	1c8736cb56	Stub IsLargeResourceAvailable	2022-11-18 12:52:25 +00:00
Dima	dcd9e4ff61	Stub SetIdleTimeDetectionExtension, SetAlbumImageTakenNotificationEnabled	2022-11-18 12:52:25 +00:00
Dima	60843269de	Stub GetBlockedUserListIds and UpdateUserPresence	2022-11-18 12:52:25 +00:00
Dima	2cdfc7640c	Stub GetPreviousProgramIndex	2022-11-18 12:52:25 +00:00
Dima	360306eb61	Stub GetAddOnContentListChangedEventWithProcessId	2022-11-18 12:52:25 +00:00
Dima	3d475ca122	Stub GetAccountId	2022-11-18 12:52:25 +00:00
Dima	0b452fe36b	Stub GetFriendList	2022-11-18 12:52:25 +00:00
Dima	cc37d2231d	Stub CheckFreeCommunicationPermission and IsFreeCommunicationAvailable	2022-11-18 12:52:25 +00:00
Dima	ec81c97fa9	Stub TryPopFromFriendInvitationStorageChannel	2022-11-18 12:52:25 +00:00
Dima	413f162cf2	Stub some account functions	2022-11-18 12:52:25 +00:00
lynxnb	b209ae8e90	Account for stick flat area when retrieving axes value from a `MotionEvent`	2022-11-17 21:54:15 +01:00
lynxnb	c966220bab	Zero-initialize axes history instead of using null values Use zero initialization for axes history instead of using null values. Fixes the first axis event after launching a game being completely ignored.	2022-11-17 21:54:15 +01:00
lynxnb	3a657c44cc	Don't ignore HAT axes input events Capture HAT axes events ourselves instead of relying on the android framework to turn them into KeyCodes. Fixes handling of DPAD button presses on most controllers.	2022-11-17 21:54:15 +01:00
lynxnb	f1ec771944	Fix inverted axis polarity In the case of axis value being zero, polarity would favor one side of the stick resulting in invalid values. Fix that by taking into account axis history when calculating polarity.	2022-11-17 21:54:15 +01:00
lynxnb	675e8dbb2e	Move input handling code to a dedicated class	2022-11-17 21:54:15 +01:00
lybxlpsv	18861d73a3	Set systemUiVisibility during onResume for A11	2022-11-17 14:04:57 +01:00
Dima	262ee28611	Stub some bsd functions Co-authored-by: Lunar-Pixel <83507264+Lunar-Pixel@users.noreply.github.com>	2022-11-15 16:24:33 +00:00
Dima	9afa8b881e	Stub nsd:u/nsd:a and sfdnsres services	2022-11-15 16:24:33 +00:00
Billy Laws	01e27bd2dd	Implement ldr:ro LoadModule	2022-11-15 16:23:40 +00:00
Billy Laws	e571066409	Stub ldr:ro IRoInterface Some games initialise this service on startup however don't actually use it. Add a simple stub to allow such games to boot.	2022-11-15 16:23:40 +00:00
Billy Laws	1fc2641746	Stub the web applet	2022-11-13 11:37:18 +00:00
Billy Laws	021f82ef08	Stub ListOpenContextStoredUsers	2022-11-13 11:35:40 +00:00
Billy Laws	e7bab27d85	Fixup nvdrv channel private memory allocation This was incorrectly allocated in words, rather than bytes, meaning that guest allocations could overwrite the private memory and break inline syncpt operations	2022-11-13 11:35:40 +00:00
Billy Laws	8b523fa1f0	Avoid inline syncpt increments sending OOB GpEntries In cases where no wfi is required, the space where the WFI commands would go needs to be zeroed out to avoid the GPU reading uninitialised memory.	2022-11-13 11:35:40 +00:00
Billy Laws	cd0b2636e5	Prevent truncation of big page start in GetVaRegions	2022-11-13 11:35:40 +00:00
Billy Laws	f650f32bf0	Avoid duplicating NvDrv buffer unmap code	2022-11-13 11:35:40 +00:00
Billy Laws	001064b7bf	Fix GraphicsBufferProducer recreation We need to use a shared_ptr to ensure that the present callback doesn't do any UAFs, also unlocks the GBP during presentation as if the queue is full a deadlock could a rise where the present callback wouldn't be able to run due to the (waiting) DequeueBuffer thread holding the lock.	2022-11-13 11:35:40 +00:00
Billy Laws	29e89a3950	Fix crashes when opening non-existent directories	2022-11-13 11:35:40 +00:00
Billy Laws	ec139b3027	Fixup CancelBuffer fence handling	2022-11-13 11:35:40 +00:00
Billy Laws	7f24c7b857	Store KMemory object ptrs in memory class to avoid linear-time unmap This is quite a horrible solution but fixing it properly would require a whole rewrite of how we handle memory.	2022-11-13 11:35:15 +00:00
lynxnb	cc71e7b56c	Run emulation in a separate process Exiting from emulation has always been a big issue for Skyline, with guest and host threads that would keep running in the background unless the app was manually killed. Running emulation in a separate process allows us to kill it when we are done, avoiding the need for complex exiting management code.	2022-11-11 11:49:33 +00:00
lynxnb	281562ccdb	Fix FABs ripple effect in `OnScreenEditActivity`	2022-11-09 23:07:23 +05:30
lynxnb	56f6f8a362	Reword the unsupported gpu drivers message The old message was being misinterpreted as if the device's gpu was not supported by the emulator. Reword that message to explicitly mention custom drivers.	2022-11-09 23:07:23 +05:30
lynxnb	e2a5da1d67	Fix `AppDialog` layout * Add a drag indicator at the top * Fix flex layout wrapping when buttons didn't fit on a single line * Fix BottomSheetDialog peek height too small on landscape orientation * General cleanup of the layout	2022-11-09 23:07:23 +05:30
lynxnb	4146261069	Create a unified style for section titles	2022-11-09 23:07:23 +05:30
lynxnb	86364a84a2	Allow content to be drawn behind the navigation bar	2022-11-09 23:07:23 +05:30
lynxnb	6a6e89f070	Make `BottomSheetDialog` go fullscreen when fully expanded	2022-11-09 23:07:23 +05:30
lynxnb	f93d3b78d3	Add a drag indicator element at the top of LicenceDialog A new `DragIndicatorView` had been introduced, which draws a small drag handle element. When used inside a `BottomSheetDialog`, this view will add a callback for hiding the indicator when the dialog is fully expanded.	2022-11-09 23:07:23 +05:30
lynxnb	6848e69638	Improve design consistency across the app Game images, buttons and dialogs now have a consistent corner radius, across all game list layouts.	2022-11-09 23:07:23 +05:30
lynxnb	f63fdf26c9	Partially revert 'Make all `Dialog`s use `@color/backgroundColor` as the background color' This partially reverts commit `36a1f2a2ec`.	2022-11-09 23:07:23 +05:30
lynxnb	dec04db647	`AppListItem` misc tweaks * Restore text marquee on all layouts * Text size and color tweaks * List layout image has round corners * Clean up unneeded attributes	2022-11-09 23:07:23 +05:30
lynxnb	5c76a57e6e	Use `NestedScrollView` for licence dialogs and minor layout tweaks	2022-11-09 23:07:23 +05:30
Billy Laws	388245789f	Restructure ConditionalVariableSignal to avoid potential deadlock Since InsertThread can block for paused threads, we need to ensure we unlock syncWaiterMutex when calling it.	2022-11-09 23:02:26 +05:30
PixelyIon	f4a8328cef	Implement Symbol Hooking Symbol hooking is required for HLE implementations of certain features in the future such as `nvdec` and for more in-depth debugging of games as we can inspect them on a SDK function level which allows us to debug issues far more easily.	2022-11-07 23:56:22 +05:30
PixelyIon	8892eb08e6	Fix `MoveRegister` to clear when value is 0 The register wouldn't be cleared with a `MOVZ` when a value was zero due to the condition for writing an instruction requiring the `offsetValue` to be non-zero.	2022-11-07 23:56:22 +05:30
Billy Laws	f7ab3abb86	Allow load balancing when waiting on condvars	2022-11-06 20:47:26 +00:00
german77	b6e2fb894c	service: bcat: Stub CreateDeliveryCacheStorageService	2022-11-06 20:39:41 +00:00
Billy Laws	80b65d5094	Update XDR names	2022-11-03 22:53:01 +00:00
Billy Laws	a940d6fd34	Update submodules	2022-11-02 17:46:07 +00:00
Billy Laws	026bb04386	Impl some more texture formats	2022-11-02 17:46:07 +00:00
Billy Laws	133f08ed14	Stash new register value before executing deferred draws/updates Since the register writes technically happen after the draw, issues can occur if they happen before: e.g. skyrim updates ctSelect and disables all RTs after a draw, but this would happen before it previously and crash the driver.	2022-11-02 17:46:07 +00:00
Billy Laws	c50852e546	Implement the draw(...)BeginEnd Maxwell3D draw registers Used by guest Vulkan games and nouveau.	2022-11-02 17:46:07 +00:00
Billy Laws	270ef3e0d2	Implement GPFIFO semaphore acquire operations	2022-11-02 17:46:07 +00:00
Billy Laws	2ce146e28f	Don't crash on the Grp0SetSubDevMask TertOp Used by Vulkan games to set the SLI mask, not applicable to the switch.	2022-11-02 17:46:07 +00:00
Billy Laws	1d83dadefb	Drop size restruction bypass for frequently synced buffers In cases where large buffers are updated every draw this could seriously increase memory usage beyond 3GB in the megabuffer.	2022-11-02 17:46:07 +00:00
Billy Laws	1088ed514c	Introduce texture usage system to ensure RPs are split when necessary Vulkan doesn't allow sampling a texture and using it as an RT in the same RP, by tracking the texture usage status and splitting RPs when this occurs we can avoid such potential sync errors.	2022-11-02 17:46:07 +00:00
Billy Laws	2dd4698441	Adjust texture matching hacks	2022-11-02 17:46:07 +00:00
Billy Laws	4f5c9047ef	Add some additional texture formats used by Vulkan games	2022-11-02 17:46:07 +00:00
Billy Laws	6a830dfac5	Use shader-compiler side {S,U}Scaled format emulation	2022-11-02 17:46:07 +00:00
Billy Laws	579fd04117	Fixup ReadTextureType shader compiler callback	2022-11-02 17:46:07 +00:00
Billy Laws	b04d18eba5	Add support for split mappings to I2M uploads Used by Super Mario Sunshine and other Vulkan games.	2022-11-02 17:46:07 +00:00
Billy Laws	db5e208379	Clear images even when aspects mismatch	2022-11-02 17:46:07 +00:00
Billy Laws	3c8df327f1	Fixup subpass barriers and flags	2022-11-02 17:46:07 +00:00
Billy Laws	5ab80901c6	Drop some debug code	2022-11-02 17:46:07 +00:00
Billy Laws	4de89c8839	GPU NEW MARGEBAC	2022-11-02 17:46:07 +00:00
Billy Laws	7670c83405	Ensure textures are clean before paging them out	2022-11-02 17:46:07 +00:00
Billy Laws	1a2351386d	Add u64 iova ctor	2022-11-02 17:46:07 +00:00
Billy Laws	93d43e0115	Fully fill in swizzle component mappings Avoids the rest being default initialised to identity, which would break the intended effect of them.	2022-11-02 17:46:07 +00:00
Billy Laws	37ff0ab814	Add buffer manager support for accelerated copies These will be sequenced on the GPU/CPU depending on what's optimal and avoid any serialisation	2022-11-02 17:46:07 +00:00
Billy Laws	cac287d9fd	Implement accelerated uploads/copies through buffer manager Previously, both I2M uploads and DMA copies would force GPU serialisation if they happened to hit a trap or were used to copy GPU dirty buffers. By using the buffer manager to implement them on the host GPU we can avoid such slowdowns entiely.	2022-11-02 17:46:07 +00:00
Billy Laws	c5ec484d9a	Avoid redundantly passing executor in ctors when it's already in ChannelCtx	2022-11-02 17:46:07 +00:00
Billy Laws	463394ba72	Pass correct size for XFB buffers	2022-11-02 17:46:07 +00:00
Billy Laws	bd976676f4	Fix SNorm vertex formats	2022-11-02 17:46:07 +00:00
Billy Laws	b74098570f	Zero-out unused XFB varyings before passing to hades	2022-11-02 17:46:07 +00:00
Billy Laws	22f3ba6b93	Mark XFB buffers as GPU dirty	2022-11-02 17:46:07 +00:00
Billy Laws	26aeeaecf5	Add constant buffer GPU write pipeline barrier	2022-11-02 17:46:07 +00:00
Billy Laws	0b5d9308c4	Be more careful about potentially-unneeded GPU->CPU syncs These can be especially expensive so should be avoided as much as possible.	2022-11-02 17:46:07 +00:00
Billy Laws	e6530e2386	Delete graphics_context F	2022-11-02 17:46:07 +00:00
Billy Laws	ac2e6c125b	Switch to Roboto for Korean font	2022-11-02 17:46:07 +00:00
Billy Laws	b24a8465da	Don't require depthClamp	2022-11-02 17:46:07 +00:00
Billy Laws	9055c98e09	Only enable debug/verbose logs in (rel)debug builds	2022-11-02 17:46:07 +00:00
Billy Laws	0ebdbcf0ff	Don't lock stateMutex when updating buffer cycle	2022-11-02 17:46:07 +00:00
Billy Laws	dd360b8f75	Pass correct wait semaphore array size to queue submit	2022-11-02 17:46:07 +00:00
Billy Laws	c78a4b9699	Fixup buffer recreation to avoid deadlock when waiting on srcs	2022-11-02 17:46:07 +00:00
Billy Laws	d236bfe454	Enable depthClamp VK device feature	2022-11-02 17:46:07 +00:00
Billy Laws	95d849e1f6	Check FenceCycle signalled flag immediately before waiting The lock release within the wait for submission means that another thread could end up signalling the cycle and then the VK wait still happen after when the lock has been reacquired.	2022-11-02 17:46:07 +00:00
Billy Laws	1a23b929a7	Avoid chaining cycles in buffer recreation This had a chance of creating circular chains which obviously caused issues, just do a wait instead for now.	2022-11-02 17:46:07 +00:00
Billy Laws	a15db9cb06	Update hades submodule	2022-11-02 17:46:07 +00:00
Billy Laws	cfc55e60b0	Add robin map submodule	2022-11-02 17:46:07 +00:00
Billy Laws	6c0f084aae	Introduce hack to ignore frequently read-back textures Readback can be especially slow on mobile due to the varying load pattern it creates which often prevents the CPU/GPU from clocking up. Since some games perform texture readback but don't actually use it for anything significant implement a hack to skip it and significantly improve performance in such cases.	2022-11-02 17:46:07 +00:00
Billy Laws	e45e7546c8	Redesign buffer megabuffering Due to the frequency at which is is called megabuffering performance is critical to the performance of the entire emulator, especially in high-drawcall-count scenarios. After the view redesign, megabuffering on a per-view level was no longer possible nor desirable, and thus megabuffering was modified to just copy for every usage of a view. This worked great at the time since there were other bottlenecks, however gpu-new has since removed almost all of them and megabuffering is now a major sore point. Fix this by megabuffering small chunks and storing them in a page-table like structure within the buffer, these chunks can be referenced by multiple views and will be smartly invalidated whenever the sequence number or execution number changes to avoid any sequencing issues. In addition to this, to help the case where almost the whole buffer is read every single frame across a set of multiple views, an optimisation to skip the chunked tracking and use one large single megabuffer allocation and one single memcpy has been introduced. This reduces the overall amount of time spent in memcpy since large memcpys are quicker.	2022-11-02 17:46:07 +00:00
Billy Laws	7ea9aa52f5	Speed up reported guest GPU time Avoids triggering DRS in games in cases where it wouldn't actually benefit anything due to being CPU bottlenecked.	2022-11-02 17:46:07 +00:00
Billy Laws	31c2fb7d7a	Fixup IDirectory read	2022-11-02 17:46:07 +00:00
Billy Laws	7491178a9e	Pass base array layer to texture views	2022-11-02 17:46:07 +00:00
Billy Laws	ff57d2fbbf	Enforce stronger format and weaker dimension texture compat checks Rather than using just bpb for format compat, additionally check that the exact component bit layout matches since many games end up reusing RTs for unrelated textures. The texture size requirements have also been weaked to only check the resulting layer size as opposed to width/height - this is somewhat hacky but it gets around the problem of blocklinear alignment.	2022-11-02 17:46:07 +00:00
Billy Laws	14af383238	Only allow submitting `swapchainImageCount` images for host present at a time Prevents situations where nothing would otherwise be waiting on the GPU and since presentation no longer blocks too many images would be submitted for presentation.	2022-11-02 17:46:07 +00:00
Billy Laws	bcd96ac77d	Fixup A8R8G8B8 TIC format mapping 8-bit formats are inverted in TICs compared to Vulkan	2022-11-02 17:46:07 +00:00
Billy Laws	90466b8830	Implement depth clamp rasterisation state Used in SMO for shadows.	2022-11-02 17:46:07 +00:00
Billy Laws	1cfc4278f9	Disable preserve buffer/texture attachment opt for now Causes several issues and crashes in Pokemon without an obvious cause.	2022-11-02 17:46:07 +00:00
Billy Laws	e483cf9634	Use shader memory mirror when reading guest shaders Avoids triggering any traps that may be present on the region	2022-11-02 17:46:07 +00:00
Billy Laws	f6e4328b5a	Ensure blit src/dst textures are attached as execution cycle dependencies Since they're not in the TIC pool they would otherwise be freed	2022-11-02 17:46:07 +00:00
Billy Laws	77a131df60	Support using in-app renderdoc API to capture individual executions	2022-11-02 17:46:07 +00:00
Billy Laws	576bc6f37e	Add CommandExecutor slot count setting	2022-11-02 17:46:07 +00:00
Billy Laws	1a0819fb76	Use semaphores for presentation engine frame synchronisation Avoids waits on the CPU which can be costly and confuse the scheduler, also reduces latency significantly.	2022-11-02 17:46:07 +00:00
Billy Laws	0670e0e0dc	Support using Vulkan semaphores with fence cycles In some cases like presentation, it may be possible to avoid waiting on the CPU by using a semaphore to indicate GPU completion. Due to the binary nature of Vulkan semaphores this requires a fair bit of code as we need to ensure semaphores are always unsignalled before they are waited on and signalled again. This is achieved with a special kind of chained cycle that can be added even after guest GPFIFO processing for a given cycle, the main cycle's semaphore can be waited and then the cycle for the wait attached to the main cycle and it will be waited on before signalling.	2022-11-02 17:46:07 +00:00
Billy Laws	5b72be88c3	Stub ldn:u service	2022-11-02 17:46:07 +00:00
Billy Laws	77d76ed05a	Batch contiguous GMMU ranges into one	2022-11-02 17:46:07 +00:00
Billy Laws	e52dbf202f	Pass more Maxwell3D registers into interconnect	2022-11-02 17:46:07 +00:00
Billy Laws	83c7ed314e	Setup KThread pthread handle in StartThread Avoids a race with starting the thread and the handle not being set yet	2022-11-02 17:46:07 +00:00
Billy Laws	9784ae23e9	Skip checking affinity before taking load-balance WaitScheduler path The affinity mask may be set after the wait has began	2022-11-02 17:46:07 +00:00
Billy Laws	ad3195e06f	Split out guest texture layer size calcs into a seperate func	2022-11-02 17:46:07 +00:00
Billy Laws	8fa83fdf13	Fix deswizzling non-pow2 block size formats We need to use DivideCeil to avoid rounding off part of the texture. Fixes texture in Nier Automata: Game of the YoRHa edition.	2022-11-02 17:46:07 +00:00
Billy Laws	27de42f8df	Use surfaceClip as a hint for the underlying rendertarget size TIC sizes may not be aligned to block linear dimensions whereas RT sizes are and then limited by the surface clip. By using this to determine surface size we are more likely to get a match in texture manager for any future usages.	2022-11-02 17:46:07 +00:00
Billy Laws	297597f697	Fix texture manager depth compat comparison	2022-11-02 17:46:07 +00:00
Billy Laws	500f817a28	Synchronize all non-matching textures back to host before recreation	2022-11-02 17:46:07 +00:00
Billy Laws	05581f2230	Remove now redundant buffer/texture/megabuffer manager locks They have been superseeded by the global channel lock	2022-11-02 17:46:07 +00:00
Billy Laws	f5a141a621	Add dirty resource operator*	2022-11-02 17:46:07 +00:00
Billy Laws	b72720e8db	Finish off transform feedback implementation	2022-11-02 17:46:07 +00:00
Billy Laws	36fd885b49	Pack all draw state into a struct to avoid std::function allocations	2022-11-02 17:46:07 +00:00
Billy Laws	b5d0060c3f	Only use scissor for clear rect when enabled	2022-11-02 17:46:07 +00:00
Billy Laws	f93df35e6c	Only set line width when wideLines feature is supported	2022-11-02 17:46:07 +00:00
Billy Laws	4cebdfc8d3	Pass texture and cbuf state into pipeline manager for hades callbacks	2022-11-02 17:46:07 +00:00
Billy Laws	9ce848d4e0	Implement descriptor update batching and push descriptors Batching helps to avoid the need to attach so many objects to the fence cycle, which ends up taking a fair bit of time due to the allocation required.	2022-11-02 17:46:07 +00:00
Billy Laws	62a165b51e	Reformat maxwell3d interconnect codebase	2022-11-02 17:46:07 +00:00
Billy Laws	3766be59e7	Zero out vertex attribute state when disabled to avoid creating redundant pipelines	2022-11-02 17:46:07 +00:00
Billy Laws	751e3356e1	Keep shader trap lock held for the duration of an execution Avoids constant relocking on the GPFIFO thread (~0.5% of total time)	2022-11-02 17:46:07 +00:00
Billy Laws	314a9bccbc	Allow megabuffering readonly SSBOs	2022-11-02 17:46:07 +00:00
Billy Laws	4c2db0ba01	Implement ReadCbufValue and ReadTextureType hades callbacks Used for bindless and BRX instruction emulation.	2022-11-02 17:46:07 +00:00
Billy Laws	2163f8cde6	Implement alpha test pipeline state	2022-11-02 17:46:07 +00:00
Billy Laws	c86ad638c4	Keep track of transform feedback varyings pipeline state	2022-11-02 17:46:07 +00:00
Billy Laws	7ad2d94345	Zero out blend state when disabled to avoid creating redundant pipelines	2022-11-02 17:46:07 +00:00
Billy Laws	4052a93051	Force non-pushdescriptors for blit helper shader	2022-11-02 17:46:07 +00:00
Billy Laws	cb2a8c6d24	Enable wideLines Vulkan feature	2022-11-02 17:46:07 +00:00
Billy Laws	a3369637a9	Don't entirely wipe out per-index TIC cache efter each execution Keep a copy of the old TIC entry and view even after purge caches and use the execution number to check validity instead, if that doesn't match then just memcmp can be used as opposed to a full hash and map lookup.	2022-11-02 17:46:07 +00:00
Billy Laws	98c0cc3e7f	Impl preserve attached buffers/textures to avoid GPFIFO lock thrashing When profiling SMO, it became obvious that the constant locking of textures and buffers in SyncDescriptors took up a large amount of CPU time (3-5%), a precious resource in intensive areas like Metro. This commit implements somewhat of a workaround to avoid constant relocking, if a buffer is frequently attached on the GPU and almost never used on the CPU we can keep the lock held between executions. Of course it's not that simple though, if the guest tries to lock a texture for the first time which has already been locked as preserve on the GPFIFO we need to avoid a deadlock. This is acheived through a combination of two things: first we periodically clear the locked attachments every 2*SlotCount submissions, preventing a complete deadlock on the CPU (just a long wait instead) and meaning that the next time the resource is attached on the GPU it will not be marked for preservation due to having been locked on the guest before; second, we always need to unlock everything when the GPU thread runs out of work, as the perioding clearing will not execute in this case which would otherwise leave the textures locked on the GPFIFO thread forever (if guest was waiting on a lock to submit work). It should be noted that we don't clear preserve attached resources in the latter scenario, only unlock them and then relock when more work is available.	2022-11-02 17:46:07 +00:00
Billy Laws	0e8ccf1e99	Use memory_order_release for new descriptor set allocations	2022-11-02 17:46:07 +00:00
Billy Laws	34db5097da	Avoid using a shared_ptr reference to cycle for command buffer submission Somehow without this we can sometimes get crashes during appending to circular queue	2022-11-02 17:46:07 +00:00
Billy Laws	0428e8c7da	Support forcing regular descriptor sets in VK pipeline cache	2022-11-02 17:46:07 +00:00
Billy Laws	0f394d516b	Lock FenceCycle inbetween waiting on chained cycles and checking signalled Avoids one race where we would end up hogging all the locks of chained cycles and ourself when waiting for submission of previous cycles and prevent any forward progress due to another thread locking one of the chained cycles.	2022-11-02 17:46:07 +00:00
Billy Laws	a015fe753d	Only write npad controllerInfo entry on the HID thread if it is valid	2022-11-02 17:46:07 +00:00
Billy Laws	b5446846f7	Stub IsSixAxisSensorAtRest	2022-11-02 17:46:07 +00:00
Billy Laws	6719572b3b	Keep track of how often textures/buffers are locked on the CPU For the upcoming preserve attachment optimisation, which will keep buffers/textures locked on the GPU between executions, we don't want to preserve any which are frequently locked on the CPU as that would result in lots of needless waiting for a resource to be unlocked by the GPU when it occasionally frees all preserve attachments when it could have been done much sooner. By checking if a resource has ever been locked on the CPU and using that to choose whether we preserve it we can avoid such waiting.	2022-11-02 17:46:07 +00:00
Billy Laws	993ffb56f4	Avoid waiting on texture/buffer fence with trapMutex locked This could cause deadlocks under certain circumstances by preventing the GPFIFO from making any progress while also waiting on it at the same time.	2022-11-02 17:46:07 +00:00
Billy Laws	3e8bd26978	Add a global gm20b channel lock Allowing for parallel execution of channels never really benefitted many games and prevented optimisations such as keeping frequently used resources always locked to avoid the constant overhead of locking on the hot path.	2022-11-02 17:46:07 +00:00
Billy Laws	57a4699bd1	Add IOCTL trace events	2022-11-02 17:46:07 +00:00
Billy Laws	7861968c05	Fix memory::Buffer move constructor	2022-11-02 17:46:07 +00:00
Billy Laws	ef0ae30667	Implement Maxwell3D texture pool management and view creation Ontop of the TIC cache from previous code a simple index based lookup has been added which vastly speeds things up by avoding the need to hash the TIC structure every time.	2022-11-02 17:46:07 +00:00
Billy Laws	5542459c75	Use a SpinLock for guest shader code cache trap mutex	2022-11-02 17:46:07 +00:00
Billy Laws	3e12cde4d5	Make active Vulkan pipeline public	2022-11-02 17:46:07 +00:00
Billy Laws	2556966ec5	Don't attach textures to the active cycle in AttachTexture	2022-11-02 17:46:07 +00:00
Billy Laws	7dc3dde815	Introduce support for waiting for submission to FenceCycle Introducing async record resulted in breaking the assumption that any work submitted through command scheduler would be submitted in order with graphics submits. Since async record now unlocks the texture before it's submitted a seperate mechanism is needed to ensure ordering of submits. This is achieved by building support into fence cycle itself, with a conditional variable that is waited on for submission before any fence waits occur.	2022-11-02 17:46:07 +00:00
Billy Laws	54b85583ae	Fix layout transition in Texture::CopyFrom	2022-11-02 17:46:07 +00:00
Billy Laws	0f7c04ffb4	Use target format bpb when calculating linear mip level size	2022-11-02 17:46:07 +00:00
Billy Laws	849184452c	Add function to check if any guest texture mappings are unmapped	2022-11-02 17:46:07 +00:00
Billy Laws	98cb94ca6c	Bind an empty uniform buffer in place of unbound constant buffers	2022-11-02 17:46:07 +00:00
Billy Laws	55b85d0691	Implement combined image samplers and make descriptor code common between quick/normal updates	2022-11-02 17:46:07 +00:00
Billy Laws	ccf2d59351	Fixup input rate reading from packed pipeline state	2022-11-02 17:46:07 +00:00
Billy Laws	13970a5644	Refresh pipeline cached storage buffer bindings after each execution	2022-11-02 17:46:07 +00:00
Billy Laws	6bb2853ca0	Keep track of combined image samplers for quick bind	2022-11-02 17:46:07 +00:00
Billy Laws	040db37a28	Fix descriptor copies to be one per descriptor type	2022-11-02 17:46:07 +00:00
Billy Laws	d482e0ea98	Fix shader stage iteration to not miss the pixel stage	2022-11-02 17:46:07 +00:00
Billy Laws	0b808cc22b	Use Sint/Uint attribute type in place of Sscaled/Uscaled Scaled formats are not supported by any mobile GPUs.	2022-11-02 17:46:07 +00:00
Billy Laws	92ce220d3a	Ignore constant buffer selector size for updates Clustertruck performs a giant (0x3000 byte) update with a selector size of only 0x500, without this the update would only partially go through	2022-11-02 17:46:07 +00:00
Billy Laws	33f16ca26e	Handle unmapped blocks in CachedMappedBufferView	2022-11-02 17:46:07 +00:00
Billy Laws	5c0e4a839d	Fix SW BC2 decoding pitch	2022-11-02 17:46:07 +00:00
Billy Laws	60863fa162	Make viewports fallback to viewport 0 when their dimensions are invalid OpenGL games use a zero width/height for viewports 1-15 when they're unused	2022-11-02 17:46:07 +00:00
Billy Laws	586f872655	Update indexed quad conversion for new API	2022-11-02 17:46:07 +00:00
Billy Laws	498b4966d3	Avoid crashing on unmapped buffers Just log a warning instead	2022-11-02 17:46:07 +00:00

... 6 7 8 9 10 ...

2108 Commits