skyline

mirror of https://github.com/skyline-emu/skyline.git synced 2024-11-06 06:55:09 +01:00

Author	SHA1	Message	Date
Billy Laws	dbb684835f	Fix depthClampDisable register offset in Maxwell 3D	2022-07-29 20:07:14 +01:00
Billy Laws	7fd9d347e3	Use per-RT blend enable registers even when independent blend is disabled The common blend enable register seems to be used for something else. This is required for blending to work correctly in OpenGL games	2022-07-29 20:07:14 +01:00
Billy Laws	048c2fdd29	Fix Vulkan framebuffer dimensions calculations The framebuffer needs to be large enough to contain both the render area extent and offset	2022-07-29 20:07:14 +01:00
Billy Laws	0e1aa765fc	Prevent CNTVCT_EL0 reads from being optimised out by the compiler Without this the compiler will assume the read always produces the same value, causing issues when the register is used to time function execution	2022-07-29 20:07:14 +01:00
lynxnb	d183d14e2a	Make accesses to setting values thread-safe	2022-07-26 20:16:24 +05:30
lynxnb	30667a0899	Remove unused `Compact Logs` settings Since we don't have a log viewer in the app anymore, the setting was left unused and can be safely removed.	2022-07-26 20:16:24 +05:30
lynxnb	5aa2a4cd1c	Rename `SettingsValues` to `NativeSettings` The previous name was chosen as an afterthought and didn't clearly indicate what the purpose of the class is. We needed a separate, simple class without delegates members (like PreferenceSettings), so that its fields can be easily accessed via JNI to get settings values from native code.	2022-07-26 20:16:24 +05:30
lynxnb	f734c4d145	Make log level setting changes immediately active	2022-07-26 20:16:24 +05:30
lynxnb	bb4937121f	Remove settings from SharedPreference if they are of the wrong type	2022-07-26 20:16:24 +05:30
lynxnb	2840a126dd	Introduce `AndroidSettings` class and use inheritance The `Settings` class now has a pure virtual `Update` method, and uses inheritance over template specialization for platform-specific behavior override.	2022-07-26 20:16:24 +05:30
lynxnb	3905728447	Make every setting observable individually A `Setting` delegate class has been introduced, holding the raw value of the setting and adding support for registering callbacks to that setting. Callbacks will then be called when the value of that setting changes. As a result of this, raw setting values have been made accessible through pointer dereference semantics.	2022-07-26 20:16:24 +05:30
lynxnb	5b4ca79dc8	Rename `Settings` Kotlin class to `PreferenceSettings` SharedPreferences will be partially swapped out in the future to support per-game settings. In the meantime, make it clear from which class settings are coming from.	2022-07-26 20:16:24 +05:30
lynxnb	3b27540250	Rename `operationMode` setting to `isDocked`	2022-07-26 20:16:24 +05:30
lynxnb	69cf25b1a7	Initial support for updating settings during emulation + observing settings changes	2022-07-26 20:16:24 +05:30
lynxnb	c5dde5953a	Rework how settings are shared between Kotlin and native side Settings are now shared to the native side by passing an instance of the Kotlin's `Settings` class. This way the C++ `Settings` class doesn't need to parse the SharedPreferences xml anymore.	2022-07-26 20:16:24 +05:30
lynxnb	4be8b4cf66	Add missing SPDX licence header	2022-07-26 20:16:24 +05:30
lynxnb	365ca66b1b	Make integer settings use IntegerListPreference Avoids unnecessary type casting of setting values and duplication in resource files.	2022-07-26 20:16:24 +05:30
lynxnb	cbc896c8f8	Fix `waitForFences` crash on Mali drivers Mali GPU drivers utilize the `ppoll()` syscall inside `waitForFences` which isn't correctly restarted after a signal, which we can receive at any time on a guest thread. This commit fixes that by recursively calling the function on failure till it succeeds or returns an unexpected error. Co-authored-by: PixelyIon <pixelyion@protonmail.com> Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-07-14 20:34:16 +02:00
MCredstoner2004	942e22f275	Write `ApplicationErrorArg` `ErrorApplet`s to log These applets are used by applications to display a custom error message to the user. Both the error message and the detailed error message are printed to the error log. Co-authored-by: lynxnb <niccolo.betto@gmail.com>	2022-07-02 09:48:59 +05:30
MCredstoner2004	f9a0394577	Implement Software Keyboard applet This implements the non-inline version of the Software Keyboard (swkbd) applet, which games use to get text input from the user.	2022-07-01 15:19:53 -05:00
MCredstoner2004	a9ee06914d	Add ByteBufferSerializable This allows sending C-like structs between Kotlin and C++ without struct-specific code	2022-06-30 01:17:32 +05:30
Billy Laws	a0275418d6	Add a single-header linear allocator implementation This conforms to the C++ 'Allocator' named requirement allowing it to be used with any STL type and allows drastically reducing allocation times in cases which are suited for linear allocation.	2022-06-28 21:33:04 +01:00
Billy Laws	e816256220	Add blend, scissor, viewport and vertex state to shader hash These caused a ton of additional comparisons in Zelda Link's Awakening as many shaders would have the same hash.	2022-06-28 21:32:59 +01:00
lynxnb	e6cfdeb06a	Fix non-indexed quad draws Certain non-indexed quad draws would mistakenly take the indexed quad path because of the assumption that they would not have a bound index buffer. This resulted in a crash for most games using quads due to a faulty exception `Indexed quad conversion is not supported`, when in fact they were not using indexed quads. Co-authored-by: PixelyIon <pixelyion@protonmail.com> Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-06-23 10:57:11 +02:00
lynxnb	8fc3bc75f4	Allow providing an index type to calculate quad conversion buffer size	2022-06-23 00:15:44 +02:00
Billy Laws	7709dc8cf6	Rewrite buffer megabuffering to be per view and more efficient This commit implements several key optimisations in megabuffering that are all inherently interlinked. - Megabuffering is moved from per-buffer to per-view copies, this makes megabuffering possible for small views into larger underlying buffers which is often the case with even the simplest of games, - Megabuffering is no longer the default option, it is only enabled for buffer views that have had inline GPU writes applied to them in the past as that is the only case where they are beneficial. In any other case the cost of copying, even with a 128KiB limit can be significant. - With both of these changes, there is now possibility for overlapping views where one uses megabuffering and one does not. In order to allow GPU inline writes to work consistently in such cases a system of 'host immutability' has been implemented, when a buffer is marked as host immutable for a given cycle, all writes to the buffer from that point to the point the cycle is signalled will be performed on the GPU, ensuring that the backing contents are correctly sequenced	2022-06-11 17:05:39 +05:30
MCredstoner2004	2e356b8f0b	Use spans instead of ptr and size in kernel memory	2022-06-11 17:05:39 +05:30
Billy Laws	22039df301	Transition to std::unordered_set for buffer view tracking Has the same guarantees of pointer stabilty while also being significantly faster in cases where a buffer has thousands of views. This is the case in RE4 and this change leads to an almost 1000% performance improvement in that game.	2022-06-09 23:52:13 +01:00
Billy Laws	b75a06af1b	Support forcing 60Hz display on Xiaomi MIUI Uses an API found through RE since none of the AOSP APIs work, additionaly the code for setting RR was consolidated to a single function that can be ran after all display updates.	2022-06-09 19:29:18 +01:00
PixelyIon	a5ca370c36	Implement thread-safe MegaBuffer pool We currently have a global `MegaBuffer` instance that is shared across all channels, this is very problematic as `MegaBuffer` fundamentally works like a state machine with allocations (especially resetting/freeing) and is thread-specific. Therefore, we now have a pool of several `MegaBuffer`s which is allocated from by the `CommandExecutor` and kept channel specific as a result which also limits its usage to a single thread, this allows for individually resetting or freeing any allocations.	2022-06-05 13:04:40 +05:30
PixelyIon	3e08494146	Minor `CommandScheduler` refactor There was a lot of redundant code in the `CommandScheduler` when the same functionality could be achieved with much shorter and cleaner code which this commit fixes. This includes no changes to the user-facing API and does not require any changes on the user side as a result.	2022-06-05 13:04:40 +05:30
Billy Laws	bd99d79b51	OsFileSystem: Close directory after file listing is finished	2022-06-04 21:46:23 +01:00
Billy Laws	4888919515	Stub GetFriendInvitationStorageChannelEvent (0x8C)	2022-06-04 21:45:53 +01:00
Billy Laws	d9f6540831	Fix VFS CreateFile directory creation	2022-06-04 19:19:30 +01:00
Billy Laws	f5bcb40c41	Return number of audio outs in ListAudioOuts	2022-06-04 19:12:37 +01:00
Billy Laws	5d6902b3f8	Stub audin:u	2022-06-04 19:11:57 +01:00
Billy Laws	54999957a2	Remove RGB565 format workaround Will soon be redundant with new texture manager and is quite hacky so drop it.	2022-06-04 17:49:13 +01:00
Billy Laws	d79832091d	Force append slash to directory path in OsFilesystem::CreateDirectory The recursive path creation algorithm requires this to be the case	2022-06-04 17:44:49 +01:00
Billy Laws	616f7b7826	Correct instanced draw topology changed warning location Before it would trigger even when the draw had the instanceNext flag set and thus wasn't part of the instanced draw at all.	2022-06-04 17:43:03 +01:00
Billy Laws	deb7a0e22a	Implement 5x5 and 10x10 ASTC texture formats	2022-06-04 17:42:37 +01:00
Billy Laws	cc5a3f99c1	Reformat format description file	2022-06-04 17:42:13 +01:00
Billy Laws	a476bbaf4d	Add 11_11_10 vertex buffer format	2022-06-04 17:41:10 +01:00
Billy Laws	71c37dd6c4	Add D24X8Unorm depth RT format support	2022-06-04 17:40:49 +01:00
Billy Laws	d3af629b83	Support R32G32B32A32 int RT formats	2022-06-04 17:38:57 +01:00
Billy Laws	0f5f04ade3	Set default surfaceflinger parameters based off of preallocated buffers Required by resident evil 4 as otherwise Dequeue would fail due to it using BGRA buffers but the default being RGBA.	2022-06-04 16:55:08 +01:00
Billy Laws	106ad597db	Support BGRA8888 surfaceflinger format A swizzle is applied to R8G8B8A8 to transform it to BGRA since BGRA isn't a commonly supported swapchain format on Android.	2022-06-04 16:49:26 +01:00
Billy Laws	2bbeb6b08f	Fix OsFileSystem initial directory creation By passing basePath as an argument the CreateDirectory function did mkdir(basePath+basePath) which is obviously not the intended behaviour, fix this.	2022-06-03 19:33:31 +01:00
Billy Laws	84dec7561c	Dont cache rendertarget mappings Some games remap rendertargets or map them late which would lead to weird graphical bugs or crashes. Drop the caching since VMM lookup is fairly cheap anyway.	2022-06-03 19:31:52 +01:00
Billy Laws	581a016991	Add GuestTexture::GetSize helper function This code was getting duplicated a bit so commonise into a helper function.	2022-06-03 19:30:54 +01:00
Billy Laws	31d418ad54	Fix 3D semaphore counter type 0 handling Counter type 0 actually releases the semaphore payload rather than a constant zero as was previously thought. This is required by Skyrim.	2022-06-02 22:03:19 +01:00
Billy Laws	0202bf5531	Add semaphore release debug logs	2022-06-02 22:02:59 +01:00
Billy Laws	3736d36b75	Fix KPrivateMemory remap permissions	2022-06-02 18:10:35 +01:00
Billy Laws	389ab0fb50	Add {Map,Unmap}Physical memory debug logs	2022-06-02 18:10:10 +01:00
PixelyIon	2712b3276b	Fix incorrect `VkBufferImageCopy` offset calculations The `VkBufferImageCopy` offset calculations were wrong inside `CopyIntoStagingBuffer` as it multiplied the mip level's linear size by `levelCount` rather than `layerCount`. This led to substantial UB in games which called this function as it led to an overflow and resulted in writing to other areas of the buffer which caused major issues such as vertex/index buffer corruption and corresponding graphical glitches alongside likely being the cause of some crashes.	2022-06-02 22:14:22 +05:30
PixelyIon	06901ef22a	Fix BC7 output swizzling from BGRA to RGBA BC7 CPU decoding had the red and blue channels swapped around as it outputted a BGRA image after decoding while we expected an RGBA image to be produced. This should fix the colors of certain textures in titles such as Cuphead or Sonic Forces.	2022-06-02 19:48:55 +05:30
Billy Laws	9cb68c31e1	Stub nfp IUser::AttachAvailabilityChangeEvent	2022-06-02 00:04:01 +01:00
Billy Laws	33c9731eca	Implement IFileSystem::CreateDirectory	2022-06-02 00:04:01 +01:00
Billy Laws	a09414424b	Fix broken VFS directory creation	2022-06-02 00:04:01 +01:00
Billy Laws	3518e04a18	Correct Directory EntryType to be u8 rather than u32	2022-06-02 00:04:01 +01:00
Billy Laws	0c11d9e294	Implement IDirectory::GetEntryCount	2022-06-02 00:04:01 +01:00
MCredstoner2004	c15b3a8d40	Make Applet accesses to the data queues lock Avoids potential races when the guest access the same applet from more than one thread.	2022-06-02 03:47:38 +05:30
Billy Laws	91b2c47991	Fix potential nvdrv submission race The syncpoint maximum value represents the maximum possible syncpt value at a given time, however due to PBs being submitted before max was incremented, for a brief moment of time this is not the case which could lead to crashes or other such behaviour if a game waits on the fence at the right moment.	2022-06-01 17:15:25 +01:00
PixelyIon	37453ed7fa	Use `DocumentsProvider` for log sharing We used a `FileProvider` for log sharing prior, this is no longer necessary since it comes under the `DocumentsProvider` now which can be utilized to share the log document directly.	2022-06-01 21:41:14 +05:30
PixelyIon	8efa9298f9	Fix name conflict resolution for `copyDocument` Any documents with the same name existing in a directory that is copied to would cause an exception due to existing already, this fixes that by handling conflict resolution in those cases and automatically determining a file name that would avoid a conflict.	2022-06-01 21:41:14 +05:30
Billy Laws	c4bd9c47e4	Stub NVGPU_GPU_IOCTL_ZBC_SET_TABLE nvdrv ioctl This was missed in the original implementation and caused crashes in some games.	2022-06-01 16:59:14 +01:00
Billy Laws	c639fdcf06	Fixup NFP service stub state handling Previously a broken state value was returned from GetState that caused crashes in games using newer SDKs and NFP, correctly handle state now by updating it after initialisation.	2022-06-01 15:00:26 +01:00
Billy Laws	c745e0e02b	Move image type logic to GuestTexture, allowing 2D array views for 3D RTs We can't render to a 3D texture through a 3D view, we instead have to create a 2D array view into it and render to that. The texture manager previously didn't support having a different view type/layer count between a guest texture view and the underlying storage texture that is required to support this so that was also implemented by reading the view layer count from the dimensions depth instead if the underlying texture is 3D (and the view type is 2D array). Additionally move away from our own view type enum to Vulkan, inline with other guest texture member types.	2022-05-31 22:09:53 +01:00
Billy Laws	22695c4feb	Stub nim services used for eShop communication We obviously don't need to implement these so add a simple set of stubs to satify games using them (mainly demos such as DQXII)	2022-05-31 22:07:01 +01:00
Billy Laws	ff12dc9c10	Add R32_SFLOAT to adreno validation layer format filtering	2022-05-31 22:03:53 +01:00
Billy Laws	6cc925c2d3	Reset RT mappings on dimension and format changes	2022-05-31 17:49:16 +01:00
Billy Laws	8180bf852e	Lock textures before attaching in BlitContext	2022-05-31 16:54:13 +01:00
Billy Laws	cb2b36e3ab	Allow providing a callback that's called after every CPU access in GMMU Required for the planned implementation of GPU side memory trapping.	2022-05-31 16:04:27 +01:00
Billy Laws	46ee18c3e3	Require depthBiasClamp Vulkan device feature Used in some UE4 games and supported by 95% of devices so skip implementing a fallback path.	2022-05-31 14:46:45 +01:00
PixelyIon	e592b11039	Drop `samplerAnisotropy` as a required GPU feature Sampler anisotropy was made a required feature in an earlier commit due to its widespread availability but this was determined to be incorrect as certain Mali GPUs that can otherwise run 2D games in Skyline do not have this feature, while they are still not officially supported as this was the only roadblock to support them, it has now been made an optional feature.	2022-05-31 01:37:40 +05:30
PixelyIon	4336134b07	Reintroduce `android:hasFragileUserData` due to stable signature `android:hasFragileUserData` was added in an earlier commit but then removed due to it not functioning because of signature checks. Now that signatures are consistent across builds, it has been readded and should now allow carrying data across CI and developer builds.	2022-05-31 01:37:07 +05:30
PixelyIon	e1cc8676cf	Add option to view internal directory With the Skyline document provider, easy access to the internal directory is required which may be hard to navigate to through the system file manager. This adds an option in settings to directly open up the directory in the system file manager.	2022-05-31 01:25:18 +05:30
PixelyIon	ba97985b55	Revise `DocumentsProvider` URI structure The URIs (Document ID + Root) of the Skyline `DocumentsProvider` was unoptimal as it wasn't relative to a base directory. This is required for opening a root without knowledge of the full path in advance, it is therefore cleaner to provide a uniform `ROOT_ID` in a companion class.	2022-05-31 01:25:18 +05:30
Mylah Dee	ee7da31fc6	Add `DocumentProvider` for accessing internal files On Android 12 and above, files from an application's external storage directory cannot be accessed by the user. The only proper SAF-compliant way to solve this is to create a `DocumentProvider` which proxies access to internal storage accordingly.	2022-05-30 15:09:30 +05:30
Narr the Reg	7aa6a5c4ca	Add HID touch attribute and index reporting Adds missing parameter TouchAttribute and emulates correctly the touch point index. Both changes are necessary on Voez to keep track of each finger.	2022-05-29 10:28:51 +01:00
PixelyIon	80c8fb8791	Implement CPU BCn Texture Decoding Certain GPU vendors such as ARM's Mali do not have support for BCn textures whatsoever while other vendors such as AMD only have partial support (BC1-BC3). Most titles on the guest utilize BC textures and to address this on host GPUs without support for BCn, we need to decompress the texture on the CPU. This commit implements a CPU BCn texture decoder based off Swiftshader's BC decoder, it also adds the necessary infrastructure to have different formats for the `GuestTexture` and `Texture` objects.	2022-05-28 21:22:24 +05:30
PixelyIon	fe615b1e03	Clarify texture swizzling inner-loop iteration count The iterations of the inner loop for sector deswizzling was miscalculated as `SectorWidth * SectorHeight` while the result was correct at `32`, it should be determined by the amount of sector lines within a GOB i.e.: `(GobWidth / SectorWidth) * GobHeight`.	2022-05-28 21:22:24 +05:30
PixelyIon	7d4e0a7844	Implement Mipmapped Texture Support Support for mipmapped textures was not implemented which is fairly crucial to proper rendering of games as the only level that would load is the first level (highest resolution), that might result in a lot more memory bandwidth being utilized. Mipmapping also has associated benefits regarding aliasing as it has a minor anti-aliasing effect on distant textures. This commit entirely implements mipmapping support but it does not extend to full support for views into specific mipmap levels due to the texture manager implemention being incomplete.	2022-05-28 21:22:24 +05:30
PixelyIon	da7e6a7df7	Replace Maxwell DMA `GuestTexture` usage with new swizzling API Maxwell DMA requires swizzled copies to/from textures and earlier it had to construct an arbitrary `GuestTexture` to do so but with the introduction of the cleaner API, this has become redundant which this commit cleans up and replaces with direct calls to the API with all the necessary values.	2022-05-28 21:22:24 +05:30
PixelyIon	de300bfdbe	Refactor Texture Swizzling The API for texture swizzling is now more concrete and abstracted out from `GuestTexture`, this allows for neater usage in certain areas such as MaxwellDMA while having a `GuestTexture` wrapper as well allowing for neater usage in those cases. The code itself has also been cleaned up slightly with all usage of `u32`s being upgraded to `size_t` as this is simply more efficient due to the compiler not needing to emulate wraparound behavior for integer types smaller than the processor word size.	2022-05-19 17:13:55 +05:30
Billy Laws	72473369b6	Account for OOB copyOffsets in CircularBuffer::Read Caused crashes in Pokemon	2022-05-14 15:30:59 +01:00
Robin Kertels	0a3cf25823	Implement the Fermi 2D blitting engine The Fermi 2D engine implements both image blit and resolve operations, supporting subpixel sampling with both linear and point filtering. Resolve operations are performed by sampling from the center of each pixel in order to resolve the final image from the MSAA samples MSAA images are stored in memory like regular images but each pixels dimensions are scaled: e.g for 2x2 MSAA ``` 112233 112233 445566 445566 ``` These would be sampled with both duDx and duDy as 2 (integer part), resolving to the following: ``` 123 456 ``` Blit operations are performed by sampling from the corner of each pixel, scaling the image as one would expect. This implementation isn't fully complete as Vulkan blit doesn't support some combinations which Fermi does, most notably between colour and depth stencil. These will be implemented properly at a later date, likely after the texture manager rework. Out of Bounds Blit, used by some OpenGL games is also missing since supporting it requires texture aliasing, this will also be supported after the texture manager rework. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-05-13 22:37:37 +01:00
Billy Laws	be2546138d	Move IOVA class to GMMU so it can be used for other engines	2022-05-13 22:37:37 +01:00
Billy Laws	3ad640fcbc	Fix accidental graphics context member/parameter duplication	2022-05-13 22:37:37 +01:00
PixelyIon	7a6f27a19a	Fix texture swizzling OOB writes Certain writes during swizzling went out of bounds due to incorrect `blockExtentY` calculation, the previous commit to fix this ended up breaking it further. This commit returns to the original commit's calculations with the proper addendum of a check for exact alignment with a GOB which is the case that was broken earlier.	2022-05-13 14:52:41 +05:30
PixelyIon	168e51e7ad	Always use `GetLayerStride` for layer stride in Texture The `GuestTexture::GetLayerStride` function was not always being utilized to retrieve the layer stride inside `Texture`, it would instead directly access the `guestTexture::layerStride` member. This is problematic as it may not be initialized and return `0` which would lead to a broken image copy.	2022-05-13 14:21:37 +05:30
Billy Laws	b81d5bc865	Implement and cleanup semaphore operations in all engines Most engines have the capability to release a semaphore payload (or reduce in the case of GPFIFO) when a method is called or action is complete. Semaphores are used by games for both timing how long things take on GPU and waiting on resources so missing them can cause deadlocks or other related issues.	2022-05-12 19:40:24 +01:00
Billy Laws	bca88685bd	Stub nvdrv {Get,Dump}Status	2022-05-12 17:38:22 +01:00
Billy Laws	97e740c986	Fix slight locking bug with nvmap handle duplication	2022-05-12 17:38:22 +01:00
Billy Laws	57378457dc	Treat symbol file paths without slashes as filenames Prevents crashes printing backtrace if this occurs	2022-05-12 17:38:22 +01:00
Billy Laws	d08ac63bbf	Use TIC maximum index over TSC when tscIndexLinked is set	2022-05-12 17:38:22 +01:00
Billy Laws	8e021a9f1f	Load custom drivers from app private data dir Required since /sdcard doesn't have exec perm support	2022-05-12 17:38:21 +01:00
Billy Laws	dcef597345	Introduce TrivialObject concept and use where appropriate Simplifies type checking and handles excluding container types that are trivially copyable but contain pointers	2022-05-12 17:38:21 +01:00
PixelyIon	f2cc25ee9f	Implement Array Texture Swizzling Textures can have more than one layer which we currently don't handle, all layers past the initial one will be filled with random data or 0s, leading to incorrect rendering. This has now been implemented now which fixes any titles which utilize array textures, such as "Super Mario Odyssey" or "Hatsune Miku: Project DIVA MegaMix".	2022-05-12 18:23:45 +05:30
PixelyIon	2a99e1784d	Fix Maxwell3D RT Depth/Layer Count Logic The Maxwell3D RT layer count wasn't being set correctly as it has the same register as the depth values and is toggled between the two based on another register value.	2022-05-12 18:23:05 +05:30
Billy Laws	543ac3042e	Cleanup account services and stub StoreSaveDataThumbnail	2022-05-11 23:24:35 +01:00
Billy Laws	7d30ac0cd8	Add additional nifm stubs	2022-05-11 23:24:35 +01:00
Billy Laws	a164635f32	Stub LibraryAppletPlayerSelect	2022-05-11 23:24:35 +01:00
Billy Laws	dd0004e208	Set Host1x log tag correctly	2022-05-11 22:11:16 +01:00
Billy Laws	f89bacf8ae	Fixup Host1x syncpoint locking	2022-05-11 22:04:02 +01:00
Billy Laws	d8ff318a1a	Prevent infinite VFS read loop on EOF	2022-05-11 22:03:39 +01:00
shutterbug2000	f078a5d1ec	Stub `bt` and `btm:u` Stub BT services which is required by titles such as Pokémon Let's GO Pikachu and Eevee (non-Demo versions).	2022-05-11 20:44:09 +05:30
PixelyIon	588b4529ee	Implement 3D Texture Swizzling The Maxwell GPU supports 3D textures which are tiled with the block-linear layout which didn't handle swizzling 3D textures correctly till now. This commit addresses that by implementing proper swizzling for 3D textures. Titles such as Cluster Truck and Super Mario Odyssey utilize 3D textures alongside a vast majority of other titles.	2022-05-11 14:06:04 +05:30
Billy Laws	601d67e369	Use resource size rather than allocation size for staging buffer size As per VMA docs: 'Allocation size returned in this variable may be greater than the size requested for the resource e.g. as VkBufferCreateInfo::size. Whole size of the allocation is accessible for operations on memory e.g. using a pointer after mapping with vmaMapMemory(), but operations on the resource e.g. using vkCmdCopyBuffer must be limited to the size of the resource.'	2022-05-10 18:48:20 +01:00
Billy Laws	d2acec24f5	Handle VFS reads into trapped memory regions pread will refuse to read into any trapped regions so implement a manual path with a staging buffer and memcpy for such cases	2022-05-10 18:33:55 +01:00
Billy Laws	1609fd2a32	Account for layerCount in SynchronizeGuestWithBuffer staging buffer size	2022-05-10 18:33:31 +01:00
Billy Laws	5b97b87503	Restore previous cullMode when cullFace is enabled	2022-05-10 18:31:32 +01:00
Billy Laws	15e9fa1c80	Fix FillRandomBytes There were two issues here: - If a skyline span was passed as a param then the 'T &object' version would be called, filling the span itself with random values rather than its contents - Random numbers were repeated every call since independent_bits_engine copied generator state and thus it was never actually updated	2022-05-10 18:28:15 +01:00
Billy Laws	622ff2a8f1	Correctly track 5.1 audio channel sample count Size needs to be adjusted for 5.1 buffers since they're downsampled to stereo.	2022-05-10 18:26:20 +01:00
PixelyIon	56c9b03843	Fix incorrect swizzling Y extent calculation This calculation for the amount of lines on the Y axis relative to the start of the last block was wrong and would instead determine the amount of lines to the last Y-axis GOB which wasn't accurate when padding was considered, this resulted in titles like Celeste having broken texture decoding (on a 1922x1082 texture) for the last ROB as most pixels would be masked out.	2022-05-09 20:25:43 +05:30
Billy Laws	018df355f0	Replace some VFS exceptions with warnings These errors aren't necessarily fatal so tone them down.	2022-05-08 19:37:10 +01:00
Billy Laws	e1c13bbc08	Update hades	2022-05-08 19:37:10 +01:00
PixelyIon	b307fca115	Fix attachment reuse within the same subpass Certain titles such as BOTW trigger behavior to reuse an attachment within the same subpass, this caused an exception inside `RenderPassNode::AddAttachment` as it cannot find corresponding subpass for attachment. To fix this issue, we now assume that when it cannot find a subpass for an existing attachment, it is attached to the latest subpass and return the attachment.	2022-05-08 18:26:40 +05:30
PixelyIon	e027555796	Handle Y-axis GOB non-alignment for swizzling Certain textures may be unaligned with a GOB's height of 8 lines, we already handle the case of being unaligned with a GOB's width of 64-bytes. This case occurs on titles such as SMO when going in-game.	2022-05-07 18:37:22 +05:30
PixelyIon	c910e29168	Extend `HostSignalHandler`'s `SIGSEGV` debugger path The function now returns from a segmentation fault when a debugger is present, this allows the entire context to be intact which can allow the debugger to correctly pick up variables from all stack frames while it could not extrapolate most variables when trapped inside the signal handler without the values of all registers.	2022-05-07 18:37:22 +05:30
Billy Laws	4149ab1067	Implement Maxwell 3D instanced draw support In the Maxwell 3D engine, instanced draws are implemented by repeating the exact same draw in sequence with special flag set in vertexBeginGl. This flag allows either incrementing the instance counter or resetting it, since we need to supply an instance count to the host API we defer all draws until state changes occur. If there are no state changes between draws we can skip them and count the occurences to get the number of instances to draw.	2022-05-07 13:56:09 +01:00
Billy Laws	03594a081c	Ensure correct flushing for batched constant buffer updates Cbufs could be read by non-maxwell3D engines so force a flush when switching to them or before Execute.	2022-05-07 13:56:09 +01:00
PixelyIon	ad989750fc	Implement Maxwell3D Point Sprite Size Implements register state that corresponds to the size of a single point sprite in Maxwell 3D, this is emitted by the shader compiler in the preamble but needs to be only applied if the input topology is a point primitive and it is invalid to set the point size in any other case.	2022-05-07 03:46:25 +05:30
PixelyIon	874a6a2a6c	Fix `getTextureType` enum conversion fomatting	2022-05-07 03:46:25 +05:30
PixelyIon	ae5bcbdb5c	Fix Depth RT lock to be in scope Earlier texture locking design required the lock to be retained but since the introduction of `AttachTexture`, this no longer needs to be done. This being done caused deadlocks when the depth texture is sampled by the fragment shader while being bound as an RT since it would attempt to lock the texture again.	2022-05-07 02:37:48 +05:30
shutterbug2000	1c8d994161	Basic `bcat:u` implementation A basic `bcat:u` implementation to prevent titles such as "Kirby and the Forgotten Land" dependent on BCAT support from crashing due to the lack of an implementation.	2022-05-06 15:41:48 +05:30
PixelyIon	4fd64a53e0	Require Vulkan `samplerAnisotropy` feature This is a widely supported feature that games may require conditionally but due to it being supported on effectively all target devices, it was made mandatory. This is used by titles such as ARMS.	2022-05-06 15:41:48 +05:30
PixelyIon	1d9b4a865a	Add additional formats to Adreno filter `VK_FORMAT_R32G32B32A32_SFLOAT` and `D32_SFLOAT` have their capabilities misreported as well, this spams the logs in titles such as ARMS.	2022-05-06 15:41:48 +05:30
PixelyIon	b87295374e	Improve Controller Applet log Improves the readability of the log and replaces the previously uninformative prefix of `operator()` due to being in a lambda with `Controller support`.	2022-05-06 15:41:48 +05:30
PixelyIon	98c730a644	Implement linked TIC/TSC handle in Maxwell3D Maxwell3D has a register for linking the TIC/TSC index in bindless texture handles, this is used by games to implement bindless combined texture-sampler handles.	2022-05-06 14:58:20 +05:30
PixelyIon	23a091100d	Implement `ReadCbufValue` + `ReadTextureType` Implements `GraphicsEnvironment::ReadCbufValue` & `GraphicsEnvironment::ReadTextureType` with a framework of heterogeneous lookups for caching and callbacks for querying constant buffer or TIC values with validation checks for successive draws to ensure unique IR is generated.	2022-05-06 14:39:36 +05:30
PixelyIon	765c3f4e1f	Allow draws with no descriptor set resources The `descriptorSetWrites` being filled is now optional and the case of it being empty is handled correctly, this is done by certain titles such as ARMS and is entirely valid behavior. It should be noted that not doing this leads to errors in the guest due to invalid GPU state while working on the host GPU.	2022-05-06 10:33:47 +05:30
PixelyIon	37327f1955	Fix and refactor SVC `SignalToAddress`/`WaitForAddress` SVC `SignalToAddress` had a bug with the behavior of `SignalAndModifyBasedOnWaitingThreadCountIfEqual` which was entirely incorrect and led to deadlocks in titles such as ARMS that were dependent on it. This commit corrects the behavior and refactors both SVCs and moves their arbitration/waiting to inside the corresponding `KProcess` function rather than the SVC to avoid redundancies and improve code readability.	2022-05-05 19:15:37 +05:30
PixelyIon	396979e897	Extend Adreno format-based filtering for Validation Layer Filtering of validation logs is now extended beyond BCn formats and now covers other format which have their feature set misreported by the driver, this significantly drives down the amount of logs depending on the title.	2022-05-05 19:15:37 +05:30
PixelyIon	62ea2a6da5	Avoid format aliasing warnings on Adreno Implements an algorithm to determine formats that can be aliased as views without needing `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT`, this avoids spamming warning logs on view creation when the aliased formats will function in practice.	2022-05-05 19:15:37 +05:30
PixelyIon	7206ab4c67	Fix `exclusiveSubpass` by finishing render pass at end There was an oversight with exclusive subpasses which could lead to RPs with more than one subpass could be created even though one pass was exclusive, this oversight was not finishing the render pass at the end of `AddSubpass`. This could lead to a future subpass adding to the end of that RP even though it was intended to exclusively have a single subpass. This case occurs in titles such as Celeste (in-game) and breaks rendering on GPUs that may require exclusive subpasses for proper functionality.	2022-05-05 11:14:38 +05:30
PixelyIon	96fe5f0a0e	Set initial `subpassCount` value to 1 rather than 0	2022-05-05 11:07:43 +05:30
PixelyIon	5d08d6e06f	Disable unnecessary Khronos Validation Layer logs The Khronos Validation Layer can often generate warning/error logs due to our intentional breakage from Vulkan specification, these can occur several times a frame resulting in the logs being spammed and making it difficult to extract useful information out of logs. The scope of these logs has now been reduced with more general filtering and the introduction of specialized filtering to handle complex cases such as BCn hacks with `libadrenotools` on Adreno devices.	2022-05-04 13:20:59 +05:30
PixelyIon	23c9388caf	Fix `VK_KHR_push_descriptor`-less path for descriptor set updates Descriptor set updates were broken on the non-push-descriptor path due to lifetime issues with VkDescriptorSetLayout's usage during the execution phase which entirely broke rendering on AMD/Mali GPUs due to them not supporting `VK_KHR_push_descriptor`. This commit addresses that by moving the allocation of a descriptor set to outside the lambda and into the recording phase, it also simplifies the semantics and resources passed into the lambda by removing redundancies.	2022-05-04 00:49:21 +05:30
PixelyIon	47bc3b4d99	Fix Render Pass Cache The Vulkan render pass cache was fundamentally broken since it was designed around the Render Pass Compatibility clause due to being designed for framebuffer compatibility initially. As this scope was extended to a general render pass cache, the amount of data in the key was not extended to include everything it should have. This commit introduces the missing pieces in the RP cache and simplifies the underlying code in the process.	2022-05-01 20:31:36 +05:30
PixelyIon	25a29f9044	Skip zero-initializing shader bytecode backing The backing for shader data would implicitly be zero-initialized due to a `resize` on every shader parse, this was entirely unnecessary as we would overwrite the entire range regardless. We avoid this by using statically allocated storage and a span over it containing the shader bytecode which avoids any unnecessary clear semantics without resorting to more complex solutions such as a custom allocator.	2022-05-01 18:27:27 +05:30
PixelyIon	42573170c6	Implement Framebuffer Cache Implements a cache for storing `VkFramebuffer` objects with a special path on devices with `VK_KHR_imageless_framebuffer` to allow for more cache hits due to an abstract image rather than a specific one. Caching framebuffers is a fairly crucial optimization due to the cost of creating framebuffers on TBDRs since it involves calculating tiling memory allocations and in the case of Adreno's proprietary driver involves several kernel calls for mapping and allocating the corresponding framebuffer memory.	2022-05-01 18:27:27 +05:30
PixelyIon	af7f0c301e	Avoid redundant `VkImageView` recreation There are a lot of cases of `VkImageView` being recreated arbitrarily due to it being tied to the ephemeral object `TextureView` rather than `Texture`, this commit flips that by storing all `VkImageView`s inside `Texture` with `TextureView` simply holding a copy of the handle to them. Additionally, this change results in stable `VkImageView` handles and helps in paving the path for framebuffer caching when `VK_KHR_imageless_framebuffer` is unavailable.	2022-05-01 18:27:27 +05:30
PixelyIon	41b2c2dc7b	Add `profileable` attribute to `AndroidManifest.xml` As we desire more accurate profiling data in certain circumstances, making the app explicitly profilable will allow for this, it will also remove the (annoying) prompt to do this in the Android Studio profiler.	2022-05-01 18:27:27 +05:30
PixelyIon	da931cf07b	Implement Render Pass Cache Implements a cache for storing `VkRenderPass` objects which are often reused, they are not extremely expensive to create generally but this is a required step to build up to a framebuffer cache which is an extremely expensive object to create on TBDRs generally since it involves calculating tiling memory allocations and in the case of Adreno's proprietary driver involves several kernel calls for mapping and allocating the corresponding memory.	2022-05-01 18:16:53 +05:30
Billy Laws	ae77bde171	Fixup audio device name writing in services Games expect the output buffer the be entirely zero filled past the device name.	2022-04-30 16:00:33 +01:00
Billy Laws	194cbe6c7c	Stub several HID functions	2022-04-30 16:00:33 +01:00
Billy Laws	112c20cef2	Stub QueryAudioDevice{Input,Output}Event Used in many 3.0.0+ games	2022-04-30 16:00:33 +01:00
Billy Laws	8d7dbe2c4e	Add a way to get a readonly span of Buffer contents Avoids the need redundantly copy data when it is being directly processed on the CPU (e.g. quad coversion)	2022-04-30 16:00:33 +01:00
MK73DS	4c71ef5c31	Fix American English language code	2022-04-30 18:43:22 +05:30
PixelyIon	90c635bf78	Coalesce subpasses with compatible attachments together We run into a lot of successive subpasses with the exact same framebuffer configuration which we now exploit to avoid the creation of a new subpass due to the overhead involved with this. This provides significant performance boosts in certain cases due to the magnitude of difference in the amount of subpasses being created while providing next to no benefit in other cases.	2022-04-27 13:22:34 +05:30
PixelyIon	a947933bf0	Fix `Buffer` cycle check being inverted The check for the fence cycle being the same as the current cycle was incorrectly inverted to be the opposite of what it should have been, leading to bugs.	2022-04-27 13:07:36 +05:30
PixelyIon	54794f4b71	Move `Texture` locking and synchronization to `PresentationEngine` The responsibility for synchronizing a texture and locking it is now on the `PresentationEngine` rather than the API-user as this'll allow more fine grained locking and delay waiting until necessary.	2022-04-25 21:01:16 +05:30
Billy Laws	1dd230afde	Refactor all std::lock_guard usages to std::scoped_lock	2022-04-25 15:00:30 +01:00
PixelyIon	94e6f3cfa0	Add quirk for relaxed render pass compatibility As we require a relaxed version of the Vulkan render pass compatibility clause for caching multi-subpass render passes, we now utilize a quirk to determine if this is supported which it is on Nvidia/Adreno while AMD/Mali where it isn't supported we force single-subpass render passes.	2022-04-24 16:18:36 +05:30
PixelyIon	44615c8dd2	Implement per-vendor `VkQueue` maximum global priority We found out that certain vendors such as Nvidia had a limitation on the global priority of a queue and requesting `VK_QUEUE_GLOBAL_PRIORITY_HIGH_EXT` would result in `VK_ERROR_NOT_PERMITTED_EXT`. A quirk has been introduced to supply the maximum supported global priority which is currently set on a per-vendor basis to avoid future crashes.	2022-04-24 16:15:01 +05:30
PixelyIon	7ef4959060	Implement Graphics Pipeline Cache Implements a cache for storing `VkPipeline` objects which are fairly expensive to create and doing so on a per-frame basis was rather wasteful and consumed a significant part of frametime. It should be noted that this is not compliant with the Vulkan specification and will break unless the driver supports a relaxed version of the Vulkan specification's Render Pass Compatibility clause.	2022-04-24 14:31:00 +05:30
PixelyIon	50a8b69f7b	Optimize descriptor set writes using push descriptors We can use inline push descriptors for writing to descriptor rather than allocating a descriptor set for a one time write and freeing it as this is rather inefficient while an inline push descriptor generally ends up being a direct `memcpy` on the driver side designed for this use-case.	2022-04-24 13:45:09 +05:30
PixelyIon	5adafbff04	Set `VkQueue`'s global priority to high We want Skyline to have the most favorable GPU scheduling possible due to low latency and high throughput requirements, we request high priority scheduling due to this reason.	2022-04-24 13:34:09 +05:30
PixelyIon	f9c052d1b7	Implement Maxwell3D Tessellation State This implements all Maxwell3D registers and HLE Vulkan state for Tessellation including invalidation of the TCS (Tessellation Control Shader) state during state changes.	2022-04-24 13:23:00 +05:30
Billy Laws	de796cd2cd	Implement overhead-free sequenced buffer updates with megabuffers Previously constant buffer updates would be handled on the CPU and only the end result would be synced to the GPU before execute. This caused issues as if the constant buffer contents was changed between each draw in a renderpass (e.g. text rendering) the draws themselves would only see the final resulting constant buffer. We had earlier tried to fix this by using vkCmdUpdateBuffer however this caused significant performance loss due to an oversight in Adreno drivers. We could have worked around this simply by using vkCmdCopy buffer however there would still be a performance loss due to renderpasses being split up with copies inbetween. To avoid this we introduce 'megabuffers', a brand new technique not done before in any other switch emulators. Rather than replaying the copies in sequence on the GPU, we take advantage of the fact that buffers are generally small in order to replay buffers on the GPU instead. Each write and subsequent usage of a buffer will cause a copy of the buffer with that write, and all prior applied to be pushed into the megabuffer, this way at the start of execute the megabuffer will hold all used states of the buffer simultaneously. Draws then reference these individual states in sequence to allow everything to work without any copies. In order to support this buffers have been moved to an immediate sync model, with synchronisation being done at usage-time rather than execute (in order to keep contents properly sequenced) and GPU-side writes now need to be explictly marked (since they prevent megabuffering). It should also be noted that a fallback path using cmdCopyBuffer exists for the cases where buffers are too large or GPU dirty.	2022-04-23 22:48:28 +01:00
lynxnb	0d9992cb8e	Implement `QuadList` support for non-indexed draws	2022-04-20 18:17:10 +02:00
lynxnb	bcaf7dfe1c	Make `GetVertexBuffer` return a pointer to the requested buffer This avoids a redundancy in the `Draw` function and makes code easier to read	2022-04-20 18:16:45 +02:00
Billy Laws	5c3559e888	Revert "Implement support for GPU-side constant buffer updating" This reverts commit `d79635772f`.	2022-04-18 13:28:58 +01:00
Billy Laws	7bf3580031	Revert "Allow external synchronization for buffers" This reverts commit `372ab8befa`.	2022-04-18 13:28:58 +01:00
PixelyIon	ddc9622b90	Fix Shader Module Cache As bindings weren't correctly handled due to the fact that `EmitSPIRV` would change the bindings, the shader module cache would not correctly function and have no cache hits in `find` and rather have them in `try_emplace` which negated any performance benefit of it. This has now been fixed by retaining the initial cache key for insertion into the cache while also storing the post-emit bindings and restoring them during a cache hit.	2022-04-18 12:18:15 +05:30
Billy Laws	32fe01e145	Implement batch constant buffer updates Avoids spamming the driver with hundreds of cbuf updates per frame by batching all consecutive updates into one.	2022-04-17 00:35:00 +01:00
PixelyIon	02f99273ac	Implement Shader Module Cache Implements caching of the compiled shader module (`VkShaderModule`) in an associative map based on the supplied IR, bindings and runtime state to avoid constant recompilation of shaders. This doesn't entirely address shader compilation as an issue since host shader compilation is tied to Vulkan pipeline objects rather than Vulkan shader modules, they need to be cached to prevent costly host shader recompilation.	2022-04-16 18:45:56 +05:30
PixelyIon	76d8172a35	Implement Shader IR Cache This implements the first step of a full shader cache with caching any IR by treating the shared pointer as a handle and key for an associative map alongside hashing the Maxwell shader bytecode, it supports both single shader program and dual vertex program caching.	2022-04-16 18:45:56 +05:30
PixelyIon	0baa90d641	Implement `SpanEqual` and `SpanHash` We desire the ability to hash and check equality of data across spans to use associative containers such as `std::unordered_map` with spans. The implemented functions provide an easy way to do that.	2022-04-16 18:45:56 +05:30
Billy Laws	df5d1256c2	Implement an object backed IStorage backing This is more convinient and efficient to use when passing structured data out of applets	2022-04-16 18:45:56 +05:30
Billy Laws	d115ce3c05	Stub the controller applet Mostly based off of yuzu's implementation, this will need to be extended in the future to open up a UI for configuring controllers according to the applications requirements.	2022-04-16 18:45:56 +05:30
Billy Laws	9a8e39cba1	Slightly refactor controller code in HID Now uses ranges where possible and a function to get the number of connected controllers has been added.	2022-04-16 18:45:56 +05:30
Billy Laws	2873f11baa	Pass shared pointers by value in applet infrastructure This is more optimal than crefs when used together with std::move	2022-04-16 18:45:56 +05:30
PixelyIon	8ccef733ff	Fix UB with guest-less Texture/Buffers in `MarkGpuDirty` As there was no check for the lack of a `GuestTexture`/`GuestBuffer`, it would lead to UB when a texture/buffer that had no guest such as the `zeroTexture` from `GraphicsContext` would be marked as dirty they would cause a call to `NCE::RetrapRegions` with a `nullptr` handle that would be dereferenced and cause a segmentation fault.	2022-04-16 18:45:56 +05:30
PixelyIon	372ab8befa	Allow external synchronization for buffers In certain situations such as constant buffer updates, we desire to use the guest buffer as a shadow buffer forwarding all writes directly to it while we update the host using inline buffer updates so they happen in-sequence. This requires special behavior as we cannot let any synchronization operations take place as they would break the shadow buffer, as a result, an external synchronization flag has been added to prevent this from happening. It should be noted that this flag is not respected for buffer recreation which will lead to UB, this can and will break updates in certain cases and this change isn't complete without buffer manager support.	2022-04-16 18:44:53 +05:30
PixelyIon	c0c4db68a8	Fix `BufferView` offset not being added in `vkCmdUpdateBuffer` The offset of the view wasn't added to the `vkCmdUpdateBuffer`, this would cause the offset to be incorrect given the buffer was a view of a larger buffer that wasn't the start of it. This commit fixes that by adding the offset of the view to the buffer update.	2022-04-14 18:06:15 +05:30
PixelyIon	a1c06e0401	Mark GPU resources as dirty before GPU usage We didn't call `MarkGpuDirty` on textures/buffers prior to GPU usage, this would cause them to not be R/W protected when they should be and provide outdated copies if there were any read accesses from the CPU (which are not possible at the moment since we assume all accesses are writes at the moment). This has now been fixed by calling it after synchronizing the resource.	2022-04-14 17:20:05 +05:30
PixelyIon	41a6afed01	Fix `GraphicsContext` code formatting for auto formatter	2022-04-14 15:27:22 +05:30
PixelyIon	624df92616	Change `AddNonGraphicsPass` to `AddOutsideRpCommand` The terminology "Non-Graphics pass" was deemed to be fairly inaccurate since it simply covered all Vulkan commands (not "passes") outside the render-pass scope, these may be graphical operations such as blits and therefore it is more accurate to use the new terminology of "Outside-RenderPass command" due to the lack of such an implication while being consistent with the Vulkan specification.	2022-04-14 15:20:22 +05:30
Billy Laws	a31332e35f	Align Maxwell 3D macro newline slashes	2022-04-14 14:14:52 +05:30
Billy Laws	d79635772f	Implement support for GPU-side constant buffer updating Previously constant buffer updates would be handled on the CPU and only the end result would be synced to the GPU before execute. This caused issues as if the constant buffer contents was changed between each draw in a renderpass (e.g. text rendering) the draws themselves would only see the final resulting constant buffer. Fix this by updating cbufs on the GPU/CPU seperately, only ever syncing them back at the start or after a guest side CPU write, at the moment only a single word is updated at a time however this can be optimised in the future to batch all consecutive updates into one large one.	2022-04-14 14:14:52 +05:30
Robin Kertels	036faedabd	Implement a way to run non-graphics passes with command executor These commands will end the current renderpass and run on their own, this is useful for compute, blits etc.	2022-04-14 14:14:52 +05:30
Billy Laws	feb179fcff	Implement primitive restart support Maxwell3D also supports using an arbitrary restart index value however no games are known to use this so leave it for now.	2022-04-14 14:14:52 +05:30
Billy Laws	3f3acc31d8	Rework swizzle infrastructure to support arbritary format swizzles This is required to support R4G4B4A4 which has no directly corresponding Vulkan format. Co-authored-by: Lunar-Pixel <lunarn452@gmail.com>	2022-04-14 14:14:52 +05:30
PixelyIon	6f85a66151	Implement host-only `Buffer`s We require certain buffers to only be on the host while being accessible through the same abstractions as a guest buffer as they must be interchangeable in usage.	2022-04-14 14:14:52 +05:30
Billy Laws	2c697ec36a	Determine depth/stencil texture aspect based off of image swizzle Required since we can't have a non-rt image with both a depth/stencil aspect at the same time according to vk spec.	2022-04-14 14:14:52 +05:30
PixelyIon	1878e582ad	Add `ScopedStackBlocker` to `RomFile.populate` We needed to block stack frame lookups past JNI code as Java doesn't follow the ARMv8 frame pointer ABI which leads to invalid pointer dereferences. Any JNI function that throws or handles exceptions must do this now or it may lead to a `SIGSEGV`.	2022-04-14 14:14:52 +05:30
Billy Laws	68e693d9f4	Fix DMA Engine debug logs to not crash emu Address causes some type issues when printing directly so explicitly cast to u64 first to prevent them.	2022-04-14 14:14:52 +05:30
Billy Laws	8eaca87de8	Use an empty host texture in place of invalid TIC entries on guest Some games may pass empty TICs as inputs to shaders while not actually using them within the shader. Create an empty texture and pass this in instead when we hit this case, the nullDescriptor feature could be used but it's not supported by all devices so we chose to do it this way instead.	2022-04-14 14:14:52 +05:30
PixelyIon	41b98c7daa	Add stack tracing to `skyline::exception` Skyline's `exception` class now stores a list of all stack frames during the invocation of the exception. These can later be parsed by the exception handler to generate a human-readable stack trace. To assist with more complete stack traces, `-fno-omit-frame-pointer` is now passed on debug builds which forces the inclusion of frames on function calls.	2022-04-14 14:14:52 +05:30
PixelyIon	cd8fa66326	Fix NCE Destruction NCE is implicitly depended on by the `GPU` class due to the NCE Memory Trapping API so the destruction of it must take place after the destruction of the `GPU` class. Additionally, to prevent bugs the NCE destructor must set `staticNce` to `nullptr` as the signal handler will potentially access a destroyed instance of NCE otherwise.	2022-04-14 14:14:52 +05:30
Billy Laws	815f1f4067	Add support for sRGB TIC textures Without this sRGB textures would be interpreted as RGB leading to colours being slighly off. The sRGB flag isn't stored as part of format word so we reuse the _pad_ field of it to store the flag for the switch case.	2022-04-14 14:14:52 +05:30
Billy Laws	1ba4abf950	Add Astc{6x6,8x8} and R4G4B4A4 image formats	2022-04-14 14:14:52 +05:30
MCredstoner2004	dec0571eee	Infrastructure for applets to be implemented This removes a stub for an applet and implements several applet related service calls.	2022-04-14 14:14:52 +05:30
PixelyIon	164d4852fa	Sleep-loop rather than abort during termination We don't want to actually exit the process as it'll automatically be restarted gracefully due to a timeout after being unable to exit within a fixed duration so we just want to infinite sleep during termination. This should fix issues where exiting any game would cause the app to force close after some time as exception signal handling would fail in the background, the app should stay open now and automatically restart itself when another game is loaded in.	2022-04-14 14:14:52 +05:30
PixelyIon	ea00f1bb82	Flush emulation logs after exceptions A lot of logs are incomplete due to being unable to flush inside the signal handler, now we flush after any exceptions so that there is a guarantee of any exceptions being logged as this is crucial for proper debugging.	2022-04-14 14:14:52 +05:30
PixelyIon	62ba180550	Use R5G6B5 as Vulkan swapchain format rather than B5G6R5 B5G6R5 isn't generally supported by the swapchain and the format is used for R5G6B5 with swapped R/B channels to avoid aliasing so we reverse that by using R5G6B5 as the underlying Vulkan format for the swapchain which should be automatically handled by the driver for any copies from B5G6R5 textures and the data representation should be the same as B5G6R5 with swapped R/B channels so not reporting the correct texture::Format should be fine.	2022-04-14 14:14:52 +05:30
MK73DS	e54f86e923	Fix IApplicationFunctions::GetDisplayVersion id (https://switchbrew.org/wiki/Applet_Manager_services#IApplicationFunctions)	2022-04-14 14:14:52 +05:30
Billy Laws	77cf33b643	Trigger command executor before DMA copies DMA copies can use textures currently in active use on the GPU as dst/src so Execute before to prevent a deadlock	2022-04-14 14:14:52 +05:30
Billy Laws	dbbc5704d2	Implement DMA engine Block Linear->Linear copies	2022-04-14 14:14:52 +05:30
Billy Laws	3e4e8de1d2	Implement primitive Linear->Block Linear DMA engine copies Slightly inaccurate and misses some features but good enough for most games, should be revisted later.	2022-04-14 14:14:52 +05:30
Billy Laws	3c26921d54	Implement the Maxwell DMA engine The DMA engine is used to perform DMA buffer/texture copies directly on the GPU. It can deswizzle arbritary regions of input textures, perform component remapping and swizzle into output textures. This impl only supports 1D buffer copies, 2D ones will come later.	2022-04-14 14:14:52 +05:30
Billy Laws	3df76e84c3	Stub IRequest::GetAppletInfo in nifm	2022-04-14 14:14:52 +05:30
Billy Laws	6c5f9941ad	Stub additional IAddOnContentManager functions Used mainly by UE4 games	2022-04-14 14:14:52 +05:30
Billy Laws	486a835d0a	Use guest texture view type to determine the underlying image type If we have a Nx1x1 image then determining the type from dimensions will result in a 1D image being created thus preventing us from creating a 2D view. By using the image view type we can avoid this for textures from TICs since we know in advance how they will be used	2022-04-14 14:14:52 +05:30
Billy Laws	05966f34e5	Stub a pair of ISelfController functions Both used by SMO, SetScreenShotPermission and SetAlbumImageOrientation	2022-04-14 14:14:52 +05:30
Billy Laws	fe37d7c9be	Implement ICommonStateGetter::SetRequestExitToLibraryAppletAtExecuteNextProgramEnabled	2022-04-14 14:14:52 +05:30
Billy Laws	9813f9f8dc	Implement ICommonStateGetter::GetDefaultDisplayResolutionChangeEvent	2022-04-14 14:14:52 +05:30
Billy Laws	7e7c0252ca	Implement IApplicationFunctions::GetDisplayVersion	2022-04-14 14:14:52 +05:30
Billy Laws	b1f10865a0	Attach depth RT to command executor before draws This enforces that the depth RT outlives the draw, without this the depth RT could be freed while in active use by command executor leading to UAFs and crashes.	2022-04-14 14:14:52 +05:30
Billy Laws	0182fabc50	Stub {Set,Get}NpadHandheldActivationMode in HID	2022-04-14 14:14:52 +05:30
Billy Laws	2e197cead5	Support D32S8_Float_Uint_Unorm_Unorm depth/stencil format	2022-04-14 14:14:52 +05:30
Billy Laws	7717a86fb1	Implement VMM region->region copies Required by the DMA engine, a simple memcpy doesn't work since the buffers could span multiple blocks.	2022-04-14 14:14:52 +05:30
Billy Laws	af90d4f977	Implement audren Surround->Stereo downmixing	2022-04-14 14:14:52 +05:30
PixelyIon	ad0005f398	Remove guard-page from main thread stack This was erroneously included while migrating from older code where stack creation was entirely handled with host constructs such as `mmap` directly to using `KPrivateMemory` to manage it, we would create a guard page with `mprotect` that the guest was unaware about and would cause a segfault when a guest accessed the extents of the stack as reported to the guest.	2022-04-14 14:14:52 +05:30
PixelyIon	de81d28b1d	Implement SVC `GetThreadContext3` A partial implementation of the `GetThreadContext3` SVC, we cannot return the whole thread context as the kernel only stores the registers we need according to the ARMv8 ABI convention and so far usages of this SVC do not require the unavailable registers but all future usage must be monitored and potentially require extending the amount of saved registers.	2022-04-14 14:14:52 +05:30
PixelyIon	b706aa3463	Implement SVC `SetThreadActivity` This SVC can pause/resume a thread, it is used by engines like Unity to pause a thread during a GC world stop.	2022-04-14 14:14:52 +05:30
PixelyIon	36a7ad06bd	Use built-in vibrator by default for controller #0 The vibration device had to be set manually prior which led to it generally not being set at all even though a user might want vibration, this commit fixes that by making controller #0 use the built-in vibrator by default.	2022-04-14 14:14:52 +05:30
PixelyIon	b45437b78b	Move Skyline internal files to external directory Any Skyline files that should have been user-accessible were moved from `/data/data/skyline.emu/files` to `/sdcard/Android/data/skyline.emu/files` as the former directory is entirely private and cannot be accessed without either adb or root. This made retrieving certain data such as saves or loading custom driver shared objects extremely hard to do while this can be trivially done now.	2022-04-14 14:14:52 +05:30
Billy Laws	e5e20f39c9	Implement a simple constant buffer cache In some games such as SMO thousands of constant buffers are bound per frame which was causing an unreasonable number of lookups in both vmm and the buffer manager. Work around this by introducing a simple hashmap based cache, eviction is currently unsupported but not really necessary yet due to the small size of the buffers in the cache.	2022-04-14 14:14:52 +05:30
PixelyIon	cb2614f80e	Handle host accesses for NCE Memory Trapping API We cannot ignore accesses from the host to a region protected by the NCE Memory Trapping API, there's often access to regions which have overlap with a protected region unintentionally and those accesses need to be handled correctly rather than leading to a crash. This is done by implementing an additional signal handler `NCE::HostSignalHandler` to lookup any potential traps on a `SIGSEGV` and handle them correctly or when there isn't a corresponding trap raise a `SIGTRAP` when debugger is connected or delegate to `signal::ExceptionalSignalHandler` when it isn't.	2022-04-14 14:14:52 +05:30
PixelyIon	b04a0c386a	Page out RW-trapped memory in NCE Memory Trapping To cut down memory usage we now page out memory that is RW trapped via the NCE memory trapping API, the callbacks are supposed to page in the memory. This behavior is backed up by Texture/Buffer syncing which would read the host copies of data and write it to the guest, by paging the corresponding data on the guest we're avoiding redundant memory usage.	2022-04-14 14:14:52 +05:30
PixelyIon	344c5f2a62	Implement RAII wrapper over file descriptors The `FileDescriptor` class is a RAII wrapper over FDs which handles their lifetimes alongside other C++ semantics such as moving and copying. It has been used in `skyline::kernel::MemoryManager` to handle the lifetime of the ashmem FD correctly, it wasn't being destroyed earlier which can result in leaking FDs across runs.	2022-04-14 14:14:52 +05:30
PixelyIon	7ce2a903a1	Update LLVM + Oboe Initially this commit was only intended to update LLVM but due to a compilation error on latest LLVM libcxx due to the C++ stdlib header `<algorithm>` being a transitive dependency that is no longer transitively included on the latest LLVM libcxx (as of https://reviews.llvm.org/D119667), this required changes in Skyline and Oboe which were done in https://github.com/google/oboe/pull/1521 and the submodule has been updated to include those changes.	2022-04-14 14:14:52 +05:30
Billy Laws	01c027b9f6	Fix GetBlockLinearLayerSize to avoid incorrectly calculating a zero size	2022-04-14 14:14:52 +05:30
lynxnb	08e24915d8	Add support for drawing inside the display cutout areas	2022-04-14 14:14:52 +05:30
MK73DS	6e929e6f6a	Stub ICommonStateGetter::SetCpuBoostMode This makes Metroid Dread boot	2022-04-14 14:14:52 +05:30
Billy Laws	d033ff2478	Fix draws when no colour RTs and only depth is bound	2022-04-14 14:14:52 +05:30
Billy Laws	d137051833	Add basic support for 3d/cubemap textures These are mostly used in 3D games like SMO, support is still quite basic and synchronising block linear 3D texture will crash in most cases due to them being unimplemented.	2022-04-14 14:14:52 +05:30
Billy Laws	bcc00216b7	Fix incorrect Bc2/3 block sizes	2022-04-14 14:14:52 +05:30
PixelyIon	7e9b0fec77	Increase reported `audren` revision to 11 Some games crash due to requiring an `audren` version greater than 7. The `audren` version can be increased without any issues as `audren` is stubbed and therefore the reported version doesn't matter.	2022-04-14 14:14:52 +05:30
PixelyIon	e294fa8c91	Add subpass limit quirk to fix Adreno driver bug Older Adreno proprietary drivers (5xx and below) will segfault while destroying the renderpass and associated objects if more than 64 subpasses are within a renderpass due to internal driver implementation details. This commit introduces checks to automatically break up a renderpass when that limit is hit.	2022-04-14 14:14:52 +05:30
PixelyIon	65d5a3bce5	Align all `Buffer`s to page boundary We have support for overlapping buffers which allows us to merge a lot of smaller buffers located on a single page into a single larger buffer which allows for better performance. It additionally ensures that all host buffers match the alignment guarantees of the guest and adequately fulfill host alignment requirements.	2022-04-14 14:14:52 +05:30
PixelyIon	cb1ec9a7f4	Rework `BufferManager`, `Buffer` and `BufferView` This commit encapsulates a complex sequence of cascading changes in the process of supporting overlaps for buffers: * We determined that it is impossible to resolve overlaps with multiple intervals per buffer within the constraints of each overlap being a contiguous view, support for multiple intervals was therefore dropped. The older buffer manager code was entirely reworked to be simpler due to only handling one interval per buffer with code now being based off `IntervalMap` but tailored specifically for buffers. * During overlap resolution, the problem of how existing views into the buffer being recreated would be updated, it had to be replaced with a larger buffer that could contain all overlaps and all existing views would need to be repointed to it. This was addressed by a buffer owning all views to itself, we could automatically recalculate the offset of all views and update the buffers with it. * We still needed to update usage of existing views which was done by handling all access (such as inside a recorded draw) to buffer view properties via `BufferView::RegisterUsage` which dispatches a callback with the view and the corresponding backing buffer. This callback can be stored and called during overlap resolution with the new buffer. * We had issues with lifetime of the buffer with the handle-like semantics of `BufferView` introduced in the last buffer-related commit, if we updated the view to be owned by a new buffer we'd need to extend the lifetime of the new buffer not the older one and the only way to do this was a proxy owner object `BufferDelegate` which holds a shared pointer to the real `Buffer` which in-turn holds a pointer to all `BufferDelegate` objects to update on repointing. A `BufferView` is effectively just a wrapper around `std::shared_ptr<BufferDelegate>` with more favorable semantics but generally just forwarding calls. It should be additionally noted that to support usage of `RegisterUsage` the code around buffers in `GraphicsContext` was refactored to defer truly binding till the recording phase.	2022-04-14 14:14:52 +05:30
PixelyIon	a6781b38f4	Clear `syncBuffers` after `CommandExecutor` execution Due to an oversight, we weren't clearing the list of buffers that needed to be synced after every execution which led to them building up. Due to the relatively cheap synchronization of buffers and only doing so on faults this wasn't caught until now, it does depress the framerate significantly over time due to the size of the list growing to be in the range of 100k buffer views depending on the title.	2022-04-14 14:14:52 +05:30
kaikecarlos	49c0ba1207	Implement `IAccountServiceForApplication::IsUserRegistrationRequestPermitted`	2022-04-14 14:14:52 +05:30
kaikecarlos	e8cc760b10	Implement IHidServer Functions Add GetVibrationDeviceInfo and StartSixAxisSensor	2022-04-14 14:14:52 +05:30
kaikecarlos	9f51664b1d	Stub IRS Service	2022-04-14 14:14:52 +05:30
lynxnb	707c0cc0af	Stub `aocsrv::IAddOnContentManager::ListAddOnContent`	2022-04-14 14:14:52 +05:30
lynxnb	873ed641ea	Stub `nfp::IUser::ListDevices` and `nfp::IUser::GetState`	2022-04-14 14:14:52 +05:30
lynxnb	7d518cba2b	Stub `am::ICommonStateGetter::IsVrModeEnabled`	2022-04-14 14:14:52 +05:30
Robin Kertels	594f061b21	Implement SSBOs Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-04-14 14:14:52 +05:30
Billy Laws	82d2a9ab56	Unify engine related macros to avoid excessive code duplication	2022-04-14 14:14:52 +05:30
Billy Laws	ae41ddf4f0	Implement a skeleton compute engine The Kepler compute engine is used to run compute jobs encapsulated in to QMDs on the GPU, this commit doesn't implement compute itself but adds the register and QMD structs that will be needed for it in the future.	2022-04-14 14:14:52 +05:30
Billy Laws	0298a7b1f6	Implement the actual inline to memory engine on subch 2 Used mostly by OGL games for copying stuff around.	2022-04-14 14:14:52 +05:30
Billy Laws	ba7111d33a	Add maxwell3d I2M support	2022-04-14 14:14:52 +05:30
Billy Laws	8c73b62b2c	Implement basic inline2memory engine support Not currently used by anything but will be used by both compute, 3D and its own engine in the future. Block linear copies are currently unsupported.	2022-04-14 14:14:52 +05:30
Billy Laws	5c387f5c5a	Fixup depth mode init value to allow ignoring redundant calls	2022-04-14 14:14:52 +05:30
PixelyIon	7a5c771f44	Rework GPU BufferView to have handle-like semantics We wanted views to extend the lifetime of the underlying buffers and at the same time preserve all views until the destruction of the buffer to prevent recreation which might be costly in the future when we need `VkBufferView`s of the buffer but also require a centralized list of all views for recreation of the buffer. It also removes the inconsistency between `BufferView*` being returned in `GetXView` in `GraphicsContext`.	2022-04-14 14:14:52 +05:30
Billy Laws	fae5332f20	Disable descriptor aliasing on Adreno to workaround shader compiler bug Alised descriptor sets are incorrectly interpreted by the shader compiler causing it to bugger up LLVM function argument types and crash Co-authored-by: PixelyIon <pixelyion@protonmail.com>	2022-04-14 14:14:52 +05:30
Billy Laws	fc2c123ae2	Implement GPU depthMode register This controls the depth range used by the shader, hades already has support for the necessary patching so we only need to pass the current mode over to it and it'll do the necessary work.	2022-04-14 14:14:52 +05:30
Billy Laws	7e088ca465	Fix constbuf updates to actually increment the write offset Uses the register directly now as when we modify it we want the changes to be visible from macros too.	2022-04-14 14:14:52 +05:30
PixelyIon	d2f3479610	Use `eB5G6R5UnormPack16` VkFormat for `B5G6R5Unorm` and `R5G6B5Unorm` Using `eB5G6R5UnormPack16` (with a swizzle for `R5G6B5Unorm`) removes the need for `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` when those formats are aliased which happens in Sonic Mania among other titles.	2022-04-14 14:14:52 +05:30
PixelyIon	24d7066d8b	Add quirk to avoid `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` on Adreno GPUs Adreno GPUs have significant performance penalties from usage of `VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT` which require disabling UBWC and on Turnip, forces linear tiling. As a result, it's been made an optional quirk which doesn't supply the flag in `VkImageCreateInfo` and logs a warning if a view with a different Vulkan format from the original image is created.	2022-04-14 14:14:52 +05:30
PixelyIon	731d06010d	Set `eMutableFormat` in Texture Image Creation We often need to alias the underlying data as multiple Vulkan formats which requires the `eMutableFormat` bit to be set in `VkImageCreateInfo`, without doing this there'll be validation layer errors and potentially GPU bugs.	2022-04-14 14:14:52 +05:30
PixelyIon	dafcfa68ca	Transition texture layout to `eGeneral` after creation As we no longer set the layout to general inside the Texture constructor, yet, we need it to be set prior to the image being used as an attachment. We need to transition the layout to `eGeneral` after creation of the texture object.	2022-04-14 14:14:52 +05:30
PixelyIon	5dea15632c	Add Controller Setup Guide A setup guide for controllers that goes through every available button/stick sequentially and opens up a corresponding dialog to map them.	2022-04-14 14:14:52 +05:30
PixelyIon	e2cae74425	Fix `RecyclerView` height in `CoordinatorLayout` for non touch-mode Any `RecyclerView`s with an app bar in a `CoordinatorLayout` would end up going off-screen due to the layout behavior implementing an offset by using a transform which would not correctly handle focusing on off-screen objects. This has now been fixed by manually adjusting height to be clipped to what is visible on the screen.	2022-04-14 14:14:52 +05:30
PixelyIon	3ae62c7fcc	Collapse `appBarLayout` on `appList` focus We collapse the app bar when the focus is on the app list which only occurs while using a controller, this is required as the app bar will never be collapsed otherwise. It also removes the older code to work around the limitation on `View.FOCUS_DOWN` by collapsing only when the end of the list was reached.	2022-04-14 14:14:52 +05:30
PixelyIon	3e4ec7323b	Tweak grid compact items Removes card elevation as it visually conflicts with the scrim, this also makes the scrim a bit darker to emphasize the text and slightly reduces the border radius.	2022-04-14 14:14:52 +05:30
PixelyIon	1d984b6de3	Add padding to end of `app_list` A small amount of padding at the end of `app_list` to signify that the end of the list has been reached was added.	2022-04-14 14:14:52 +05:30
PixelyIon	bac7c526ef	Make layout selectable for grid items The entire layout is now selectable for grid items rather than just the card, this greatly increases the visibility of the selection when not in touch mode as the contrast of a darken effect on the icon can be minimal depending on how dark the icon already is.	2022-04-14 14:14:52 +05:30
PixelyIon	1d070e6332	Close `InputStream` after usage in `KeyReader` The `InputStream` would not be closed after reading the key file in `KeyReader#import`, it's now wrapped with `use{ }` which handles closing the stream after usage.	2022-04-14 14:14:52 +05:30
MK73DS	647cb07dc8	Stub functions in IAccountServiceForApplication: - GetUserCount - InitializeApplicationInfo - IsUserAccountSwitchLocked	2022-04-14 14:14:52 +05:30
PixelyIon	b41d8b7997	Use `Surface#setFrameRate` for suggesting display refresh rate Setting the refresh rate via the Display API's`preferredDisplayModeId` is an outdated method to do it on Android 11 and above, we now use `Surface#setFrameRate` alongside it to suggest a refresh rate for the display.	2022-04-14 14:14:52 +05:30
PixelyIon	730bf504f8	Correct Adreno texture binding quirk We incorrectly determined an Adreno driver bug to require padding between binding slots but the real issue was not supporting consecutive binding writes for `VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER` and was fixed by the padding slot unintentionally requiring individual writes. The quirk has now been corrected to explicitly specify this as the bug and the solution is more apt.	2022-04-14 14:14:52 +05:30
PixelyIon	da8cb48933	Fix Interval Map `GetAlignedRecursiveRange` lookup bug Any lookups done using `GetAlignedRecursiveRange` incorrectly added intervals in the exclusive interval entry lookups as the condition for adding them was the reverse of what it should've been due to a last minute refactor, it led to graphical glitches and crashes. This has been fixed and the lookups should return the correct results.	2022-04-14 14:14:52 +05:30
PixelyIon	f2faa74707	Fix crashes due to `SEGV_ACCERR` check On certain devices, accesses to a protected memory region can return `si_code` as non-`SEGV_ACCERR` values, this leads to a crash as we only pass access violations to the trap handler and would lead to not doing so on those devices which would then result in going to the crash handler.	2022-04-14 14:14:52 +05:30
PixelyIon	62c16fa73e	Upgrade Gradle (7.4), AGP (7.1.2) and Kotlin Dependencies	2022-04-14 14:14:52 +05:30
PixelyIon	77e2797219	Delete expired `weak_ptr`s for Texture/Buffer views A large amount of Texture/Buffer views would expire before reuse could occur in `Texture::GetView`/`Buffer::GetView`. These can lead to a substantial memory allocation given enough time and they are now deleted during the lookup while iterating on all entries. It should be noted that there are a lot of duplicate views that don't live long enough to be reused and the ultimate solution here is to make those views live long enough to be reused.	2022-04-14 14:14:52 +05:30
PixelyIon	881bb969c4	Implement access-driven Buffer synchronization Similar to constant redundant synchronization for textures, there is a lot of redundant synchronization of buffers. Albeit, buffer synchronization is far cheaper than texture synchronization it still has associated costs which have now been reduced by only synchronizing on access.	2022-04-14 14:14:52 +05:30
PixelyIon	7532eaf050	Attach Texture to Cycle in `Texture::TransitionLayout` Not doing so could result in the texture being destroyed before the completion of a transition and lead to undefined behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	3268b3779a	Implement access-driven Texture synchronization There was a lot of redundant synchronization of textures to and from host constantly as we were not aware of guest memory access, this has now been averted by tracking any memory accesses to the texture memory using the NCE Memory Trapping API and synchronizing only when required.	2022-04-14 14:14:52 +05:30
PixelyIon	3e33d49faf	Implement NCE Memory Trapping API An API for trapping accesses to guest memory and performing callbacks based on those accesses alongside managing protection of the memory. This is a fundamental building block for avoiding redundant synchronization of resources from the guest and host. Note: All accesses are treated as write accesses at the moment, support for picking up read accesses will be implemented later	2022-04-14 14:14:52 +05:30
PixelyIon	08510d75b0	Implement Interval Map An interval map is a crucial piece of infrastructure required for memory faulting to track any regions that have an associated callback and their protection. Additionally, efficient page-aligned lookups with semantics optimal for memory faulting are also a requirement and the ability to associate multiple regions with a single callback/protection entry rather than doing so on a per-region basis as we deal with split-mapping resources.	2022-04-14 14:14:52 +05:30
PixelyIon	5c9e42e384	Use mirror mappings for Textures and Buffers This is a prerequisite to memory trapping as we need to write to the mirror to avoid a race condition with external threads writing to a texture/buffer while we do so ourselves for the sync on a read/write, it also avoids an additional `mprotect` to `-WX`/`RWX` on a read access. An additional advantage for textures especially is that we now support split-mapping textures due to laying them out in a contiguous mirror and they will not require costly algorithmic changes. Buffers should also benefit from not needing to iterate over every region when they are split into multiple mappings.	2022-04-14 14:14:52 +05:30
PixelyIon	577a67babd	Support mirrors of multiple non-contiguous memory regions `CreateMirror` is limited to creating a mirror of a single contiguous region which does not work when creating a contiguous mirror of multiple non-contiguous regions. To support this functionality, `CreateMirrors` which expects a list of page-aligned regions and maps them into a contiguous mirror.	2022-04-14 14:14:52 +05:30
PixelyIon	e35ab6d1e0	Move to mapping guest AS as shared memory We want to create arbitrary mirrors in the guest address space and to make this possible, we map the entire address space as a shared memory file. A mirror is mapped by using `mmap` with the offset into the guest address space.	2022-04-14 14:14:52 +05:30
Billy Laws	a5dd961f01	Add support for batched method sending Important for constbuf updates which would be very slow if done one at a time.	2022-04-14 14:14:52 +05:30
Robin Kertels	43879e2476	Round up when calculating size of compressed texture in bytes	2022-04-14 14:14:52 +05:30
Robin Kertels	d889550e84	Don't set COLOR_ATTACHMENT_BIT for compressed formats. The better solution would be to only set this for formats that support it on original HW but this will get rid of the validation errors for now.	2022-04-14 14:14:52 +05:30
Robin Kertels	82296ac5b8	Use buffer size instead of allocation size for Buffer constructor Fixes a validation error.	2022-04-14 14:14:52 +05:30
Robin Kertels	752245c3c8	Enable provoking vertex feature	2022-04-14 14:14:52 +05:30
Robin Kertels	dd45d054e7	Enable shaderDrawParameters	2022-04-14 14:14:52 +05:30
Billy Laws	7e16c1f989	Heavily optimise GPFIFO command dispatch to reduce redundant checks Previously for methods with count > 1 the subchannel and engine would be looked up for each part of the method rather than only doing so at the start. Each call also needed to be looked up to see if it touched a macro or GPFIFO method. Fix this by doing checks outside of the main dispatch loop with templated helper lambdas to avoid needing to repeat lots of code. Maxwell3D is the only subchannel with a fast path for now but more can be added later if needed.	2022-04-14 14:14:52 +05:30
Billy Laws	b4927d0138	Add support for turnip and driver file redirection via libadrenotools	2022-04-14 14:14:52 +05:30
Billy Laws	dd91d063a5	Pass native library dir to OS + reorder OS init order so paths are first This is required for integrating libadrenotools, which needs access to library and app directories in the GPU class constructor.	2022-04-14 14:14:52 +05:30
Billy Laws	011de98940	Rework formats to support passing through guest swizzle values Almost every Maxwell format now directly corresponds to a Vulkan format. This allows formats to be passed through and the swizzle used directly from guest (with some extra swizzle handling for edge cases) thus saving the need to explicitly support each swizzle combination which is adds a lot of code bloat. The format header is additionally reordered with line breaks to separate formats by their bits-per-block.	2022-04-14 14:14:52 +05:30
Billy Laws	6f17d1351f	Fixup ordering for B10G11R11Float texture format	2022-04-14 14:14:52 +05:30
Billy Laws	78238d550a	Add 6 channel downmixing support for Audout The specific attenuations used for each channel are taken from Ryujinx.	2022-04-14 14:14:52 +05:30
Billy Laws	2e1a1a965d	Fixup AudioTrack locking	2022-04-14 14:14:52 +05:30
PixelyIon	727f83e969	Fix Incorrect Vertex Binding Divisor State Submission We always submit pipeline divisor descriptions regardless of binding input rate being vertex rather than instance. This is invalid behavior and has been fixed by only submitting binding descriptors when the input rate is per-instance.	2022-04-14 14:14:52 +05:30
PixelyIon	9f7e80cf8f	Fix Adreno Texture Sampler Binding Bug Adreno proprietary drivers suffer from a bug where `VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER` requires 2 descriptor slots rather than one, we add a padding slot to fix this issue. `QuirkManager` was introduced to handle per-vendor/per-device errata and allow enabling this on Adreno proprietary drivers specifically as to not affect the performance of other devices.	2022-04-14 14:14:52 +05:30
PixelyIon	ddb2ba8a1b	Rename `QuirkManager` to `TraitManager` Quirk terminology was deemed to be inappropriate for describing the features/extensions of a device. It has been replaced with traits which is far more fitting but quirks will be used as a terminology for errata in devices.	2022-04-14 14:14:52 +05:30
PixelyIon	0b2ce6a8f3	Fix Texture Handle Offset Calculation The texture handle offset calculation involved an incorrect shift by descriptor size which was found to be unnecessary and would result in an invalid handle that had the wrong TIC/TSC index and caused broken rendering.	2022-04-14 14:14:52 +05:30
PixelyIon	aa57ec6d55	Destroy `CommandExecutor` Nodes Before Waiting on Execution `nodes` and `syncTextures` were cleared after waiting on the `CommandExecutor` fence rather than before, this wasted execution time after the wait for something that could be performed prior to the wait.	2022-04-14 14:14:52 +05:30
PixelyIon	90a1b3348c	Implement D24S8 + R11G11B10 Formats	2022-04-14 14:14:52 +05:30
PixelyIon	bd718175ce	Enable `VK_KHR_uniform_buffer_standard_layout` when available We now attempt to enable `VK_KHR_uniform_buffer_standard_layout` when present as lax UBO layout significantly reduces complexity. If a device doesn't support this extension, we still assume that the device supports it implicitly as this has proven to be true across all major mobile GPU vendors regardless of the driver version but enabling this prevents validation layer errors.	2022-04-14 14:14:52 +05:30
PixelyIon	22ce531e6f	Force Memory Barrier at `VkRenderPass` Start We depend on past commands to have completed execution in a renderpass, a subpass dependency on all graphics stages from `VK_SUBPASS_EXTERNAL` to subpass #0 is used to enforce this. Nvidia and Adreno proprietary drivers implicitly do this but Turnip or Mali drivers require this or they execute out of order.	2022-04-14 14:14:52 +05:30
PixelyIon	35fde2cd0b	Rework Blocklinear Texture Deswizzling Blocklinear texture decoding was broken for padding blocks and would incorrectly decode them resulting in major texture corruption for any textures with their widths not aligned to 64 bytes. This has now been fixed with neater code which avoids redundant repetition of any code using lambdas and functions where necessary.	2022-04-14 14:14:52 +05:30
PixelyIon	043be4d8f7	Implement Maxwell3D Two-Side Stencil Toggle Stencil operations are configurable to be the same for both sides or have independent stencil state for both sides. It is controlled via the previously unimplemented `stencilTwoSideEnable`.	2022-04-14 14:14:52 +05:30
PixelyIon	80ae7b255a	Implement Maxwell3D Front Face Flip	2022-04-14 14:14:52 +05:30
PixelyIon	40a3887695	Implement Maxwell3D Viewport Y Swizzle & Lower-Left Origin	2022-04-14 14:14:52 +05:30
Billy Laws	3be30e68c3	Add D16 depth format and ZF32 TIC format Used by One Piece Unlimited World Red	2022-04-14 14:14:52 +05:30
Billy Laws	be007c4ccc	Fixup texture swizzling to actually function Before this we were not applying the supplied swizzles, will be superseeded in the future by using guest swizzle values.	2022-04-14 14:14:52 +05:30
Billy Laws	6e48460c0d	Add BC2/3 format support	2022-04-14 14:14:52 +05:30
Billy Laws	2253bc3151	Reorder GPU quirks member to prevent it constructing after device init	2022-04-14 14:14:52 +05:30
Billy Laws	62db21fb78	Rework GPFIFO method distribution and macros to support multiple engines Fermi2D supports macros in addition to Maxwell3D, these both share code memory. To support this we rework the macro interpreter to support passing in a target engine and abstract the communications out into an interface that can be implemented by applicable engines. ``` GPFIFO <-> MME <-> Maxwell3D ^ ^---> Fermi2D X------------> I2M X------------> MaxwellComputeB X--Flush-----> MaxwellDMA ```	2022-04-14 14:14:52 +05:30
Billy Laws	8d5463ef28	Drop engine base class usage from GPFIFO This class does nothing since we made stopped GPFIFO submits from using virtual functions so it can be dropped.	2022-04-14 14:14:52 +05:30
PixelyIon	41aad83c33	Tie Shader `ObjectPool` Lifetime to Shader `Program` Shader programs allocate instructions and blocks within an `ObjectPool`, there was a global pool prior that was never reaped aside from on destruction. This led to a leak where the pool would contain resources from shader programs that had been deleted, to avert this the pools are now tied to shader programs.	2022-04-14 14:14:52 +05:30
PixelyIon	e747de37cf	Implement Blocklinear TIC Type	2022-04-14 14:14:52 +05:30
PixelyIon	723189a948	Calculate Blocklinear Texture Aligned Size Correctly The size of blocklinear textures did not consider alignment to Block/ROB boundaries before, it is aligned to them now. Incorrect sizes led to textures not being aliased correctly due to different size calculations for GraphicBufferProducer surfaces and Maxwell3D color RTs.	2022-04-14 14:14:52 +05:30
Billy Laws	95685b8207	Avoid iterator invalidation segfault when unregistering a syncpt waiter erase invalidated `it` leading to a potential segfault if the GPU was very far behind, bail out early to avoid that since there can only be one occurence at most in the buffer anyway.	2022-04-14 14:14:52 +05:30
Billy Laws	e7bfd93541	Implement BC7 format support Used by ARMS	2022-04-14 14:14:52 +05:30
Billy Laws	99652c5eda	Support partially mapped cbufs Buggy games sometimes supply an incorrect cbuf size so limit buffers to the first unmapped region.	2022-04-14 14:14:52 +05:30
PixelyIon	6a6f51ea84	Implement Maxwell3D Depth/Stencil State Implements the entirety of Maxwell3D Depth/Stencil state for both faces including compare/write masks and reference value. Maxwell3D register `stencilTwoSideEnable` is ignored as its behavior is unknown and could mean the same behavior for both stencils or the back facing stencil being disabled as a result of this it is unimplemented.	2022-04-14 14:14:52 +05:30
PixelyIon	9f5c3d8ecd	Force Textures to be Optimal on Host GPU We don't respect the host subresource layout in synchronizing linear textures from the guest to the host when mapped to memory directly, this leads to texture corruption and while the real fix would involve respecting the host subresource layout, this has been deferred for later as real world performance advantages/disadvantages associated with this change can be observed more carefully to determine if it's worth it.	2022-04-14 14:14:52 +05:30
Billy Laws	ab4962c4e4	Implement additional texture formats, including BCn BCeNabler is required for BCn textures, the pre-swizzled formats will be removed when arbitary swizzle support is added later.	2022-04-14 14:14:52 +05:30
Billy Laws	600b94505c	Fix A2R10G10B10 render target format This was wrongly described as R10G10B10A2 in the enum when it's actually A2R10G10B10, a format natively supported in Vulkan with just a swizzle.	2022-04-14 14:14:52 +05:30
Billy Laws	175ba11f07	Integrate BCeNabler support into QuirkManager Allows using BCn format textures on devices where they are unsupported by the driver.	2022-04-14 14:14:52 +05:30
Billy Laws	47d920d91e	Make GPU private static functions file-local	2022-04-14 14:14:52 +05:30
PixelyIon	edd51c3dfa	Fix Color RT Disabling Bug Color RTs are disabled by setting their format as `None`, it was removed while transitioning to macros and resulted in a missing format exception. It has been readded as several applications depend on this behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	a2285669b3	Use static vector for shader bytecode to prevent constant reallocation Using `std::vector` for shader bytecode led to a lot of reallocation due to constant resizing, switching over the static vector allows for a single static allocation of the maximum possible guest shader size (1 MiB) to be done for every stage resulting in a 6 MiB preallocation which is unnoticeable given the total memory overhead of running a Switch application.	2022-04-14 14:14:52 +05:30
PixelyIon	21a6866def	Fix Maxwell3D Blend Enum Conversion Bugs The `OneMinusSourceAlpha` blending factor was converted to `eOneMinusSrcColor` rather than `eOneMinusSrcAlpha` leading to incorrect blending behavior in certain titles. A similar issue with the order of `MinimumGL`/`MaximumGL` and `SubtractGL`/`ReverseSubtractGL` being the opposite of what it should've been, both of these issues have been fixed.	2022-04-14 14:14:52 +05:30
PixelyIon	0a506088f4	Fix `NextSubpassNode` Subpass Index Bug `NextSubpassNode` didn't increment `subpassIndex` which runs commands with the wrong subpass index resulting in them accessing invalid attachments or other bugs that may arise from using the wrong subpass.	2022-04-14 14:14:52 +05:30
PixelyIon	defbfe8f78	Serialize Maxwell3D Draw State for Subpass All Maxwell3D state was passed by reference to the draw command lambda, this would break if there was more than one pass or the state was changed in any way before execution. All state has now been serialized by value into the draw command lambda capture, retaining state regardless of mutations of the class state.	2022-04-14 14:14:52 +05:30
PixelyIon	934130b3e6	Remove Implicit Command Executor Resource Attachment Any usage of a resource in a command now requires attaching that resource externally and will not be implicitly attached on usage, this makes attaching of resources consistent and allows for more lax locking requirements on resources as they can be locked while attaching and don't need to be for any commands, it also avoids redundantly attaching a resource in certain cases.	2022-04-14 14:14:52 +05:30
PixelyIon	f0e9c42097	Fix Fence Cycle Double Insertion Lifetime Bug If an object is attached to a `FenceCycle` twice then it would cause `FenceCycleDependency::next` to be overwritten and lead to destruction of dependencies prior to the fence being signaled causing usage of deleted resources. This commit fixes this by tracking what fence cycle a dependency is currently attached to and doesn't reattach if it's already attached to the current fence cycle.	2022-04-14 14:14:52 +05:30
PixelyIon	6a831f6ed7	Add `VK_EXT_shader_demote_to_helper_invocation` Quirk An assumption was hardcoded into `Shader::Profile` regarding devices supporting demotion of shader invocations to helpers. This assumption wasn't backed by enabling the `VK_EXT_shader_demote_to_helper_invocation` extension via a quirk leading to assertions when it was used by the shader compiler, a quirk has now been added for the extension and is supplied to the shader compiler accordingly.	2022-04-14 14:14:52 +05:30
lynxnb	58c871ed9a	Correctly hide system bars in `EmulationActivity` on Android >= 11	2022-04-14 14:14:52 +05:30
Billy Laws	3ff8075151	Move vertex and RT format conv to macros and fill them fully in Makes the format conversions easier to read and shorter, and adds in some new formats needed to complete the RT table properly.	2022-04-14 14:14:52 +05:30
PixelyIon	8f0db18624	Fix `ControllerActivity` Controller Type Change Crash If the controller type was changed from a type with a larger amount of buttons/axes to one with a fewer amount, a crash would occur due to the transition animation retaining those elements as children yet returning `NO_POSITION` from `getChildAdapterPosition` in `DividerItemDecoration` which was an unhandled case and led to an OOB array access.	2022-04-14 14:14:52 +05:30
PixelyIon	2c46709064	Fix `ControllerPreference`'s `index` not being passed to Activity A bug caused by not passing the index argument to `ControllerActivity` led to all preferences opening the activity that pertained to Controller #1. This was fixed by passing the `index` argument in the activity launch intent.	2022-04-14 14:14:52 +05:30
PixelyIon	98b366c1f5	Fix Texture Synchronization Bug Fixes texture corruption due to incorrect synchronization, the barrier would not enforce waiting till the texture was entirely rendered causing an incomplete texture to be downloaded which lead to rendering bugs for certain GPUs including ARM's Mali GPUs.	2022-04-14 14:14:52 +05:30
PixelyIon	aea40e6496	Fix `enabledFeature2` Unlinking Assertion Bug A bug caused an assertion if both `VK_EXT_custom_border_color` and `VK_EXT_vertex_attribute_divisor` due to mistakenly unlinking `PhysicalDeviceVertexAttributeDivisorFeaturesEXT` instead of `PhysicalDeviceCustomBorderColorFeaturesEXT` when `VK_EXT_custom_border_color` isn't supported which would potentially lead to unlinking the same structure twice and cause the assertion.	2022-04-14 14:14:52 +05:30
Billy Laws	68f31c3688	Use macros for defining texture formats and their conversions Avoids the need to repeat all the possible component types for each texture format while also making them simpler to add and easier to read.	2022-04-14 14:14:52 +05:30
lynxnb	a9d4e6bb1a	Add screen orientation setting	2022-04-14 14:14:52 +05:30
PixelyIon	bc29b23972	Implement CPU-only Maxwell3D Inline Constant Buffer Updates Implements inline constant buffer updates that are written to the CPU copy of the buffer rather than generating an actual inline buffer write, this works for TIC/TSC index updates but won't work when the buffer is expected to actually be updated inline with regard to sequence rather than just as a buffer upload prior to rendering. GPU-sided constant buffer updates will be implemented later with optimizations for updating an entire range by handling GPFIFO `Inc`/`NonInc`directly and submitting it as a host inline buffer update.	2022-04-14 14:14:52 +05:30
PixelyIon	08f29f7da4	Make `ActiveDescriptorSet` movable and non-copyable There should only ever be a single instance of a `ActiveDescriptorSet` that tracks the lifetime of a descriptor set as the destructor is responsible for freeing the descriptor set. There are cases where a new object inheriting the descriptor set needs to be created in these cases we need to have move semantics and make the destructor of the prior object inert, this allows for moving to the new object without any side effects. If the copy constructor was used in these cases the older object would free the set on its destruction which would lead to the set being invalid on existing instances which is incorrect behavior and would likely lead to driver crashes.	2022-04-14 14:14:52 +05:30
PixelyIon	bb14af4f7a	Implement Maxwell3D Sampled Textures The descriptor sets should now contain a combined image and sampler handle for any sampled textures in the guest shader from the supplied offset into the texture constant buffer. Note: Games tend to rely on inline constant buffer updates for writing the texture constant buffer and due to it not being implemented, the value will be read as 0 which is incorrect.	2022-04-14 14:14:52 +05:30
PixelyIon	d9a9e52350	Use `ConstantBuffer` instead of `BufferView` for Shader Constant Buffers We want read semantics inside the constant buffer object via the mappings to avoid a pointless GPU VMM mapping lookup. It is a fairly frequent operation so this is necessary, the ability to write directly will be added in the future as well.	2022-04-14 14:14:52 +05:30
PixelyIon	adb0a16873	Implement Maxwell 3D Textures Implements parsing for the Maxwell 3D TIC pool and conversion of a TIC into a `GuestTexture`, support is limited to pitch-linear RGB565/A8R8G8B8 textures at the moment but will be extended as games utilize more formats and layouts. Support for 1D buffers is also omitted at the moment since they need special handling with them effectively being treated as buffers in Vulkan rather than images.	2022-04-14 14:14:52 +05:30
PixelyIon	a7b90e7825	Change Texture Pitch Unit to Bytes from Pixels The pitch of the texture should always be supplied in terms of bytes as it denotes alignment on a byte boundary rather than a pixel one, it is also always utilized in terms of bytes rather than pixels so this avoids an unnecessary conversion. Note: GBP stride unit was assumed to be pixels earlier but is likely bytes which is why there are no changes to the supplied value there, if this is not the case it'll be fixed in the future	2022-04-14 14:14:52 +05:30
PixelyIon	a9aa16798f	Add `-fsigned-bitfields` for defined bitfield `int` behavior We want consistent behavior between signed `int`s in bitfields and outside of bitfields, the `-fsigned-bitfields` flag enforces this behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	87c8dc94d2	Implement Maxwell3D Samplers Maxwell3D `TextureSamplerControl` (TSC) are fully converted into Vulkan samplers with extension backing for all aspects that require them (border color/reduction mode) and approximations where Vulkan doesn't support certain functionality (sampler address mode) alongside cases where extensions may not be present (border color).	2022-04-14 14:14:52 +05:30
PixelyIon	e48a7d7009	Fix Mapping Caching For Maxwell 3D Buffers Code involving caching of mappings was copied from `RenderTarget` without much consideration for applicability in buffers, the reason for caching mappings in RTs was that the view may be invalidated by more than the IOVA/Size being changed but this doesn't hold true for buffers generally so invalidation can only be on the view level with the mappings being looked up every time since the invalidation would likely change them.	2022-04-14 14:14:52 +05:30
PixelyIon	ff27dce24c	Implement `ObjectHash` for hashing trivial objects in maps `std::hash` doesn't have a generic template where it can be utilized for arbitrary trivial objects and implementing this might result in conflicts with other types. To fix this a generic templated hash is now provided as a utility structure, that can be utilized directly in hash-based containers such as `unordered_map`.	2022-04-14 14:14:52 +05:30
PixelyIon	97cfcba0da	Add Nullability for Optional Semantics to `span` Nullability allow for optional semantics where a span may be explicitly invalidated with `nullptr` being used as a sentinel value for it and a boolean operator that allows trivial checking for if the span is valid or not.	2022-04-14 14:14:52 +05:30
PixelyIon	c11962e8e4	Implement Maxwell3D Bindless Texture Constant Buffer Index The index of the constant buffer with bindless texture descriptors is now retrieved from Maxwell3D register state and passed to the shader compiler.	2022-04-14 14:14:52 +05:30
PixelyIon	1c3f62b7b4	Implement Maxwell3D Indexed Drawing	2022-04-14 14:14:52 +05:30
PixelyIon	23cdfe2139	Implement Maxwell3D Index Buffers Adds support for index buffers including U8 index buffers via the `VK_EXT_index_type_uint8` extension which has been added as an optional quirk but an exception will be thrown if the guest utilizes it but the host doesn't support it.	2022-04-14 14:14:52 +05:30
PixelyIon	a4041364e1	Address CR comments Note: CR comments regarding `ShaderSet` and `PipelineStages` will be addressed at a later date with a common class for associative enum arrays.	2022-04-14 14:14:52 +05:30
PixelyIon	e1e14e781f	Support Dual Vertex Shader Programs Add support for parsing and combining `VertexA` and `VertexB` programs into a single vertex pipeline program prior to compilation, atomic reparsing and combining is supported to only reparse the stage that was modified and recombine once at most within a single pipeline compilation.	2022-04-14 14:14:52 +05:30
PixelyIon	974cf03c18	Add Atomic Pipeline Stage Invalidation Atomically invalidate pipeline stages as runtime information that pertains to them changes rather than never recompiling pipelines on runtime information being updated resulting in out of date pipelines or recompiling all pipelines on any runtime information updates.	2022-04-14 14:14:52 +05:30
PixelyIon	5414db8411	Rework Maxwell3D Shader/Pipeline Stages Compilation with UBO support Shader compilation is now broken into shader program parsing and pipeline shader compilation which will allow for supporting dual vertex shaders and more atomic invalidation depending on runtime state limiting the amount of work that is redone. Bindings are now properly handled allowing for bound UBOs to be converted to the appropriate host UBO as designated by the shader compiler by creating Vulkan Descriptor Sets that match it.	2022-04-14 14:14:52 +05:30
PixelyIon	055d315048	Seperate Maxwell3D Stages into Shader/Pipeline We need this to make the distinction between a shader and pipeline stage in as shader programs are bound at a different rate than that of pipeline stage resources such as UBO.	2022-04-14 14:14:52 +05:30
PixelyIon	492dd47218	Implement Vulkan Descriptor Set Allocator A fixed descriptor set allocator which manages the size of the pool with automatic reallocations when any allocations run out of descriptors.	2022-04-14 14:14:52 +05:30
PixelyIon	9af9f1d41a	Implement Maxwell3D Constant Buffer Selector The Constant Buffer Selector is used to point to a constant buffer that will be bound to a shader stage or updated with inline data.	2022-04-14 14:14:52 +05:30
PixelyIon	afa34e320a	Retain Shader Binding State Across Stages An instance of `Shader::Backend::Bindings` must be retained across all stages for correct emission of bindings, which is now done inside `GraphicsContext::GetShaderStages`.	2022-04-14 14:14:52 +05:30
PixelyIon	550d12b7fa	Set Shader Runtime Generic Vertex Attribute Types Correctly The vertex attribute types supplied prior were just the default which is `Float`, this works for some cases but will entirely break if the attribute type isn't a float. The attribute types are now set correctly.	2022-04-14 14:14:52 +05:30
PixelyIon	a2de6b9255	Fix Maxwell3D `vertexEndGl` Register Offset The offset was set to 0x586 which is the location of `vertexBeginGl`, it's been corrected now and set to 0x585.	2022-04-14 14:14:52 +05:30
Billy Laws	5815cda7a7	Update Vulkan-Hpp to v1.2.202	2022-04-14 14:14:52 +05:30
PixelyIon	bd6cd0056c	Support Multi-Aspect Copy in `Texture::CopyIntoStagingBuffer` Only copying a single aspect was supported by `CopyIntoStagingBuffer` earlier due to not supplying a `VkBufferImageCopy` for each aspect separately, this has now been done with Color/Depth/Stencil aspects having their own `VkBufferImageCopy` for the `VkCmdCopyImageToBuffer` command.	2022-04-14 14:14:52 +05:30
PixelyIon	daff17c776	Order `TextureView` Definition Correctly The definition of the `TextureView` class was spread across `texture.cpp` and has now been moved to the top of the file above the other half of the definition.	2022-04-14 14:14:52 +05:30
PixelyIon	189b9533f2	Disable Vertex Buffers With 0 as IOVAs A buffer with 0 as the start/end IOVA should be invalid as there shouldn't be any mappings at 0 in GPU VA, titles such as Puyo Puyo Tetris configure the Vertex Buffer with 0 IOVAs which leads to a segmentation fault without this exception.	2022-04-14 14:14:52 +05:30
PixelyIon	cfeb8098db	Attach `TextureView`/`BufferView` Lifetime to `FenceCycle` The lifetime of a texture and buffer view is now bound by the `FenceCycle` in `CommandExecutor`, this ensures that a `VkImageView` isn't destroyed prior to usage leading to UB.	2022-04-14 14:14:52 +05:30
PixelyIon	34fc1e32b8	Remove `Texture`s from `RenderPassNode::Storage` The lifetime of all textures bound to a RenderPass alongside syncing of textures is already handled by `CommandExecutor` and doesn't need to be redundantly handled by `RenderPassNode`. It's been removed as a result of this.	2022-04-14 14:14:52 +05:30
PixelyIon	45c7a89fc3	Cleanup `BufferView`/`TextureView` Locking Code Renames the variable to be neater and less confusing alongside adding comments for `try_lock()` to make the goal of the function more apparent.	2022-04-14 14:14:52 +05:30
PixelyIon	7776ef2cd0	Support Depth/Stencil RT in Draw Adds the depth/stencil RT as an attachment for the draw but with `VkPipelineDepthStencilStateCreateInfo` stubbed out, it'll not function correctly and the contents will not be what the guest expects them to be.	2022-04-14 14:14:52 +05:30
PixelyIon	525850ae09	Stub `VkPipelineDepthStencilStateCreateInfo` Maxwell3D Depth State is composed of several registers and will be implemented at a later date, for the time being it's been stubbed.	2022-04-14 14:14:52 +05:30
PixelyIon	9e63ecf05d	Implement Maxwell3D Depth/Stencil Clears Support for clearing the depth/stencil RT has been added as its own function via either optimized `VkAttachmentLoadOp`-based clears or `vkCmdClearAttachments`. A bit of cleanup has also been done for color RT clears with the lambda for the slow-path purely calling the command rather than creating the parameter structures.	2022-04-14 14:14:52 +05:30
PixelyIon	bf89f96bf5	Implement Optimized LoadOp Clears for Depth/Stencil Attachments Implements `AddClearDepthStencilSubpass` in `CommandExecutor` which is similar to `ClearColorAttachment` in that it uses `VK_ATTACHMENT_LOAD_OP_CLEAR` for the clear which is far more efficient than using `VK_ATTACHMENT_LOAD_OP_LOAD` then doing the clear.	2022-04-14 14:14:52 +05:30
PixelyIon	6f6413f02d	Fix `VkSubpassDependency` for Depth/Stencil Attachments The stage/access mask for `VkSubpassDependency` were hardcoded to only be valid for color attachments earlier, this has now been fixed by branching based on the format aspect.	2022-04-14 14:14:52 +05:30
PixelyIon	aa32f6b017	Add Depth/Stencil Format Support to `Texture` Sets `VkImageUsageFlags` correctly rather than hardcoding it for color attachments and adds multiple `VkBufferImageCopy` to `VkCmdCopyBufferToImage` for Color/Depth/Stencil aspects of an image.	2022-04-14 14:14:52 +05:30
PixelyIon	68c990c041	Implement Maxwell3D Depth/Stencil Render Target Support the Maxwell3D Depth RT for Z-buffering, this just creates an equivalent `RenderTarget` object with no support on the API-user side (IE: `Draw` and `ClearBuffers`).	2022-04-14 14:14:52 +05:30
PixelyIon	2a8bcc60c7	Make Render Targets Abstract for Color/Depth RTs This prefixes all RT functions that deal with color RTs with `Color` and abstracts out common functions that will be used for both color and depth RTs. All common Maxwell3D structures are also moved out of the `ColorRenderTarget` (`RenderTarget` previously) structure.	2022-04-14 14:14:52 +05:30
PixelyIon	b0f084ae32	Implement Shader Compiler Input Topology Sets the input toplogy in the runtime information for the shader compiler correctly based on the Maxwell3D input topology.	2022-04-14 14:14:52 +05:30
PixelyIon	7a63ad7d3d	Implement `VkPipelineCache` for host pipeline caching To allow for caching of pipelines on the host a `VkPipelineCache` has been added, it is entirely in-memory and is not flushed to the disk which'll be done in the future alongside caching guest shaders to further avoid translation where possible.	2022-04-14 14:14:52 +05:30
PixelyIon	4dcf12c4c0	Implement Maxwell3D Draws Uses all Maxwell3D state converted into Vulkan state to do an equivalent draw on the host GPU, it sets up RT/Vertex Buffer/Vertex Attribute/Shader state and creates a stubbed out `VkPipelineLayout` for the draw. Any descriptor state isn't currently handled and is yet to be implemented, currently there's no Vulkan pipeline cache supplied which will be implemented subsequently.	2022-04-14 14:14:52 +05:30
PixelyIon	57b0d6a2fb	Stub `VkPipelineMultisampleStateCreateInfo` Multisampling will be worked on later and for the time being is being safely stubbed by setting the sample count to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	56b3a01a59	Track `VkRenderPass` and Subpass Index for Subpass Function Nodes We require a handle to the current renderpass and the index of the subpass in certain cases, this is now tracked by the `CommandExecutor` and passed in as a parameter to `NextSubpassFunctionNode` and the newly-introduced `SubpassFunctionNode`.	2022-04-14 14:14:52 +05:30
PixelyIon	cb7f68b98d	Allow Attaching Texture/Buffers to `CommandExecutor` Switch from `SubmitWithCycle` to manually allocating the active command buffer to tag dependencies with the `FenceCycle` that prevents them from being mutated prior to execution. This new paradigm could also allow eager recording of commands with only submission being deferred.	2022-04-14 14:14:52 +05:30
PixelyIon	aeea3e6f66	Allow manual allocation of `ActiveCommandBuffer` `CommandScheduler` API users can now directly allocate an active command buffer that they need to manage alongside its fence, this can allow for more efficient recording as it doesn't need to be immediately submitted after, it can also allow attaching objects to a `FenceCycle` prior to submission that can be useful for locking resources.	2022-04-14 14:14:52 +05:30
PixelyIon	8989305637	Implement Host Vertex Buffer Translation Uses the buffer cache to retrieve an equivalent host vertex buffer for a corresponding guest vertex buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	b6ba770a27	Implement Maxwell3D Shader Compilation Compiles shaders supplied by the guest with caching and automatic invalidation, the size of the shader is also automatically determined by looking for `BRA $` instructions which cause an infloop, it should be noted that we have a maximum shader bytecode size, any shader above this size will not be supported.	2022-04-14 14:14:52 +05:30
PixelyIon	08afda6ac4	Implement Graphics Shader Compilation in `ShaderManager` Graphics shaders can now be compiled using the shader compiler and emit SPIR-V that can be used on the host. The binding state isn't currently handled alongside constant buffers and textures support in `GraphicsEnvironment` yet.	2022-04-14 14:14:52 +05:30
PixelyIon	353ca8ec84	Fix Viewport X/Y Translation The operands of the subtraction in the X/Y translation calculation were the wrong way around which led to negative translations that would translate the viewport off the screen.	2022-04-14 14:14:52 +05:30
PixelyIon	f06a12170f	Set Default Color Write Mask to RGBA The default color write mask should mask no channels and write all of them and should be mutated to mask out certain channels as required by the guest.	2022-04-14 14:14:52 +05:30
PixelyIon	23faf1370c	Use Static Arrays for Vertex Buffer Bindings & Attributes We cannot statically construct the vertex buffer/attribute arrays for Vulkan due to inactive attributes or buffers which isn't possible on Vulkan, we also cannot just change the count dynamically as there might be disabled buffers or attributes in the middle. We just have a `static_array` which should dynamically be filled in with buffer binding/attribute Vulkan structures before submission.	2022-04-14 14:14:52 +05:30
PixelyIon	8652edb07b	Make `GuestBuffer` format-less Buffers generally don't have formats that are fundamentally associated with them unless they're texel buffers, if that is the case it can be manually set in `BufferView`.	2022-04-14 14:14:52 +05:30
PixelyIon	03314ec7d2	Introduce `BufferManager` The Buffer Manager handles mapping of guest buffers to host buffer views with automatic handling of sub-buffers and eventually supporting recreation of overlapping buffers to create a single larger buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	bde61d72cc	Introduce `Buffer` and `BufferView` Implements infrastructure for using guest buffers on the host for rendering, a `BufferManager` is still missing which'd handle mapping from guest buffers to host buffers and will be subsequently committed. It should be noted that `BufferView` is also disconnected from `Buffer` and shared for every instance with the same properties like `TextureView` is now.	2022-04-14 14:14:52 +05:30
PixelyIon	6eda1777c5	Rework `TextureView` to be disconnected from `Texture` We want `TextureView`(s) to be disconnected from the backing on the host and instead represent a specific texture on the guest with a backing that can change depending on mapping of new textures which'd invalidate the backing but should now be automatically repointed to an appropriate new backing. This approach also requires locking of the backing to function as it is mutable till it has been locked or the backing has an attached `FenceCycle` that hasn't been signaled which will be added for `CommandExecutor` in a subsequent commit.	2022-04-14 14:14:52 +05:30
PixelyIon	82916657fb	Only Enable Shader Compiler Debug Mode in Debug Builds Sets properties that relate to debugging in `Shader::Settings` to `true` only for debug builds while leaving them disabled for release builds.	2022-04-14 14:14:52 +05:30
PixelyIon	b09f28c0ba	Implement Missing Shader Compiler Quirks Introduces the `supportsShaderViewportIndexLayer` quirk and sets `Shader::Profile::support_int64_atomics` depending on if the `supportsAtomicInt64` quirk is available.	2022-04-14 14:14:52 +05:30
PixelyIon	f3e81094a2	Implement Shader Compiler Property Quirks Introduces the `floatControls`, `supportsSubgroupVote` and `subgroupSize` quirks for the shader compiler which are based on Vulkan `PhysicalDevice` properties.	2022-04-14 14:14:52 +05:30
PixelyIon	51c4df24b5	Switch from `VK_VERSION_` to `VK_API_VERSION_` macros Vulkan has officially deprecated `VK_VERSION_*` macros for versioning as it has introduced the variant into the version. It should however be `0` for the Vulkan APi and doesn't need to be printed.	2022-04-14 14:14:52 +05:30
PixelyIon	0588a525b4	Implement Shader Compiler Extension/Feature Quirks Introduces several quirks for optional features used by the shader compiler which are now reported in the `Shader::HostTranslateInfo` and `Shader::Profile` structure. There are still property-related quirks for the shader compiler which haven't been implemented in this commit.	2022-04-14 14:14:52 +05:30
PixelyIon	8f3887c56a	Create `memory::Buffer` & Implement `StagingBuffer` as derivative A `Buffer` class was created to hold any generic Vulkan buffer object with `span` semantics, `StagingBuffer` was implemented atop it as a wrapper for `Buffer` that inherits from `FenceCycleDependency` and can be used as such.	2022-04-14 14:14:52 +05:30
PixelyIon	a55aca76c6	Rename `TextureView::backing` to `TextureView::texture` It was determined that `backing` wasn't a very descriptive name and that it conflicted with the texture's own backing, the name was changed to `texture` to make it more apparent that it was specifically the `Texture` object backing the view.	2022-04-14 14:14:52 +05:30
PixelyIon	482c573b81	Introduce `FlatMemoryManager::ReadTill` for scanning semantics A memory manager function to read into a vector till it satisfies the supplied function or hits an early stop condition like hitting the end of vector or reaching an unmapped region. This can be used to efficiently scan for values in GPU VA.	2022-04-14 14:14:52 +05:30
PixelyIon	31c4f1ca4e	Unlink `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when disabled When `VK_EXT_vertex_attribute_divisor` is not available, `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` is unlinked from the device enabled feature list as it is undefined behavior to link a structure provided by an extension without enabling that extension.	2022-04-14 14:14:52 +05:30
PixelyIon	032866c9b1	Allow Injecting External Vulkan Layers Set `com.android.graphics.injectLayers.enable` to allow injection of external Vulkan layers which is done by GPU debuggers such as RenderDoc.	2022-04-14 14:14:52 +05:30
PixelyIon	7df2670ece	Fix `QuirkManager`'s `EXT_SET_V` macro bug `EXT_SET_V` would enable the extension regardless of if it was actually the correct extension or if the version was high enough as long as the hash matched. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-04-14 14:14:52 +05:30
PixelyIon	e9ed771b48	Check for `supportsMultipleViewports` feature before usage If the host only supports a single viewport then we set `viewportCount` and `scissorCount` in `VkPipelineViewportStateCreateInfo` to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	3e45006d14	Make `shaderImageGatherExtended` a required `VkDevice` feature `shaderImageGatherExtended` is required by the shader compiler, to avoid complications associated with making it optional and considering that it's supported by the vast majority of Vulkan mobile devices, it was made a mandatory feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ece2785582	Introduce `ShaderManager` with Proxy Shader Compiler Logger/Settings This class will be entirely responsible for any interop with the shader compiler, it is also responsible for caching and compilation of shaders in itself.	2022-04-14 14:14:52 +05:30
PixelyIon	def9cedbee	Add yuzu Shader Compiler as a submodule We plan to use our fork of yuzu's shader compiler for GPU shader compilation so it's been added as a submodule.	2022-04-14 14:14:52 +05:30
PixelyIon	746af4cb4c	Add Sirit as a submodule We require Sirit as it is a dependency for yuzu's shader compiler where it uses it to emit SPIR-V in an easy and efficient manner.	2022-04-14 14:14:52 +05:30
PixelyIon	dbc94f36d3	Add Range v3 as a submodule We want to utilize features from C++ 20 ranges but they haven't been entirely implemented in libc++ so in the meantime we use the reference implementation for it which is Ranges v3.	2022-04-14 14:14:52 +05:30
PixelyIon	89e9a41a86	Implement `VkPipelineViewportStateCreateInfo` "Viewport Transforms" and "Viewport Scissors" were combined into one section to reflect their state in Vulkan correctly like all other sections.	2022-04-14 14:14:52 +05:30
PixelyIon	38119e21d4	Implement Vulkan-Supported Maxwell3D Primitive Topologies Any primitive topologies that are directly supported by Vulkan were implemented but the rest were not and will be implemented with conversions as they are used by applications, they are: * LineLoop * QuadList * QuadStrip * Polygon	2022-04-14 14:14:52 +05:30
PixelyIon	138f884159	Implement Maxwell3D Vertex Attributes Translates all Maxwell3D vertex attributes to Vulkan with the exception of `isConstant` which causes the vertex attribute to return a constant value `(0,0,0,X)` which was trivial in OpenGL with `glDisableVertexAttribArray` and `glVertexAttrib4(..., 0, 0, 0, 1)` but we don't have access to this in Vulkan and might need to depend on undefined behavior or manually emulate it in a shader. This'll be revisited in the future after checking host GPU behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	4b9f99bb27	Make `ENUM_STRING` function `static` `ENUM_STRING` can be used inside a `class`/`struct`/`union` for `enum`s contained within them. Making the function `static` allows doing this and doesn't require supplying a `this` pointer of the enclosing class for usage.	2022-04-14 14:14:52 +05:30
PixelyIon	c2a6da6431	Implement Maxwell3D Vertex Buffer Limit Sets the end of VBOs based on the `vertexArrayLimits` register array which provides an IOVA to the end of the VBO.	2022-04-14 14:14:52 +05:30
PixelyIon	d8890f13e1	Explicitly make `default` case `break` for `Maxwell3D::HandleMethod` This being made implicit removes any confusion that all cases would need to be implemented and explicitly define that the CF should continue onto the 2nd switch-case when it cannot find any matches in the first one.	2022-04-14 14:14:52 +05:30
PixelyIon	612f324e78	Implement Maxwell3D Vertex Buffer Instance Rate Implements the `isVertexInputRatePerInstance` register array which controls if the vertex input rate is either per-vertex or per-instance. This works in conjunction with the vertex attribute divisor for per-instance attribute repetition of attributes.	2022-04-14 14:14:52 +05:30
PixelyIon	476c070c7a	Fix Minor Maxwell3D Register Ordering Issues We order all registers in ascending order, a few registers namely `colorLogicOp`, `colorWriteMask`, `clearBuffers` and `depthBiasClamp` were erroneously not following this order which has now been fixed.	2022-04-14 14:14:52 +05:30
PixelyIon	32de7e5150	Use `decltype` over `typeof` globally We inconsistently utilized `typeof` and `decltype` all over the codebase, this has now been fixed by uniformly using `decltype` as `typeof` is a GCC extension and not in the C++ standard alongside having the hidden side effect of removing references from the determined type.	2022-04-14 14:14:52 +05:30
PixelyIon	841ee9fc15	Check for `vertexAttributeInstanceRateZeroDivisor` feature before usage Check for `vertexAttributeInstanceRateZeroDivisor` in `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when the Maxwell3D register corresponding to the vertex attribute divisor is set to 0. If it isn't then it logs a warning and sets the value anyway which could result in UB since the only alternative is an exception that stops emulation which might not be optimal if the game mostly works fine without this, we will add a user-facing warning when we intentionally allow UB like this in the future.	2022-04-14 14:14:52 +05:30
PixelyIon	c3895a8197	Support `VkPhysicalDeviceFeatures2` Extensions Implement the infrastructure to depend on `VkPhysicalDeviceFeatures2` extended feature structures which can be utilized to retrieve the specifics of features from extensions. It is implemented in the form of `vk::StructureChain` with `vk::PhysicalDeviceFeatures2` that can be extended with any extension feature structures.	2022-04-14 14:14:52 +05:30
PixelyIon	ff5515d4d1	Implement Maxwell3D Vertex Buffer Bindings This implements everything in Maxwell3D vertex buffer bindings including vertex attribute divisors which require the extension `VK_EXT_vertex_attribute_divisor` to emulate them correctly, this has been implemented in the form of of a quirk. It is dynamically enabled/disabled based on if the host GPU supports it and a warning is provided when it is used by the guest but the host GPU doesn't support it.	2022-04-14 14:14:52 +05:30
PixelyIon	d163e4ffa6	Introduce `IOVA` union for flipped words of IOVAs in `GraphicsContext` The Maxwell3D `Address` class follows the big-endian register ordering for addresses while on the host we consume them in little-endian, the `IOVA` class is the host equivalent to the `Address` class with implicitly flipped 32-bit register ordering. It shares implicit decomposition semantics from `Address` due to similar requirements with a minor difference of being returned by reference rather than value as we want to have value setting semantics with implicit decomposition while we don't for `Address`.	2022-04-14 14:14:52 +05:30
PixelyIon	73646c4da8	Implicitly decompose `Address` into `u64` The semantics of implicitly decomposing the `Address` class into a `u64` were determined to be appropriate for the class. As it is an integer type this effectively retains all semantics from using an integer directly for the most part.	2022-04-14 14:14:52 +05:30
PixelyIon	48d0b41f16	Implement Maxwell3D Common/Independent Color Write Mask Maxwell3D supports both independent and common color write masks like color blending but for common color write masks rather than having register state specifically for it, the state from RT 0 is extended to all RTs. It should be noted that color write masks are included in blending state for Vulkan while being entirely independent from each other for Maxwell, it forces us to use the `independentBlend` feature even when we are doing common blending unless the color write mask is common as well but to simplify all this logic the feature was made required as it supported by effectively all targeted devices.	2022-04-14 14:14:52 +05:30
PixelyIon	92809f8a78	Implement Maxwell3D Independent/Common Color Blending Maxwell3D supports independent blending which has different blending per-RT and common blending which has the same blending for all RTs. There is a register determining which mode to utilize and we simply have two arrays of `VkPipelineColorBlendAttachmentState` for the RTs that we toggle between to make the transition between them extremely cheap.	2022-04-14 14:14:52 +05:30
PixelyIon	2ceb6465e8	Make `independentBlend` a required `VkDevice` feature Independent blending is supported by effectively every Vulkan 1.1 Android GPU, it gives us the ability to architecture Maxwell3D blending emulation better as we can avoid additional checks for independent blending state and having a fallback path for when the host doesn't support the feature.	2022-04-14 14:14:52 +05:30
PixelyIon	cd737fbdd8	Add Required `VkDevice` Features A prior commit added the ability to utilize features with quirks but this implements the ability to require a feature be present on the host or an exception will be thrown. It allows us to make useful assumptions that result in a better architecture in certain cases.	2022-04-14 14:14:52 +05:30
PixelyIon	081d3277c1	Enable Quirk + Required `VkDevice` Extensions Implements the infrastructure required to enable optional extensions set in `QuirkManager` alongside the required extensions in the `GPU` class. All extensions should be correctly resolved now and according to what the device supports.	2022-04-14 14:14:52 +05:30
PixelyIon	6099b1ead5	Fix Maxwell3D Register `lineWidthAliased` Offset The offset was incorrectly set to `0x4D` rather than `0x4ED` which is what it should be. This would've led to bugs in line width determination and likely broken any aliased line rendering entirely.	2022-04-14 14:14:52 +05:30
PixelyIon	51659e1329	Enable `VkDevice` Features Selectively We selectively enable GPU features that we require as enabling all of them might result in extra driver overhead in certain circumstances. Setting them is handled by `QuirkManager` with the new `FEAT_SET` function that ties a quirk with a feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ec378814aa	Stub Maxwell3D Alpha Testing We stub alpha testing as it doesn't exist in Vulkan and few titles use it, it can be emulated in the future using a shader patch with manually discarding fragments failing the alpha test function but this'll be added in later as it isn't high priority at the moment and has associated overhead with it so other options might be explored at the time.	2022-04-14 14:14:52 +05:30
PixelyIon	83ec99fa48	Print GPU Quirks At Startup It is essential to know what quirks a certain GPU may have to debug an issue, these are now printed at startup into the log alongside all other GPU information. A new `QuirkManager::Summary` function was implemented to provide this functionality.	2022-04-14 14:14:52 +05:30
PixelyIon	01eb16e59a	Implement Maxwell3D Color Logic Operations Implements a basic part of Vulkan blending state which are color logic operations applied on the framebuffer after running the fragment shader. It is an optional feature in Vulkan and not supported on any mobile GPU vendor aside from ImgTec/NVIDIA by default.	2022-04-14 14:14:52 +05:30
PixelyIon	662935c35d	Attempt Flushing `Logger` During Fatal Signals Any signals that lead to exception handling being triggered now attempt to flush all logs given that the log mutex is unoccupied, this is to mostly help logs be more complete when exiting isn't graceful.	2022-04-14 14:14:52 +05:30
PixelyIon	586bee2c59	Remove Maxwell3D Zero Initialization Calls A lot of calls in Maxwell3D register initialization ended up setting the register to 0 which should be implicit behavior and most calls would be eliminated by the redundancy check which had to be manually disabled. It was determined to be better to move this responsibility to the called function to initialize to state equivalent to the corresponding register being 0. All initialization calls with the argument as 0 have been removed now due to this, it was the vast majority of calls.	2022-04-14 14:14:52 +05:30
PixelyIon	49cc0964e2	Initialize Maxwell3D Registers Correctly Maxwell3D Registers weren't initialized to the correct values prior, this commit fixes that by doing `HandleMethod` calls with all the register values being initialized. This is in contrast to the registers being set without calling the methods in `GraphicsContext` or otherwise resulting in bugs.	2022-04-14 14:14:52 +05:30
PixelyIon	ea87b8878c	Implement `B8G8R8A8{Unorm/Srgb}` RT Format Needed by Maxwell3D Register Initialization	2022-04-14 14:14:52 +05:30
PixelyIon	69e7d8b574	Remove Vulkan to Skyline Format Conversion Function The function `GetFormat` was seemingly no longer required due to us never converting from a Vulkan format to a Skyline format, most conversions only went from Skyline to Vulkan and were generally lossy due to certain formats being missing in Vulkan and approximated using channel swizzles. As a result of this, it was pointless to maintain and has now been removed.	2022-04-14 14:14:52 +05:30
PixelyIon	5ed26fef23	Implement Maxwell3D Rasterizer State Maxwell3D registers relevant to the Vulkan Rasterizer state have been implemented aside from certain features such as per-face polygon modes that cannot be implemented due to Vulkan limitations. A quirk was utilized to dynamically support the provoking vertex being the last vertex as opposed to the first as well.	2022-04-14 14:14:52 +05:30
PixelyIon	8ef225a37d	Introduce `QuirkManager` for runtime GPU quirk tracking We require a way to track certain host GPU features that are optional such as Vulkan extensions, this is what the `QuirkManager` class does as it checks for all quirks and holds them allowing other components to branch based off these quirks.	2022-04-14 14:14:52 +05:30
PixelyIon	8803616673	Reorder `GraphicsContext` Members All members are now placed at the start of sections they are relevant to rather than at the start of the class.	2022-04-14 14:14:52 +05:30
PixelyIon	107d337d77	Fix `MacroInterpreter::MethodAddress` Bitfield Padding Due to compiler alignment issues, the bitfield member `increment` of `MacroInterpreter::MethodAddress` was mistakenly padded and moved to the next byte. This has now been fixed by making its type `u16` like the member prior to it to prevent natural alignment from kicking in.	2022-04-14 14:14:52 +05:30
PixelyIon	26966287c7	Implement Maxwell3D Shader Program Registers This commit added basic shader program registers, they simply track the address a shader is pointed to at the moment. No parsing of the shader program is done within them.	2022-04-14 14:14:52 +05:30
PixelyIon	93ea919c8f	Fix warnings from `NVRESULT` due to unused lambda capture A previously used `this` capture is no longer used since the introduction of the static logger.	2022-04-14 14:14:52 +05:30
lynxnb	882b939335	Automatically import key files from search location	2022-03-25 09:40:21 +05:30
Robin Kertels	6cf2ef8fb9	Grant Uri permission when sharing logs	2022-03-24 03:39:41 +05:30
lynxnb	092dcb18c8	Stub `ectx:w` and `ectx:aw` Glue services	2022-02-06 21:57:38 +05:30
Robin Kertels	2993f65a1c	Ignore empty lines in key files	2022-02-04 23:51:46 +05:30
MCredstoner2004	0ceecbba4f	Implement fixed aspect ratio surface Adds a setting to letterbox the surface with black bars to 16:9 or 21:9 aspect ratio to avoid stretching out the rendered image	2022-01-12 22:09:39 +05:30
lynxnb	6913a90361	Use the new log file name & ext for every logger context	2021-11-11 16:32:19 +01:00
lynxnb	5cd1f01690	Refactor all logger calls	2021-11-11 16:13:24 +01:00
lynxnb	769e6c933d	Make `Logger` class static and introduce `LoggerContext` A thread local LoggerContext is now used to hold the output file stream instead of the `Logger` class. Before doing any logging operations, a LoggerContext must be initialized. This commit will not build successfully on purpose.	2021-11-11 16:13:24 +01:00
PixelyIon	69ef93bfa8	Add Dividers to `ControllerActivity`'s `RecyclerView` Dividers after titles were missing in `ControllerActivity` which made it look inconsistent with `SettingsActivity` which did have them. They have now been added by extending `DividerItemDecoration` to be drawn before any `ControllerHeaderItem`.	2021-11-11 20:20:19 +05:30
PixelyIon	b230afcd35	Fix FAB Icon color in `OnScreenEditActivity` The icons in these FABs had the same color as the FAB prior which led them to being invisible. This has been fixed by setting a white tint on them which makes the icons clearly visible.	2021-11-11 18:59:09 +05:30
PixelyIon	e4fbee1626	Style Improvements to `LicenseDialog` Additional padding has been added to the text alongside making it be left-aligned rather than center-aligned and justified. A newline has also been added to the copyright notice for Skyline to make it look nicer.	2021-11-11 16:35:30 +05:30
PixelyIon	36a1f2a2ec	Make all `Dialog`s use `@color/backgroundColor` as the background color We wanted the color of the modals used by the dialogs to be the same as our regular background color rather than a lighter grey. This has now been enforced with style attributes in the case of `AlertDialog` and `setBackground` in the case of `BottomSheetDialog`.	2021-11-11 16:34:38 +05:30
PixelyIon	3f3891839e	Make all `AlertDialog`s use `MaterialComponents` theme We inconsistently used `AppCompat`'s `AlertDialog` theme in Settings while using `MaterialComponents`'s theme in Controller Configuration. This has now been fixed by universally using the `MaterialComponents` theme.	2021-11-11 16:28:57 +05:30
PixelyIon	ff5887975b	Remove Logo from `MainActivity` The Skyline logo was added to the title area but it ended up being too distracting with a light theme as the logo was designed purely for a white background. Ultimately, even though it looked good with the dark theme we had to remove it.	2021-11-11 13:58:25 +05:30
PixelyIon	43b9d95dbc	Rework App Dialog Buttons Aligning the buttons to the bottom of the game image was determined to look odd due to the amount of padding between the title and buttons. They are now back to being below the title but the buttons have been resized with "Play" being a wide button while "Pin" has been replaced with Google Material Icons's "Add To Home Screen" icon and sized down to an icon-only button.	2021-11-11 13:47:20 +05:30
lynxnb	6a0ad25c27	Update `BottomSheetDialog` appearance * Fix incorrect dialog surface color * Adjust buttons' position	2021-11-10 17:41:07 +01:00
lynxnb	df120ab76d	Fix unicode characters being turned into emojis for some vendors	2021-11-10 17:41:07 +01:00
lynxnb	b7b0f07ba8	Update application branding - Logo is now displayed next to the app name - Remove search bar animation - New color accent - Improve visibility of controller binding setting's glyphs	2021-11-10 17:41:07 +01:00
lynxnb	827439d2d1	Use the new font for the application name in main activity	2021-11-10 17:41:07 +01:00
lynxnb	07f899e904	Update application icon	2021-11-10 17:41:07 +01:00
Billy Laws	d88b08d986	Address PR feedback	2021-11-10 21:35:36 +05:30
Billy Laws	1b453c04ca	Remove completed nvmap TODO Pins have been implemented so the to-do is no longer needed.	2021-11-10 21:35:36 +05:30
Billy Laws	d2d181725f	Remove unused virtEnd variable in FlatMemoryManager::{Read, Write}	2021-11-10 21:35:36 +05:30
Billy Laws	60fbfad4bc	Add virtual dtors to time service code	2021-11-10 21:35:36 +05:30
Billy Laws	ef10d3d394	Use semantic wrapping where appropriate for class initialiser lists	2021-11-10 21:35:36 +05:30
Billy Laws	6b33268d85	Remove unused gm20b EngineID enum	2021-11-10 21:35:36 +05:30
Billy Laws	73896c2e6b	Fixup nvdrv channel types to follow naming conventions	2021-11-10 21:35:36 +05:30
Billy Laws	ad900aba7a	s/Host1X/Host1x/ as per Nvidia naming	2021-11-10 21:35:36 +05:30
Billy Laws	dbfb1cfe20	Fully implement the nvdrv Host1xChannel::Submit operation This pushes a set of command buffers into the Host1x command FIFO allocated for the channel, returning fence thresholds that can be waited on for completion,	2021-11-10 21:35:36 +05:30
Billy Laws	baefb0fe93	Implement the Host1x command FIFO together with barebones Host1x classes The Host1x block of the TX1 supports 14 separate channels to which commands can be issued, these all run asynchronously so are emulated the same way as GPU channels with one FIFO emulation thread each. The command FIFO itself is very similar to the GPFIFO found in the GPU however there are some differences, mainly the introduction of classes (similar to engines) and the Mask opcode (which allows writing to a specific set of offsets much more efficiently). There is an internal Host1x class which functions similar to the GPFIFO class in the GPU, handling general operations such as syncpoint waits, this is accessed via the simple method interface. Other channels such as NVDEC and VIC are behind the 'Tegra Host Interface' (THI) in HW, this abstracts out the classes internal details and provides a uniform method interface ontop of the Host1x method one. We emulate the THI as a templated wrapper for the underlying class. Syncpoint increments in Host1x are different to GPU, the THI allows submitting increment requests that will be queued up and only be applied after a specific condition in the associated engine is met; however the option to for immediate increments is also available.	2021-11-10 21:35:36 +05:30
Billy Laws	2494cafee8	Cleanup GPFIFO comments and make Run() private	2021-11-10 21:35:36 +05:30
Billy Laws	2577658fc7	Avoid GetPointer on nvmap handles where they would be accessed via SMMU GetPointer sets the sharedMemMapped flag, which should only be set if other userspace processes have the handle mapped.	2021-11-10 21:35:36 +05:30
Billy Laws	fd0420443c	Add template utils for constructing all elements in an std::array This avoids the excessive repetition needed for the case where array members have no default constructor. eg: ```c++ std::array<Type, 10> channels{util::MakeFilledArray<Type, 10>(typeConstructorArg, <...>)}; ```	2021-11-10 21:35:36 +05:30
Billy Laws	34bf413661	Fix bitmask check for event IDs > 32 in Ctrl::SyncpointFreeEventBatch Doing 1 << 32 would result in an integer overflow rather than the desired behaviour of checking a bit, make 1 64 bit to present that.	2021-11-10 21:34:30 +05:30
Billy Laws	debab7c9c7	Implement nvmap handle pinning/unpinning nvmap allows mapping handles into the SMMU address space through 'pins'. These are refcounted mappings that are lazily freed when required due to AS space running out. Nvidia implements this by using several MRU lists of handles in order to choose which ones to free and which ones to keep, however as we are unlikely to even run into the case of not having enough address space a naive queue approach works fine. This pin infrastructure is used by nvdrv's host1x channel implementation in order to handle mapping of both command and data buffers for submit.	2021-11-10 21:34:30 +05:30
Billy Laws	a0c57256cc	Hookup FlatMemoryManager for SMMU into SoC The SMMU is used to control the mappings of peripherals such as the VIC and NVDEC.	2021-11-10 21:34:30 +05:30
Billy Laws	97dc053ffd	Move FlatAllocator allocation error handling to the caller This is a prerequisite for nvmap SMMU memory management, which only frees the memory handles refer to if an allocation fails.	2021-11-10 21:34:30 +05:30
Billy Laws	04e5237ec1	Stub host1x channel devices and IOCTLs host1x channels are generally similar to GPU channels however there is only one channel for each specific class (like a GPU engine) and an address space is shared between them all. This PR implements the simple IOCTLs with the larger ones that will depend on changes outside of nvdrv being left for future commits. This is enough to partly run oss-nvjpeg.	2021-11-10 21:34:30 +05:30
Billy Laws	5087d3dc2a	Reserve host1x channel syncpoints	2021-11-10 21:34:30 +05:30
Billy Laws	b0a5dab0f7	Support passing additional constructor arguments to nvdrv devices Needed for host1x channels which use the same class for multiple types.	2021-11-10 21:34:30 +05:30
Billy Laws	eb6f052873	Fixup GpuChannel::SetNvmapFd to take an FD rather than an nvmap handle	2021-11-10 21:34:30 +05:30
Billy Laws	386a3447a8	Introduce variable sized span support to nvdrv deserialisation The element containing the size first needs to be saved to a save slot with Save<T, slotId>, it can then be read back later as the size of a span with SlotSizeSpan<T, slotId>. This is needed to support the Host1XChannel Submit IOCTL.	2021-11-10 21:34:30 +05:30
Billy Laws	6eeaa343f8	Avoid crash when passing unallocated syncpoint IDs to EventWait	2021-11-10 21:34:30 +05:30
PixelyIon	fbfad21f03	Migrate `Maxwell3D::Registers` to `OffsetMember` Maxwell3D registers were primarily written with absolute offsets and ended up being fairly messy due to attempting to emulate this using struct relative positioning resulting in a lot of pointless padding members. This has now been improved by utilizing `OffsetMember` to directly use offsets resulting in much neater code.	2021-11-10 21:31:45 +05:30
PixelyIon	69761389ff	Extend `OffsetMember` with direct `operator=`/`operator[]` It was found to be semantically advantageous to directly pass-through certain operators such as the assignment (`=`) and array index (`[]`) operators. These would require a dereference prior to their usage otherwise but now can be directly used.	2021-11-10 21:30:02 +05:30
Billy Laws	cc7b2a0ab8	Introduce `OffsetMember` for offset-based union members The offset of a member in a structure was determined by its relative position and compiler alignment. This is unoptimal for larger structures such as those found in GPU Engines that are primarily named by offset rather than relative positioning and end up requiring a massive amount of padding to function as is. A solution to this problem was simply to supply member offsets directly which can be done by using `OffsetMember` alongside a `union`.	2021-11-10 21:17:31 +05:30
PixelyIon	fb476567ff	Introduce `JniString` as C++ wrapper over `jstring` We used to manually call JNI UTF-8 string allocation and deallocation functions when utilizing a `jstring` but this could be erroneous and is just inconvenient. All of this has now been consolidated into an class `JniString` which is a wrapper around `std::string` and creates a copy of the contents of the `jstring` in its constructor and passes them into the `std::string` constructor.	2021-11-09 21:18:47 +05:30
PixelyIon	79ceb2cf23	Improve Vulkan `Texture` Synchronization The Vulkan Pipeline Barriers were unoptimal and incorrect to some degree prior as we purely synchronized images and not staging buffers. This has now been fixed and improved in general with more relevant synchronization.	2021-11-09 21:08:03 +05:30
PixelyIon	bed9fbf5e7	Fix `EmulationActivity.vibrateDevice` assert due to `null` Vibrator `EmulationActivity.vibrateDevice` would assert when a `null` Vibrator is provided due to one not being set in the controller configuration. This has now been fixed by the code not playing a vibration when a vibration device isn't selected.	2021-11-01 00:28:11 +05:30
PixelyIon	414c0104c3	Rework Joy-Con Vibration Conversion The guest -> host vibration conversion code was entirely broken as it didn't set the vibration `start`/`end` timestamps correctly for a cycle nor did it subtract from the `totalAmplitude` (`currentAmplitude` now) when it a cycle ended due to an incorrect `if` statement and contents. It would just end up saturating the amplitude as much as possible by adding more and more to `totalAmplitude` on every cycle while never subtracting which is entirely wrong and led to a very noticeable drop in amplitude when a vibration was repeated. It's been entirely reworked to fix all the issues listed above and remove a lot of code that had no understandable purpose. More comments have also been added to the code to make it more readable with better variable and function naming alongside it.	2021-11-01 00:28:11 +05:30
PixelyIon	4b80e1f91c	Use libcxx from LLVM Project submodule The version of libcxx shipped with Android NDK is fairly outdated and doesn't contain several features we desire such as C++ 20 ranges. This has been fixed by using libcxx directly from the LLVM Project which has been added as a submodule and can be updated independently of NDK.	2021-10-31 16:04:44 +05:30
PixelyIon	82154f3ef6	Upgrade AGP to `7.1.0-beta01` & NDK to `24.0.7856742` We've moved to using a beta AGP as `7.0.2` is breaks `clangd` and other C++ features on Beta/Canary Android Studio. NDK was additionally updated with `mbedtls` to fix warnings caused by it alongside some other minor fixes to code for newer versions of libcxx. The new AGP has a bug where it does not look for executables specified in `android_gradle_build.json` in `PATH` that includes `ninja` which is provided by the `ninja-build` package on the system rather than Android SDK's CMake on GitHub Actions (Ubuntu 20.04). This has been fixed by symlinking `/usr/bin/ninja` to the project root which is searched in for the `ninja` executable.	2021-10-31 15:50:15 +05:30
PixelyIon	962d8dc4c8	Return immediately for non-joining `KProcess::Kill`s when already killed Locking `KProcess::threadMutex` when a process is being killed by another thread with `join` can lead to the non-joining killer effectively joining as it's waiting on the joining killer to relinquish the mutex. This has been fixed by having an atomic boolean tracking if the process has already been killed and if it has, immediately returning prior to locking the mutex for any non-joining killers.	2021-10-31 15:45:10 +05:30
Billy Laws	6f59cba68d	Adds bounds checks to resampler to avoid OOB reads Resampling would sometimes perform an OOB read into `inputBuffer` due it not containing enough data to calculate corresponding the output sample, this has been fixed by introducing bounds checking to ensure that the buffer has enough data.	2021-10-29 21:46:51 +05:30
PixelyIon	9e3b7a75b2	Use `finishAffinity` instead of `finishAndRemoveTask` The method used to finish (`finishAndRemoveTask`) an activity prior to going back to `MainActivity` or restarting the process led to the process prematurely exiting entirely and would result in it not being restarted or another activity not being launched. This has now been fixed by utilizing `finishAffinity` in its place which correctly only ends the activities with the same affinity as the caller.	2021-10-29 21:20:04 +05:30
PixelyIon	9f5ab13858	Implement `R16G16{Unorm/Sint/Uint}` RT Formats Utilized in "The Touryst"	2021-10-29 20:21:58 +05:30
PixelyIon	afebf77544	Zero-Initialize `GuestTexture` Members Members of `GuestTexture` were apparently not being initialized and this led to UB since they would be read as random values. Titles such as Super Mario Odyssey avoided setting `baseArrayLayer` which led to it being left at the default value which was completely random and this would lead to crashes. This commit fixes this by initializing said values correctly.	2021-10-27 22:49:45 +05:30
PixelyIon	a0921f8261	Implement `R16G16B16A16Snorm`/`R16G16B16A16Sint` RT Formats Utilized in "The Touryst"	2021-10-27 22:49:45 +05:30
PixelyIon	df64ff5d14	Zero-Fill `IAudioRenderer::RequestUpdate` Output Buffer Some titles don't clear the output buffer prior to submission, as the service is expected to fill all of it in, our audren implementation is incomplete and doesn't end up doing this leaving the contents of the buffer to be undefined leading to UB in the form of SEGFAULTs or the application throwing a fatal error. This has been patched over by 0-filling the buffer which is a sane default value for the fields that aren't filled in albeit not a replacement for a proper audren implementation.	2021-10-27 16:18:20 +05:30
PixelyIon	1f3519e6e3	Fix Logger Message OOB Access Certain titles can submit logs where the last field is one off by the buffer end, the logger loop now considers this and terminates if there isn't enough data left to read the field type and length.	2021-10-26 21:59:47 +05:30
PixelyIon	645183c903	Fix OOB Vibration Array Access in `VibrateDevice` Access to the `vibrations` field in `vibrations[3].period` could lead to UB, this has been replaced with a proper check which adds up the period over all vibrations instead. A minor cleanup with variable names and explicit types for integer arithmetic has also been done.	2021-10-26 21:57:28 +05:30
PixelyIon	cfdb2abf9e	Fix Non-`builtin` Uncached `Vibrator` Getter If a non-builtin vibrator was attempted to be fetched, it'd insert it in the vibrator cache and return directly as opposed to playing the vibration on it prior to returning. This has now been fixed, the value is both put into the cache and the vibration is played on it.	2021-10-26 21:54:33 +05:30
PixelyIon	dc3f7f1ab4	Fix Incorrect Scissor Extent The decomposition from `texture::Dimension` to `vk::Rect2D` was somehow implicit and completely incorrect resulting in wrong conversion with undefined values. It's now been fixed by explicitly setting `vk::Rect2D::extent` to `scissor` specifically.	2021-10-26 20:08:53 +05:30
PixelyIon	a60f238479	Fix `GPU::DebugCallback` Type String Extraction The second parameter of `std::string_view::substr` was assumed to be an end position (similar to `std::span`) rather than `count` which it is. As a result of this, it was entirely broken but only held together by a constant factor being subtracted from it which was derived by trial and error. It's now been fixed by returning a count rather than the absolute position.	2021-10-26 20:08:35 +05:30
Billy Laws	70d1b4994c	Enable Wconversion and fix warnings produced	2021-10-26 11:41:24 +01:00
PixelyIon	661396fc97	Resume `EmulationActivity` when launcher icon is used Fix a bug where attempting to launch Skyline from the launcher while emulation was in progress would result in `MainActivity` launching rather than `EmulationActivity`. We always want `EmulationActivity` to stay on top of the stack and be launched whenever Skyline is resumed, no other activity should be able to run in parallel to `EmulationActivity` in any user-accessible manner.	2021-10-26 10:46:36 +05:30
PixelyIon	39e924aec8	Resume rather than relaunch when same shortcut is used	2021-10-26 10:46:36 +05:30
Billy Laws	1e7347bf72	Use semantic wrapping for nvdrv where appropriate	2021-10-26 10:46:36 +05:30
PixelyIon	830a800d9e	Consolidate `AddAttachment` Loops + Rename `Renderpass` -> `RenderPass`	2021-10-26 10:46:36 +05:30
PixelyIon	92a21ea616	Cleanup & Use C++ Concepts in `utils.h`	2021-10-26 10:46:36 +05:30
PixelyIon	ea2626bcc6	Address CR Comments	2021-10-26 10:46:36 +05:30
PixelyIon	1d532628cb	Null-Check Optional NACP before extracting application title Not doing this can lead to the NACP being filled with invalid data and led to crashes on homebrew titles like SpaceNX.	2021-10-26 10:46:36 +05:30
PixelyIon	c8821c7313	Update `nvdrv` perms to 11.0.0+ & Implement `nvdrv:a` service `nvdrv:a` (For Applets) is used by some older homebrew such as SpaceNX which don't fall back to `nvdrv` (For Applications).	2021-10-26 10:46:36 +05:30
PixelyIon	3b4bbd2b38	Switch to using exceptions for guest exiting Guest-driven exiting could cause objects left on the heap due to a `std::longjmp` from high up in the host call stack, this has been fixed by introducing `ExitException` which implicitly unrolls the stack with the exception handling mechanism.	2021-10-26 10:46:36 +05:30
PixelyIon	eff5711c49	Split monolithic `common.h` into smaller chunks * Resolves dependency cycles in some components * Allows for easier navigation of certain components like `span` which were especially large * Some imports have been moved from `common.h` into their own files due to their infrequency	2021-10-26 10:46:36 +05:30
PixelyIon	1d57bab08f	Revamp `LicenseDialog` + Update Licenses + Stop Bintray Usage * Update licenses for dependent projects * Add copyright notices (as provided) * Revamp styling for `LicenseDialog` * Fix invisible `PreferenceDialog` buttons in Settings * Consolidating color variables into `colorPrimary`, `backgroundColor` and `backgroundColorVariant` * Use `com.google.android.flexbox:flexbox:3.0.0` (Google Maven) rather than `com.google.android:flexbox:2.0.1` (Bintray)	2021-10-26 10:46:36 +05:30
PixelyIon	bbf28d1942	Improve Clean Exit + Audio Pausing + Improve System Language Setting * Clean Exiting was improved by implementing a robust system for when to abandon clean exiting and simply restart the process alongside moving clean exiting to the background when the application is quit by using the back button * Audio is now automatically paused whenever the application is moved to the background and automatically resumed when it's brought to the foreground * The system language setting had several errors and inconsistencies which have now been fixed, it's been brought more in line with HOS language (Albeit not entirely due to no region setting in Skyline) * Fix a bug with `ThreadLocal` where the atomic `list` pointer was uninitialized resulting in a `SEGFAULT` during the destructor	2021-10-26 10:46:36 +05:30
PixelyIon	a7548c79a0	Android 12 Support + Update Libraries + Include Khronos Validation Layer * Fix handling `SA_EXPOSE_TAGBITS` bit being set in Android 12 `sigaction` * Fix CMake bug using `CMAKE_INTERPROCEDURAL_OPTIMIZATION_RELEASE` when not supported causing `-fuse-ld=gold` to be emitted as a linker flag * Support using `VIBRATOR_MANAGER_SERVICE` rather than `VIBRATOR_SERVICE` on Android 12 * Optimize Imports for Kotlin code * Move away from deprecated APIs in Kotlin or explicitly mark where it's not possible * Update SDK, NDK and libraries * Enable Gradle Configuration Cache	2021-10-26 10:46:36 +05:30
Billy Laws	b7d0f2fafa	Implement support for pushbuffer methods split across multiple GpEntries These are used heavily in OpenGL games, which now, together with the previous syncpoint changes, work perfectly. The actual implementation is rather novel as rather than using a per-class state machine for all methods we only use it for those that are known to be split across GpEntry boundaries, as a result only a single bounds check is added to the hot path of contiguous method execution and the performance loss is negligible.	2021-10-16 12:13:30 +01:00
Billy Laws	fc017e1e95	Implement pre-wait and post-increment syncpoint operations in submit These are used by both OpenGL and Vulkan games as opposed to including the operations inside the main commandbuffer.	2021-10-16 12:13:30 +01:00
PixelyIon	9b9bf8d300	Introduce `ThreadLocal` Class + Fix Several GPU Bugs * Fix `AddClearColorSubpass` bug where it would not generate a `VkCmdNextSubpass` when an attachment clear was utilized * Fix `AddSubpass` bug where the Depth Stencil texture would not be synced * Respect `VkCommandPool` external synchronization requirements by making it thread-local with a custom RAII wrapper * Fix linear RT width calculation as it's provided in terms of bytes rather than format units * Fix `AllocateStagingBuffer` bug where it would not supply `eTransferDst` as a usage flag * Fix `AllocateMappedImage` where `VkMemoryPropertyFlags` were not respected resulting in non-`eHostVisible` memory being utilized * Change feature requirement in `AndroidManifest.xml` to Vulkan 1.1 from OGL 3.1 as this was incorrect	2021-10-16 12:13:30 +01:00
Billy Laws	eb25f60033	Implement multichannel support for GPU Allows the execution of multiple channels at the same time, with locking being performed on the host GPU scheduler layer, address spaces can be bound to one or more channels.	2021-10-16 12:13:30 +01:00
PixelyIon	b762d1df23	Introduce Texture Always Sync + Wait on GPU Execution + More RT Formats Infrastructure for always syncing textures has been introduced now, they will be synced prior to and after every execution. This does considerably reduce the performance alongside waiting on GPU execution to finish but it will be partially recouped once conditional syncing is performed.	2021-10-16 12:13:30 +01:00
PixelyIon	f8acc1e131	Improve Shared Fonts + Fix AM `PopLaunchParameter` & Choreographer Bug * Move Shared Font TTFs to AAsset storage + Support external shared font loading from `/data/data/skyline.emu/data/fonts` * Fix bug in `IApplicationFunctions::PopLaunchParameter` caused by ignoring `LaunchParameterKind` * Fix bug with Choreographer causing it to be awoken and exit prior to the destruction of `PresentationEngine` * Fix bug with `IDirectory::Read` where it used `inputBuf` for the output buffer rather than `outputBuf` * Improve `GetFunctionStackTrace` logs when `dli_sname` or `dli_fname` are missing * Support more RT Formats	2021-10-16 12:13:30 +01:00
PixelyIon	95a08627e5	Subpass Support + More RT Formats + Fix `FenceCycle` Cyclic Dependencies Support for subpasses was added by reworking attachment reuse code to account for preserved attachments and subpass dependencies. A lot of RT formats were also added to allow SMO to boot up entirely, it should be noted that it doesn't render anything. `FenceCycle` had a cyclic dependency which broke clean exit, we now utilize `std::weak_ptr<FenceCycle>` inside the `Texture` object. A minor fix for broken stack traces was also made caused by supplying a `nullptr` C-string to libfmt when a symbol was unresolved which caused an `abort` due to invocation of `strlen` with it.	2021-10-16 12:13:30 +01:00
PixelyIon	239d2625e2	Introduce `CommandExecutor` + Implement `ClearBuffers` + More RT Formats This commit introduces the `CommandExecutor` which is responsible for creating and orchestrating a Vulkan command graph comprising of `CommandNode`s that construct all the objects required for rendering. As a result of the infrastructure provided by `CommandExecutor`, `ClearBuffers` could be implemented and be appropriately utilized. A bug regarding scissors was also determined and fixed in the PR, the extent of them were previously inaccurate and this has now been fixed. Note: We don't synchronize any textures from the guest for now as this would override the contents on the host, this'll be fixed with the appropriate write tracking but it also results in a black screen for anything that writes to FB	2021-10-05 01:13:22 +05:30
PixelyIon	3879d573d5	Fix Command Buffer Allocation & `FenceCycle` This commit fixes a major issue with command buffer allocation which would result in only being able to utilize a command buffer slot on the 2nd attempt to use it after it's freed, this would lead to a significantly larger amount of command buffers being created than necessary. It also fixes an issue with the command buffers not being reset after they were utilized which results in UB eventually. Another issue was fixed with `FenceCycle` where all dependencies are only destroyed on destruction of the `FenceCycle` itself rather than the function where the `VkFence` was found to be signalled.	2021-10-05 01:13:22 +05:30
PixelyIon	bee28aaf0d	Validation Layer Filter + Fix `Texture`, GPU & `PresentationEngine` bugs This commit implements a filter by type for any validation layer output, this allows filtering out any logs which may be unnecessary and additionally triggering a breakpoint as required. An issue concerning the `NDEBUG` flag never being set was fixed, it's now supplied as a release compiler flag. The issue can manifest itself by always relying on a validation layer even though it shouldn't on release, this is why the validation layer was mistakenly disabled entirely previously by using `#ifndef` rather than `#ifdef`. An issue with the initial layout of a texture being supplied as neither `VK_IMAGE_LAYOUT_UNDEFINED` or `VK_IMAGE_LAYOUT_PREINITIALIZED` was fixed, these cases are now handled by transitioning to those layouts after creating the image rather than supplying it within `initialLayout`. Another issue was fixed regarding not maintaining a transformation after a surface has been destroyed and recreated existed and manifested itself when the user would go out of the app and come back in, they would see the surface having an identity transformation rather than the desired one.	2021-10-05 01:13:22 +05:30
PixelyIon	54908afc44	Texture GMMU Address Resolution + Refactor `Maxwell3D::CallMethod` Fixes bugs with the Texture Manager lookup, fixes `RenderTarget` address extraction (`low`/`high` were flipped prior), refactors `Maxwell3D::CallMethod` to utilize a case for the register being modified + preventing redundant method calls when no new value is being written to the register, and fixes the behavior of shadow RAM which was broken previously and would lead to incorrect arguments being utilized for methods.	2021-10-05 01:13:22 +05:30
PixelyIon	270f2db1d2	Initial Texture Manager Implementation + Maxwell3D Render Target Implement the groundwork for the texture manager to be able to report basic overlaps and be extended to support more in the future. The Maxwell3D registers `RenderTargetControl`, `RenderTarget` and a stub for `ClearBuffers` were implemented. A lot of changes were also made to `GuestTexture`/`Texture` for supporting mipmapping and multiple array layers alongside significant architectural changes to `GuestTexture` effectively disconnecting it from `Texture` with it no longer being a parent rather an object that can be used to create a `Texture` object. Note: Support for fragmented CPU mappings hasn't been added for texture synchronization yet	2021-10-05 01:13:22 +05:30
PixelyIon	8cba1edf6d	Introduce Boost as a submodule + Minor Fixes Utilize Boost Container's `small_vector` for optimizing allocations, fix certain implicit casting issues and make `ILogger` not output an additional newline in the log when the application supplies one at the end of the log	2021-10-05 01:13:22 +05:30
PixelyIon	190fde110f	Introduce `GraphicContext` and Implement Viewport Transform + Scissors	2021-10-05 01:13:22 +05:30
PixelyIon	bc378ad135	Fix `GraphicBufferProducer` Format Bug The format provided in `GraphicBuffer` can be misleading and is supplied as `None` by the Deko3D Swapchain, it instead supplies the real format in the `NvGraphicHandle` which we now utilize instead of the one in `GraphicBuffer`.	2021-10-05 01:13:22 +05:30
PixelyIon	18cf0b82d6	Fix KTransferMemory Destruction Behavior + Add Missing SVC Handle Logs KTransferMemory needs to be reset to RW on destruction unlike KSharedMemory which is simply freed on destruction, not emulating this behavior accurately leads to `deko_examples` from `switch-examples` to lead to a SEGFAULT on selecting an example as it expects the memory to be R/W while it ends up being freed instead	2021-10-05 01:13:22 +05:30
Billy Laws	192459f726	Adjust AS block end predecessors to successors after reuse as end	2021-10-04 20:29:02 +01:00
PixelyIon	d45193874e	Fix NvDrv `CloseDevice` Bug The unique pointer to a device in the map was simply reset rather than deleting the entry from the map, this resulted in the device not being properly closed and when the device was reopened then the `emplace` was a NOP as the entry already existed. This resulted in a `nullptr` dereference down the line when an application attempted to issue an IOCTL to a device that was previously closed and reopened. This is known to occur in Deko3D as it recreates the context when loading an example which includes closing and reopening devices.	2021-10-03 18:02:32 +05:30
Billy Laws	7a5ca19c0b	Fix improper VMM error check in unmap	2021-10-01 20:58:22 +01:00
Billy Laws	4eff515c55	Add support for IOCTL3 CtrlGpu calls missed in refactor	2021-10-01 20:57:59 +01:00
PixelyIon	02bcf98b8c	Fix NvDrv Bugs + NACP Parsing Bug A hotfix for a nvdrv bug where an IOCTL with no input/output buffers would crash and a major `nvmap` bug which broke the `FromId` IOCTL as it didn't write back the handle ID, another minor bug existed in `nvhost` where the `ZCullGetInfo` IOCTL was `INOUT` rather than `OUT`. A bug with NACPs was also fixed caused by incorrect padding for the `NacpData` structure which resulted in the `saveDataOwnerId` member being read incorrectly and as a result the save data folder being incorrect. Co-authored-by: artem8086 <artemsvyatoha@gmail.com>	2021-10-02 00:15:53 +05:30
lynxnb	afc3dda1ee	Reintroduce `LanguageCode` alias type	2021-09-30 02:19:19 +05:30
lynxnb	c6d91c552b	Address CR	2021-09-30 02:19:19 +05:30
lynxnb	0077833667	Support game icon selection based on language	2021-09-30 02:19:19 +05:30
lynxnb	01ce09183b	Rework common languages Swapped out the usages of frozen constexpr unordered maps for switch statements, which are very likely to be turned into jump tables given the nature of the enums used, resulting in better performance than a map	2021-09-30 02:19:19 +05:30
sspacelynx	4d20d7a4d0	Address CR + Code formatting	2021-09-30 02:19:19 +05:30
sspacelynx	ca97517d81	Rework `GetDesiredLanguage` + Remove `LanguageCode` alias type	2021-09-30 02:19:19 +05:30
sspacelynx	6135f531ae	Add `refreshRequired` xml attribute in `IntegerListPreference` This attribute controls whether a rom list refresh will be triggered when the setting value changes.	2021-09-30 02:19:19 +05:30
sspacelynx	221039084e	Make languages setting use `IntegerListPreference`	2021-09-30 02:19:19 +05:30
sspacelynx	c1a3ddcd1c	Implement an `IntegerListPreference` Unlike `ListPreference`, this preference class uses integers as its values instead of strings, avoiding unnecessary casting. It also doesn't require an array for values: in that case it will be using the clicked entry position as its value.	2021-09-30 02:19:19 +05:30
sspacelynx	fdc7e1238a	Implement language support for rom loaders	2021-09-30 02:19:19 +05:30
sspacelynx	3c5509e52a	Update available languages to those post 10.1.0 Brazilian Portuguese has been added in firmware version 10.1.0	2021-09-30 02:19:19 +05:30
sspacelynx	3fffde3061	Rework OS to allow passing a language	2021-09-30 02:19:19 +05:30
Billy Laws	a4a6511177	Address feedback	2021-09-30 02:11:19 +05:30
Billy Laws	a18f1aa889	Address feedback	2021-09-30 02:11:19 +05:30
Billy Laws	8db2cf29f2	Implement AsGpu::FreeSpace and clean up locking and debug prints	2021-09-30 02:11:19 +05:30
Billy Laws	d3dc0bddc4	Cleanup IStorageAccessor validation code	2021-09-30 02:11:19 +05:30
Billy Laws	39faa739b9	Optimise GPFIFO command processing for higher throughput Using a u32 for the loop index prevents masking on all increments, giving a moderate performance increase. Passing methods as u32 parameters and stopping subChannel being passed gives quite a significant increase when combined with the inlining allowed by subchannel based engine selection.	2021-09-30 02:11:19 +05:30
Billy Laws	3d538a29da	Misc code cleanup and more comments in AS map	2021-09-30 02:11:19 +05:30
Billy Laws	30d41252f7	Use IPC raw data size to calculate C Buffer offset Some games don't have a u16 size array, the code for this is also kinda pointless when we can use the raw data size too.	2021-09-30 02:11:19 +05:30
Billy Laws	d03b288db6	NEEDS CLEANUP: Reimplement GPU VMM and rewrite nvdrv VM impl	2021-09-30 02:11:19 +05:30
Billy Laws	020aa0e43a	Add support for Ioctl2/3 and improve debug logging	2021-09-30 02:11:19 +05:30
Billy Laws	9bda574a4e	Move nvhost-ctrl-gpu to new device API	2021-09-30 02:11:19 +05:30
Billy Laws	39492d9365	Move nvhost-gpu to new device API The implementation of GPU channels and Host1X channels will be split up as it allows a much cleaner implementation and less undefined behaviour potential.	2021-09-30 02:11:19 +05:30
Billy Laws	3d4f7d9b44	Move nvhost-as-gpu to new device API	2021-09-30 02:11:19 +05:30
Billy Laws	1149158198	Make GraphicBufferProducer use the new nvmap API	2021-09-30 02:11:19 +05:30
Billy Laws	c97f5a9315	Entirely rewrite nvmap to separate global and per device state. This will be required later for NVDEC/SMMU support and fixes many significant issues in the previous implementation. Based off of my 2.0.0/12.0.0 nvdrv REs.	2021-09-30 02:11:19 +05:30
Billy Laws	78356fa789	Migrate syncpoint management over to the new device API The syncpoint manager has beeen given convinience functions for fences which remove the need to access the raw id/threshold most of the time and various accuracy fixes and cleanups to match HOS 12.0.0 have also been done.	2021-09-30 02:11:19 +05:30
Billy Laws	a19bf973b1	Rework the nvdrv core for accuracy and support new deserialisation Devices will need to be moved over in order to support this new interface, IOCTL2/3 support will be added in a later commit.	2021-09-30 02:11:19 +05:30
Billy Laws	db2abe2290	Use concepts for Backing::Read over enable_if	2021-09-30 02:11:19 +05:30
Billy Laws	3340c74332	Implement efficient template based IOCTL deserialisation This moves IOCTLs from frozen to a template based parser, this allows IOCTL lookup to be performed much more optimally and some IOCTLs to be inlined.	2021-09-30 02:11:19 +05:30
Billy Laws	75a67dcfa5	Allow supplying a custom ResultValue result type and optimise span This allows PosixResultValue to be created easily.	2021-09-30 02:11:19 +05:30
Billy Laws	3d3c13f90c	Add common PosixResult and PosixResultValue types Many services like bsd and nvdrv use these to represent the result values of functions so add a common type for these as an alternative to the macros.	2021-09-30 02:11:19 +05:30
sspacelynx	14dc42b991	Add `IntegerFor` template class + Update `GenerateUuidV4`	2021-09-22 06:24:05 +05:30
sspacelynx	ea3c7301b1	Make `UUID::GenerateUuidV4` use `util::GetRandomBytes`	2021-09-22 06:24:05 +05:30
sspacelynx	10d43c88c9	csrng service implementation Implements the 'csrng:' service using C++ <random>'s Mersenne Twister, this does make it insecure for cryptographic purposes but it is pointless to attempt to do this regardless as we cannot ensure that the guest will run in a secure environment which cannot be mutated by an attacker. Used by Prison Princess, Pokemon Cafe Mix, Paint your Pet and more.	2021-09-22 06:24:05 +05:30
sspacelynx	06674b3d07	hwopus decoder service implementation	2021-07-19 02:07:12 +05:30
PixelyIon	b533801bbe	Fix Scheduler Core Migration Deadlock Encountered in 不如帰大乱 when `HOS-3` is awoken at the same time as `HOS-0` called `SvcSetThreadCoreMask` resulting in a deadlock where `HOS-0` owns `HOS-3`'s `coreMigrationMutex` while `HOS-3` owns the core mutex with the both of them attempting to lock the other mutex	2021-07-12 21:27:49 +05:30
PixelyIon	dba99ce542	Implement `IApplicationDisplayService::ConvertScalingMode` Utilized by アイリス魔法学園	2021-07-12 21:27:49 +05:30
PixelyIon	1ad5ea6867	Move Mbed TLS to submodule + Update Java Libraries We've moved from using an AAR for Mbed TLS to a submodule as the AAR was packaged manually and used from a local repository which ended up being very hacky and resulted in Linter errors, it could also not be updated with ease as it would need to be repackaged. All of these issues have been solved by moving to a git submodule tied to the official Mbed TLS GitHub repository.	2021-07-12 21:27:49 +05:30
PixelyIon	ec1da1b3f5	Address CR Comments + Increase `minSdkVersion` to 29 (Android 10)	2021-07-12 21:27:49 +05:30
PixelyIon	19be9f4b66	Support Sticky Transforms + Minor Formatting Fixes Sticky transforms have been stubbed, as they are on HOS/Android. Certain titles like Xenoblade Chronicles end up setting the sticky transform even if it doesn't do anything, as a result of this we cannot throw an exception for it and stub it without an exception (Aside from the cases where the value isn't recognized).	2021-07-12 21:27:49 +05:30
PixelyIon	9f967862bd	Implement additional IGraphicBufferProducer transactions The following GraphicBufferProducer transactions were implemented: * `SetBufferCount` * `DetachBuffer` * `DetachNextBuffer` * `AttachBuffer` It should be noted that `preallocatedBufferCount` (previously `hasBufferCount`) and `activeSlotCount` were adapted accordingly with how they were effectively the same value as all active buffers were preallocated prior but now there can be a non-preallocated active slot. Additionally, a bug has been fixed where `SetPreallocatedBuffer` has the graphic buffer as an optional argument whereas it was treated as a mandatory argument prior and could lead to a SEGFAULT if an application were to not pass in a buffer.	2021-07-12 21:27:49 +05:30
PixelyIon	0d6d90c4cd	Implement SvcUnmapSharedMemory + Remove Redundant [[unlikely]]	2021-07-12 21:27:49 +05:30
PixelyIon	f1433ad0d9	Rework VI + IHOSBinder VI/IHOSBinder suffered from major inaccuracies in their function due to being quickly thrown together initially with little concern for accuracy, this has now been fixed with them being substantially more accurate now.	2021-07-12 21:27:49 +05:30
PixelyIon	ca4744c603	Improve `ENUM_STRING` + Fix Host1X Syncpoints `ENUM_STRING` now has a unified implementation in <common/macros.h> with a documented format and can be used throughout the codebase. A major performance regression was added in the Host1X Syncpoint revamp as it did a syscall if there were any waiters during `Increment` even if they would just be woken up and go back to sleep as the threshold wasn't hit. It has now been optimized to only do a wake if any waiting thread needs to be awoken. There was also a bug concerning increment where it would perform actions corresponding to the previous increment rather than the current one which has also been fixed.	2021-07-12 21:27:49 +05:30
PixelyIon	6595670a5d	Rework Performance Statistics We used instantaneous values for FPS previously which led to a lot of variation in it and the inability to determine a proper FPS value due to constant fluctuations. All FPS values are now averaged to allow reading out a stable value and a deviation statistic has been added for the frame-time to judge judder and frame-pacing which allows for a significantly better measure of overall performance. The formatting for all the floating-point numbers is now fixed-point to prevent shifting of position due to decimal digits becoming 0.	2021-07-12 21:27:49 +05:30
PixelyIon	ce804254ab	Support Swap Interval + Cropping + Transforms and more for QueueBuffer Support for the following parameters was added to `QueueBuffer`: * Earliest Present Timestamp * Swap Interval * Crop * Scaling Mode * Transform * Frame ID (Not returned to guest yet) We utilize ANativeWindow APIs directly to achieve all of this in an efficient manner since HWC will be used directly for it, we do plan to introduce Vulkan equivalents for all of these operations later down the line for a port to non-Android platforms.	2021-07-12 21:27:49 +05:30
PixelyIon	2e8ce2cbaf	Update C++ Libraries	2021-07-12 21:27:49 +05:30
PixelyIon	6854e07356	Replace LogActivity with "Share Logs" + Add Timestamp to Logger	2021-07-12 21:27:49 +05:30
PixelyIon	e511e158e3	Implement Display Settings + Attach Texture to FenceCycle	2021-07-12 21:27:49 +05:30
PixelyIon	b2bb855a02	Revamp Host1X Syncpoint + Address Review Comments	2021-07-12 21:27:49 +05:30
PixelyIon	216e5cee81	Separate Guest and Host Presentation + AChoreographer V-Sync Event We had issues when combining host and guest presentation since certain configurations in guest presentation such as double buffering were very unoptimal for the host and would significantly affect the FPS. As a result of this, we've now made host presentation have its own presentation textures which are copied into from the guest at presentation time, allowing us to change parameters of the host presentation independently of the guest. We've implemented the infrastructure for this which includes being able to create images from host GPU memory using VMA, an optimized linear texture sync and a method to do on-GPU texture-to-texture copies. We've also moved to driving the V-Sync event using AChoreographer on its on thread in this PR, which more accurately encapsulates HOS behavior and allows games such as ARMS to boot as they depend on the V-Sync event being signalled even when the game isn't presenting.	2021-07-12 21:27:49 +05:30
PixelyIon	36547cd5dc	Redesign Texture Class + Improve Presentation Engine This commit reworks the `Texture` class to include a Vulkan Image backing that can be optionally owning or non-owning and swapped in with consideration for Vulkan image layout, it also adds CPU-sided synchronization for the texture objects with FenceCycle. It also makes the appropriate changes to `PresentationEngine` and `GraphicBufferProducer` to work with the new `Texture` class while setting the groundwork for supporting swapchain recreation. It also fixes a log in `IpcResponse` and improves the display mode selection algorithm by further weighing refresh rate.	2021-07-12 21:27:49 +05:30
PixelyIon	b2132fd7aa	Implement Fence Cycle, Memory Manager and Command Scheduler Implements a wrapper over fences to track a single cycle of activation, implement a Vulkan memory manager that wraps the Vulkan-Memory-Allocator library and a command scheduler for scheduling Vulkan command buffers	2021-07-12 21:27:49 +05:30
PixelyIon	d8025e7178	Rework GraphicBufferProducer This commit makes GraphicBufferProducer significantly more accurate by matching the behavior of AOSP alongside mirroring the tweaks made by Nintendo. It eliminates a lot of the magic structures and enumerations used prior and replaces them with the correct values from AOSP or HOS. There was a lot of functional inaccuracy as well which was fixed, we emulate the exact subset of HOS behavior that we need to. A lot of the intermediate layers such as GraphicBufferConsumer or Gralloc/Sync are not emulated as they're pointless abstractions here.	2021-07-12 21:27:49 +05:30
PixelyIon	795472004e	Presentation Engine Vulkan Surface + Swapchain Initialization This commit adds in VkSurface/VkSwapchain initialization and recreation. It also adapts GraphicsBuffferProducer and Texture to fit in with those changes but it doesn't yet implement presenting those buffers nor uploading guest buffers onto the host.	2021-07-12 21:27:49 +05:30

... 10 11 12 13 14 ...

1577 Commits