skyline

mirror of https://github.com/skyline-emu/skyline.git synced 2024-12-27 11:01:50 +01:00

Author	SHA1	Message	Date
PixelyIon	aeea3e6f66	Allow manual allocation of `ActiveCommandBuffer` `CommandScheduler` API users can now directly allocate an active command buffer that they need to manage alongside its fence, this can allow for more efficient recording as it doesn't need to be immediately submitted after, it can also allow attaching objects to a `FenceCycle` prior to submission that can be useful for locking resources.	2022-04-14 14:14:52 +05:30
PixelyIon	8989305637	Implement Host Vertex Buffer Translation Uses the buffer cache to retrieve an equivalent host vertex buffer for a corresponding guest vertex buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	b6ba770a27	Implement Maxwell3D Shader Compilation Compiles shaders supplied by the guest with caching and automatic invalidation, the size of the shader is also automatically determined by looking for `BRA $` instructions which cause an infloop, it should be noted that we have a maximum shader bytecode size, any shader above this size will not be supported.	2022-04-14 14:14:52 +05:30
PixelyIon	08afda6ac4	Implement Graphics Shader Compilation in `ShaderManager` Graphics shaders can now be compiled using the shader compiler and emit SPIR-V that can be used on the host. The binding state isn't currently handled alongside constant buffers and textures support in `GraphicsEnvironment` yet.	2022-04-14 14:14:52 +05:30
PixelyIon	353ca8ec84	Fix Viewport X/Y Translation The operands of the subtraction in the X/Y translation calculation were the wrong way around which led to negative translations that would translate the viewport off the screen.	2022-04-14 14:14:52 +05:30
PixelyIon	f06a12170f	Set Default Color Write Mask to RGBA The default color write mask should mask no channels and write all of them and should be mutated to mask out certain channels as required by the guest.	2022-04-14 14:14:52 +05:30
PixelyIon	23faf1370c	Use Static Arrays for Vertex Buffer Bindings & Attributes We cannot statically construct the vertex buffer/attribute arrays for Vulkan due to inactive attributes or buffers which isn't possible on Vulkan, we also cannot just change the count dynamically as there might be disabled buffers or attributes in the middle. We just have a `static_array` which should dynamically be filled in with buffer binding/attribute Vulkan structures before submission.	2022-04-14 14:14:52 +05:30
PixelyIon	8652edb07b	Make `GuestBuffer` format-less Buffers generally don't have formats that are fundamentally associated with them unless they're texel buffers, if that is the case it can be manually set in `BufferView`.	2022-04-14 14:14:52 +05:30
PixelyIon	03314ec7d2	Introduce `BufferManager` The Buffer Manager handles mapping of guest buffers to host buffer views with automatic handling of sub-buffers and eventually supporting recreation of overlapping buffers to create a single larger buffer.	2022-04-14 14:14:52 +05:30
PixelyIon	bde61d72cc	Introduce `Buffer` and `BufferView` Implements infrastructure for using guest buffers on the host for rendering, a `BufferManager` is still missing which'd handle mapping from guest buffers to host buffers and will be subsequently committed. It should be noted that `BufferView` is also disconnected from `Buffer` and shared for every instance with the same properties like `TextureView` is now.	2022-04-14 14:14:52 +05:30
PixelyIon	6eda1777c5	Rework `TextureView` to be disconnected from `Texture` We want `TextureView`(s) to be disconnected from the backing on the host and instead represent a specific texture on the guest with a backing that can change depending on mapping of new textures which'd invalidate the backing but should now be automatically repointed to an appropriate new backing. This approach also requires locking of the backing to function as it is mutable till it has been locked or the backing has an attached `FenceCycle` that hasn't been signaled which will be added for `CommandExecutor` in a subsequent commit.	2022-04-14 14:14:52 +05:30
PixelyIon	82916657fb	Only Enable Shader Compiler Debug Mode in Debug Builds Sets properties that relate to debugging in `Shader::Settings` to `true` only for debug builds while leaving them disabled for release builds.	2022-04-14 14:14:52 +05:30
PixelyIon	b09f28c0ba	Implement Missing Shader Compiler Quirks Introduces the `supportsShaderViewportIndexLayer` quirk and sets `Shader::Profile::support_int64_atomics` depending on if the `supportsAtomicInt64` quirk is available.	2022-04-14 14:14:52 +05:30
PixelyIon	f3e81094a2	Implement Shader Compiler Property Quirks Introduces the `floatControls`, `supportsSubgroupVote` and `subgroupSize` quirks for the shader compiler which are based on Vulkan `PhysicalDevice` properties.	2022-04-14 14:14:52 +05:30
PixelyIon	51c4df24b5	Switch from `VK_VERSION_` to `VK_API_VERSION_` macros Vulkan has officially deprecated `VK_VERSION_*` macros for versioning as it has introduced the variant into the version. It should however be `0` for the Vulkan APi and doesn't need to be printed.	2022-04-14 14:14:52 +05:30
PixelyIon	0588a525b4	Implement Shader Compiler Extension/Feature Quirks Introduces several quirks for optional features used by the shader compiler which are now reported in the `Shader::HostTranslateInfo` and `Shader::Profile` structure. There are still property-related quirks for the shader compiler which haven't been implemented in this commit.	2022-04-14 14:14:52 +05:30
PixelyIon	8f3887c56a	Create `memory::Buffer` & Implement `StagingBuffer` as derivative A `Buffer` class was created to hold any generic Vulkan buffer object with `span` semantics, `StagingBuffer` was implemented atop it as a wrapper for `Buffer` that inherits from `FenceCycleDependency` and can be used as such.	2022-04-14 14:14:52 +05:30
PixelyIon	a55aca76c6	Rename `TextureView::backing` to `TextureView::texture` It was determined that `backing` wasn't a very descriptive name and that it conflicted with the texture's own backing, the name was changed to `texture` to make it more apparent that it was specifically the `Texture` object backing the view.	2022-04-14 14:14:52 +05:30
PixelyIon	482c573b81	Introduce `FlatMemoryManager::ReadTill` for scanning semantics A memory manager function to read into a vector till it satisfies the supplied function or hits an early stop condition like hitting the end of vector or reaching an unmapped region. This can be used to efficiently scan for values in GPU VA.	2022-04-14 14:14:52 +05:30
PixelyIon	31c4f1ca4e	Unlink `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when disabled When `VK_EXT_vertex_attribute_divisor` is not available, `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` is unlinked from the device enabled feature list as it is undefined behavior to link a structure provided by an extension without enabling that extension.	2022-04-14 14:14:52 +05:30
PixelyIon	7df2670ece	Fix `QuirkManager`'s `EXT_SET_V` macro bug `EXT_SET_V` would enable the extension regardless of if it was actually the correct extension or if the version was high enough as long as the hash matched. Co-authored-by: Billy Laws <blaws05@gmail.com>	2022-04-14 14:14:52 +05:30
PixelyIon	e9ed771b48	Check for `supportsMultipleViewports` feature before usage If the host only supports a single viewport then we set `viewportCount` and `scissorCount` in `VkPipelineViewportStateCreateInfo` to 1.	2022-04-14 14:14:52 +05:30
PixelyIon	3e45006d14	Make `shaderImageGatherExtended` a required `VkDevice` feature `shaderImageGatherExtended` is required by the shader compiler, to avoid complications associated with making it optional and considering that it's supported by the vast majority of Vulkan mobile devices, it was made a mandatory feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ece2785582	Introduce `ShaderManager` with Proxy Shader Compiler Logger/Settings This class will be entirely responsible for any interop with the shader compiler, it is also responsible for caching and compilation of shaders in itself.	2022-04-14 14:14:52 +05:30
PixelyIon	89e9a41a86	Implement `VkPipelineViewportStateCreateInfo` "Viewport Transforms" and "Viewport Scissors" were combined into one section to reflect their state in Vulkan correctly like all other sections.	2022-04-14 14:14:52 +05:30
PixelyIon	38119e21d4	Implement Vulkan-Supported Maxwell3D Primitive Topologies Any primitive topologies that are directly supported by Vulkan were implemented but the rest were not and will be implemented with conversions as they are used by applications, they are: * LineLoop * QuadList * QuadStrip * Polygon	2022-04-14 14:14:52 +05:30
PixelyIon	138f884159	Implement Maxwell3D Vertex Attributes Translates all Maxwell3D vertex attributes to Vulkan with the exception of `isConstant` which causes the vertex attribute to return a constant value `(0,0,0,X)` which was trivial in OpenGL with `glDisableVertexAttribArray` and `glVertexAttrib4(..., 0, 0, 0, 1)` but we don't have access to this in Vulkan and might need to depend on undefined behavior or manually emulate it in a shader. This'll be revisited in the future after checking host GPU behavior.	2022-04-14 14:14:52 +05:30
PixelyIon	4b9f99bb27	Make `ENUM_STRING` function `static` `ENUM_STRING` can be used inside a `class`/`struct`/`union` for `enum`s contained within them. Making the function `static` allows doing this and doesn't require supplying a `this` pointer of the enclosing class for usage.	2022-04-14 14:14:52 +05:30
PixelyIon	c2a6da6431	Implement Maxwell3D Vertex Buffer Limit Sets the end of VBOs based on the `vertexArrayLimits` register array which provides an IOVA to the end of the VBO.	2022-04-14 14:14:52 +05:30
PixelyIon	d8890f13e1	Explicitly make `default` case `break` for `Maxwell3D::HandleMethod` This being made implicit removes any confusion that all cases would need to be implemented and explicitly define that the CF should continue onto the 2nd switch-case when it cannot find any matches in the first one.	2022-04-14 14:14:52 +05:30
PixelyIon	612f324e78	Implement Maxwell3D Vertex Buffer Instance Rate Implements the `isVertexInputRatePerInstance` register array which controls if the vertex input rate is either per-vertex or per-instance. This works in conjunction with the vertex attribute divisor for per-instance attribute repetition of attributes.	2022-04-14 14:14:52 +05:30
PixelyIon	476c070c7a	Fix Minor Maxwell3D Register Ordering Issues We order all registers in ascending order, a few registers namely `colorLogicOp`, `colorWriteMask`, `clearBuffers` and `depthBiasClamp` were erroneously not following this order which has now been fixed.	2022-04-14 14:14:52 +05:30
PixelyIon	32de7e5150	Use `decltype` over `typeof` globally We inconsistently utilized `typeof` and `decltype` all over the codebase, this has now been fixed by uniformly using `decltype` as `typeof` is a GCC extension and not in the C++ standard alongside having the hidden side effect of removing references from the determined type.	2022-04-14 14:14:52 +05:30
PixelyIon	841ee9fc15	Check for `vertexAttributeInstanceRateZeroDivisor` feature before usage Check for `vertexAttributeInstanceRateZeroDivisor` in `VkPhysicalDeviceVertexAttributeDivisorFeaturesEXT` when the Maxwell3D register corresponding to the vertex attribute divisor is set to 0. If it isn't then it logs a warning and sets the value anyway which could result in UB since the only alternative is an exception that stops emulation which might not be optimal if the game mostly works fine without this, we will add a user-facing warning when we intentionally allow UB like this in the future.	2022-04-14 14:14:52 +05:30
PixelyIon	c3895a8197	Support `VkPhysicalDeviceFeatures2` Extensions Implement the infrastructure to depend on `VkPhysicalDeviceFeatures2` extended feature structures which can be utilized to retrieve the specifics of features from extensions. It is implemented in the form of `vk::StructureChain` with `vk::PhysicalDeviceFeatures2` that can be extended with any extension feature structures.	2022-04-14 14:14:52 +05:30
PixelyIon	ff5515d4d1	Implement Maxwell3D Vertex Buffer Bindings This implements everything in Maxwell3D vertex buffer bindings including vertex attribute divisors which require the extension `VK_EXT_vertex_attribute_divisor` to emulate them correctly, this has been implemented in the form of of a quirk. It is dynamically enabled/disabled based on if the host GPU supports it and a warning is provided when it is used by the guest but the host GPU doesn't support it.	2022-04-14 14:14:52 +05:30
PixelyIon	d163e4ffa6	Introduce `IOVA` union for flipped words of IOVAs in `GraphicsContext` The Maxwell3D `Address` class follows the big-endian register ordering for addresses while on the host we consume them in little-endian, the `IOVA` class is the host equivalent to the `Address` class with implicitly flipped 32-bit register ordering. It shares implicit decomposition semantics from `Address` due to similar requirements with a minor difference of being returned by reference rather than value as we want to have value setting semantics with implicit decomposition while we don't for `Address`.	2022-04-14 14:14:52 +05:30
PixelyIon	73646c4da8	Implicitly decompose `Address` into `u64` The semantics of implicitly decomposing the `Address` class into a `u64` were determined to be appropriate for the class. As it is an integer type this effectively retains all semantics from using an integer directly for the most part.	2022-04-14 14:14:52 +05:30
PixelyIon	48d0b41f16	Implement Maxwell3D Common/Independent Color Write Mask Maxwell3D supports both independent and common color write masks like color blending but for common color write masks rather than having register state specifically for it, the state from RT 0 is extended to all RTs. It should be noted that color write masks are included in blending state for Vulkan while being entirely independent from each other for Maxwell, it forces us to use the `independentBlend` feature even when we are doing common blending unless the color write mask is common as well but to simplify all this logic the feature was made required as it supported by effectively all targeted devices.	2022-04-14 14:14:52 +05:30
PixelyIon	92809f8a78	Implement Maxwell3D Independent/Common Color Blending Maxwell3D supports independent blending which has different blending per-RT and common blending which has the same blending for all RTs. There is a register determining which mode to utilize and we simply have two arrays of `VkPipelineColorBlendAttachmentState` for the RTs that we toggle between to make the transition between them extremely cheap.	2022-04-14 14:14:52 +05:30
PixelyIon	2ceb6465e8	Make `independentBlend` a required `VkDevice` feature Independent blending is supported by effectively every Vulkan 1.1 Android GPU, it gives us the ability to architecture Maxwell3D blending emulation better as we can avoid additional checks for independent blending state and having a fallback path for when the host doesn't support the feature.	2022-04-14 14:14:52 +05:30
PixelyIon	cd737fbdd8	Add Required `VkDevice` Features A prior commit added the ability to utilize features with quirks but this implements the ability to require a feature be present on the host or an exception will be thrown. It allows us to make useful assumptions that result in a better architecture in certain cases.	2022-04-14 14:14:52 +05:30
PixelyIon	081d3277c1	Enable Quirk + Required `VkDevice` Extensions Implements the infrastructure required to enable optional extensions set in `QuirkManager` alongside the required extensions in the `GPU` class. All extensions should be correctly resolved now and according to what the device supports.	2022-04-14 14:14:52 +05:30
PixelyIon	6099b1ead5	Fix Maxwell3D Register `lineWidthAliased` Offset The offset was incorrectly set to `0x4D` rather than `0x4ED` which is what it should be. This would've led to bugs in line width determination and likely broken any aliased line rendering entirely.	2022-04-14 14:14:52 +05:30
PixelyIon	51659e1329	Enable `VkDevice` Features Selectively We selectively enable GPU features that we require as enabling all of them might result in extra driver overhead in certain circumstances. Setting them is handled by `QuirkManager` with the new `FEAT_SET` function that ties a quirk with a feature.	2022-04-14 14:14:52 +05:30
PixelyIon	ec378814aa	Stub Maxwell3D Alpha Testing We stub alpha testing as it doesn't exist in Vulkan and few titles use it, it can be emulated in the future using a shader patch with manually discarding fragments failing the alpha test function but this'll be added in later as it isn't high priority at the moment and has associated overhead with it so other options might be explored at the time.	2022-04-14 14:14:52 +05:30
PixelyIon	83ec99fa48	Print GPU Quirks At Startup It is essential to know what quirks a certain GPU may have to debug an issue, these are now printed at startup into the log alongside all other GPU information. A new `QuirkManager::Summary` function was implemented to provide this functionality.	2022-04-14 14:14:52 +05:30
PixelyIon	01eb16e59a	Implement Maxwell3D Color Logic Operations Implements a basic part of Vulkan blending state which are color logic operations applied on the framebuffer after running the fragment shader. It is an optional feature in Vulkan and not supported on any mobile GPU vendor aside from ImgTec/NVIDIA by default.	2022-04-14 14:14:52 +05:30
PixelyIon	662935c35d	Attempt Flushing `Logger` During Fatal Signals Any signals that lead to exception handling being triggered now attempt to flush all logs given that the log mutex is unoccupied, this is to mostly help logs be more complete when exiting isn't graceful.	2022-04-14 14:14:52 +05:30
PixelyIon	586bee2c59	Remove Maxwell3D Zero Initialization Calls A lot of calls in Maxwell3D register initialization ended up setting the register to 0 which should be implicit behavior and most calls would be eliminated by the redundancy check which had to be manually disabled. It was determined to be better to move this responsibility to the called function to initialize to state equivalent to the corresponding register being 0. All initialization calls with the argument as 0 have been removed now due to this, it was the vast majority of calls.	2022-04-14 14:14:52 +05:30
PixelyIon	49cc0964e2	Initialize Maxwell3D Registers Correctly Maxwell3D Registers weren't initialized to the correct values prior, this commit fixes that by doing `HandleMethod` calls with all the register values being initialized. This is in contrast to the registers being set without calling the methods in `GraphicsContext` or otherwise resulting in bugs.	2022-04-14 14:14:52 +05:30
PixelyIon	ea87b8878c	Implement `B8G8R8A8{Unorm/Srgb}` RT Format Needed by Maxwell3D Register Initialization	2022-04-14 14:14:52 +05:30
PixelyIon	69e7d8b574	Remove Vulkan to Skyline Format Conversion Function The function `GetFormat` was seemingly no longer required due to us never converting from a Vulkan format to a Skyline format, most conversions only went from Skyline to Vulkan and were generally lossy due to certain formats being missing in Vulkan and approximated using channel swizzles. As a result of this, it was pointless to maintain and has now been removed.	2022-04-14 14:14:52 +05:30
PixelyIon	5ed26fef23	Implement Maxwell3D Rasterizer State Maxwell3D registers relevant to the Vulkan Rasterizer state have been implemented aside from certain features such as per-face polygon modes that cannot be implemented due to Vulkan limitations. A quirk was utilized to dynamically support the provoking vertex being the last vertex as opposed to the first as well.	2022-04-14 14:14:52 +05:30
PixelyIon	8ef225a37d	Introduce `QuirkManager` for runtime GPU quirk tracking We require a way to track certain host GPU features that are optional such as Vulkan extensions, this is what the `QuirkManager` class does as it checks for all quirks and holds them allowing other components to branch based off these quirks.	2022-04-14 14:14:52 +05:30
PixelyIon	8803616673	Reorder `GraphicsContext` Members All members are now placed at the start of sections they are relevant to rather than at the start of the class.	2022-04-14 14:14:52 +05:30
PixelyIon	107d337d77	Fix `MacroInterpreter::MethodAddress` Bitfield Padding Due to compiler alignment issues, the bitfield member `increment` of `MacroInterpreter::MethodAddress` was mistakenly padded and moved to the next byte. This has now been fixed by making its type `u16` like the member prior to it to prevent natural alignment from kicking in.	2022-04-14 14:14:52 +05:30
PixelyIon	26966287c7	Implement Maxwell3D Shader Program Registers This commit added basic shader program registers, they simply track the address a shader is pointed to at the moment. No parsing of the shader program is done within them.	2022-04-14 14:14:52 +05:30
PixelyIon	93ea919c8f	Fix warnings from `NVRESULT` due to unused lambda capture A previously used `this` capture is no longer used since the introduction of the static logger.	2022-04-14 14:14:52 +05:30
lynxnb	092dcb18c8	Stub `ectx:w` and `ectx:aw` Glue services	2022-02-06 21:57:38 +05:30
lynxnb	6913a90361	Use the new log file name & ext for every logger context	2021-11-11 16:32:19 +01:00
lynxnb	5cd1f01690	Refactor all logger calls	2021-11-11 16:13:24 +01:00
lynxnb	769e6c933d	Make `Logger` class static and introduce `LoggerContext` A thread local LoggerContext is now used to hold the output file stream instead of the `Logger` class. Before doing any logging operations, a LoggerContext must be initialized. This commit will not build successfully on purpose.	2021-11-11 16:13:24 +01:00
Billy Laws	d88b08d986	Address PR feedback	2021-11-10 21:35:36 +05:30
Billy Laws	1b453c04ca	Remove completed nvmap TODO Pins have been implemented so the to-do is no longer needed.	2021-11-10 21:35:36 +05:30
Billy Laws	d2d181725f	Remove unused virtEnd variable in FlatMemoryManager::{Read, Write}	2021-11-10 21:35:36 +05:30
Billy Laws	60fbfad4bc	Add virtual dtors to time service code	2021-11-10 21:35:36 +05:30
Billy Laws	ef10d3d394	Use semantic wrapping where appropriate for class initialiser lists	2021-11-10 21:35:36 +05:30
Billy Laws	6b33268d85	Remove unused gm20b EngineID enum	2021-11-10 21:35:36 +05:30
Billy Laws	73896c2e6b	Fixup nvdrv channel types to follow naming conventions	2021-11-10 21:35:36 +05:30
Billy Laws	ad900aba7a	s/Host1X/Host1x/ as per Nvidia naming	2021-11-10 21:35:36 +05:30
Billy Laws	dbfb1cfe20	Fully implement the nvdrv Host1xChannel::Submit operation This pushes a set of command buffers into the Host1x command FIFO allocated for the channel, returning fence thresholds that can be waited on for completion,	2021-11-10 21:35:36 +05:30
Billy Laws	baefb0fe93	Implement the Host1x command FIFO together with barebones Host1x classes The Host1x block of the TX1 supports 14 separate channels to which commands can be issued, these all run asynchronously so are emulated the same way as GPU channels with one FIFO emulation thread each. The command FIFO itself is very similar to the GPFIFO found in the GPU however there are some differences, mainly the introduction of classes (similar to engines) and the Mask opcode (which allows writing to a specific set of offsets much more efficiently). There is an internal Host1x class which functions similar to the GPFIFO class in the GPU, handling general operations such as syncpoint waits, this is accessed via the simple method interface. Other channels such as NVDEC and VIC are behind the 'Tegra Host Interface' (THI) in HW, this abstracts out the classes internal details and provides a uniform method interface ontop of the Host1x method one. We emulate the THI as a templated wrapper for the underlying class. Syncpoint increments in Host1x are different to GPU, the THI allows submitting increment requests that will be queued up and only be applied after a specific condition in the associated engine is met; however the option to for immediate increments is also available.	2021-11-10 21:35:36 +05:30
Billy Laws	2494cafee8	Cleanup GPFIFO comments and make Run() private	2021-11-10 21:35:36 +05:30
Billy Laws	2577658fc7	Avoid GetPointer on nvmap handles where they would be accessed via SMMU GetPointer sets the sharedMemMapped flag, which should only be set if other userspace processes have the handle mapped.	2021-11-10 21:35:36 +05:30
Billy Laws	fd0420443c	Add template utils for constructing all elements in an std::array This avoids the excessive repetition needed for the case where array members have no default constructor. eg: ```c++ std::array<Type, 10> channels{util::MakeFilledArray<Type, 10>(typeConstructorArg, <...>)}; ```	2021-11-10 21:35:36 +05:30
Billy Laws	34bf413661	Fix bitmask check for event IDs > 32 in Ctrl::SyncpointFreeEventBatch Doing 1 << 32 would result in an integer overflow rather than the desired behaviour of checking a bit, make 1 64 bit to present that.	2021-11-10 21:34:30 +05:30
Billy Laws	debab7c9c7	Implement nvmap handle pinning/unpinning nvmap allows mapping handles into the SMMU address space through 'pins'. These are refcounted mappings that are lazily freed when required due to AS space running out. Nvidia implements this by using several MRU lists of handles in order to choose which ones to free and which ones to keep, however as we are unlikely to even run into the case of not having enough address space a naive queue approach works fine. This pin infrastructure is used by nvdrv's host1x channel implementation in order to handle mapping of both command and data buffers for submit.	2021-11-10 21:34:30 +05:30
Billy Laws	a0c57256cc	Hookup FlatMemoryManager for SMMU into SoC The SMMU is used to control the mappings of peripherals such as the VIC and NVDEC.	2021-11-10 21:34:30 +05:30
Billy Laws	97dc053ffd	Move FlatAllocator allocation error handling to the caller This is a prerequisite for nvmap SMMU memory management, which only frees the memory handles refer to if an allocation fails.	2021-11-10 21:34:30 +05:30
Billy Laws	04e5237ec1	Stub host1x channel devices and IOCTLs host1x channels are generally similar to GPU channels however there is only one channel for each specific class (like a GPU engine) and an address space is shared between them all. This PR implements the simple IOCTLs with the larger ones that will depend on changes outside of nvdrv being left for future commits. This is enough to partly run oss-nvjpeg.	2021-11-10 21:34:30 +05:30
Billy Laws	5087d3dc2a	Reserve host1x channel syncpoints	2021-11-10 21:34:30 +05:30
Billy Laws	b0a5dab0f7	Support passing additional constructor arguments to nvdrv devices Needed for host1x channels which use the same class for multiple types.	2021-11-10 21:34:30 +05:30
Billy Laws	eb6f052873	Fixup GpuChannel::SetNvmapFd to take an FD rather than an nvmap handle	2021-11-10 21:34:30 +05:30
Billy Laws	386a3447a8	Introduce variable sized span support to nvdrv deserialisation The element containing the size first needs to be saved to a save slot with Save<T, slotId>, it can then be read back later as the size of a span with SlotSizeSpan<T, slotId>. This is needed to support the Host1XChannel Submit IOCTL.	2021-11-10 21:34:30 +05:30
Billy Laws	6eeaa343f8	Avoid crash when passing unallocated syncpoint IDs to EventWait	2021-11-10 21:34:30 +05:30
PixelyIon	fbfad21f03	Migrate `Maxwell3D::Registers` to `OffsetMember` Maxwell3D registers were primarily written with absolute offsets and ended up being fairly messy due to attempting to emulate this using struct relative positioning resulting in a lot of pointless padding members. This has now been improved by utilizing `OffsetMember` to directly use offsets resulting in much neater code.	2021-11-10 21:31:45 +05:30
PixelyIon	69761389ff	Extend `OffsetMember` with direct `operator=`/`operator[]` It was found to be semantically advantageous to directly pass-through certain operators such as the assignment (`=`) and array index (`[]`) operators. These would require a dereference prior to their usage otherwise but now can be directly used.	2021-11-10 21:30:02 +05:30
Billy Laws	cc7b2a0ab8	Introduce `OffsetMember` for offset-based union members The offset of a member in a structure was determined by its relative position and compiler alignment. This is unoptimal for larger structures such as those found in GPU Engines that are primarily named by offset rather than relative positioning and end up requiring a massive amount of padding to function as is. A solution to this problem was simply to supply member offsets directly which can be done by using `OffsetMember` alongside a `union`.	2021-11-10 21:17:31 +05:30
PixelyIon	fb476567ff	Introduce `JniString` as C++ wrapper over `jstring` We used to manually call JNI UTF-8 string allocation and deallocation functions when utilizing a `jstring` but this could be erroneous and is just inconvenient. All of this has now been consolidated into an class `JniString` which is a wrapper around `std::string` and creates a copy of the contents of the `jstring` in its constructor and passes them into the `std::string` constructor.	2021-11-09 21:18:47 +05:30
PixelyIon	79ceb2cf23	Improve Vulkan `Texture` Synchronization The Vulkan Pipeline Barriers were unoptimal and incorrect to some degree prior as we purely synchronized images and not staging buffers. This has now been fixed and improved in general with more relevant synchronization.	2021-11-09 21:08:03 +05:30
PixelyIon	414c0104c3	Rework Joy-Con Vibration Conversion The guest -> host vibration conversion code was entirely broken as it didn't set the vibration `start`/`end` timestamps correctly for a cycle nor did it subtract from the `totalAmplitude` (`currentAmplitude` now) when it a cycle ended due to an incorrect `if` statement and contents. It would just end up saturating the amplitude as much as possible by adding more and more to `totalAmplitude` on every cycle while never subtracting which is entirely wrong and led to a very noticeable drop in amplitude when a vibration was repeated. It's been entirely reworked to fix all the issues listed above and remove a lot of code that had no understandable purpose. More comments have also been added to the code to make it more readable with better variable and function naming alongside it.	2021-11-01 00:28:11 +05:30
PixelyIon	4b80e1f91c	Use libcxx from LLVM Project submodule The version of libcxx shipped with Android NDK is fairly outdated and doesn't contain several features we desire such as C++ 20 ranges. This has been fixed by using libcxx directly from the LLVM Project which has been added as a submodule and can be updated independently of NDK.	2021-10-31 16:04:44 +05:30
PixelyIon	82154f3ef6	Upgrade AGP to `7.1.0-beta01` & NDK to `24.0.7856742` We've moved to using a beta AGP as `7.0.2` is breaks `clangd` and other C++ features on Beta/Canary Android Studio. NDK was additionally updated with `mbedtls` to fix warnings caused by it alongside some other minor fixes to code for newer versions of libcxx. The new AGP has a bug where it does not look for executables specified in `android_gradle_build.json` in `PATH` that includes `ninja` which is provided by the `ninja-build` package on the system rather than Android SDK's CMake on GitHub Actions (Ubuntu 20.04). This has been fixed by symlinking `/usr/bin/ninja` to the project root which is searched in for the `ninja` executable.	2021-10-31 15:50:15 +05:30
PixelyIon	962d8dc4c8	Return immediately for non-joining `KProcess::Kill`s when already killed Locking `KProcess::threadMutex` when a process is being killed by another thread with `join` can lead to the non-joining killer effectively joining as it's waiting on the joining killer to relinquish the mutex. This has been fixed by having an atomic boolean tracking if the process has already been killed and if it has, immediately returning prior to locking the mutex for any non-joining killers.	2021-10-31 15:45:10 +05:30
Billy Laws	6f59cba68d	Adds bounds checks to resampler to avoid OOB reads Resampling would sometimes perform an OOB read into `inputBuffer` due it not containing enough data to calculate corresponding the output sample, this has been fixed by introducing bounds checking to ensure that the buffer has enough data.	2021-10-29 21:46:51 +05:30
PixelyIon	9f5ab13858	Implement `R16G16{Unorm/Sint/Uint}` RT Formats Utilized in "The Touryst"	2021-10-29 20:21:58 +05:30
PixelyIon	afebf77544	Zero-Initialize `GuestTexture` Members Members of `GuestTexture` were apparently not being initialized and this led to UB since they would be read as random values. Titles such as Super Mario Odyssey avoided setting `baseArrayLayer` which led to it being left at the default value which was completely random and this would lead to crashes. This commit fixes this by initializing said values correctly.	2021-10-27 22:49:45 +05:30
PixelyIon	a0921f8261	Implement `R16G16B16A16Snorm`/`R16G16B16A16Sint` RT Formats Utilized in "The Touryst"	2021-10-27 22:49:45 +05:30
PixelyIon	df64ff5d14	Zero-Fill `IAudioRenderer::RequestUpdate` Output Buffer Some titles don't clear the output buffer prior to submission, as the service is expected to fill all of it in, our audren implementation is incomplete and doesn't end up doing this leaving the contents of the buffer to be undefined leading to UB in the form of SEGFAULTs or the application throwing a fatal error. This has been patched over by 0-filling the buffer which is a sane default value for the fields that aren't filled in albeit not a replacement for a proper audren implementation.	2021-10-27 16:18:20 +05:30
PixelyIon	1f3519e6e3	Fix Logger Message OOB Access Certain titles can submit logs where the last field is one off by the buffer end, the logger loop now considers this and terminates if there isn't enough data left to read the field type and length.	2021-10-26 21:59:47 +05:30
PixelyIon	645183c903	Fix OOB Vibration Array Access in `VibrateDevice` Access to the `vibrations` field in `vibrations[3].period` could lead to UB, this has been replaced with a proper check which adds up the period over all vibrations instead. A minor cleanup with variable names and explicit types for integer arithmetic has also been done.	2021-10-26 21:57:28 +05:30
PixelyIon	dc3f7f1ab4	Fix Incorrect Scissor Extent The decomposition from `texture::Dimension` to `vk::Rect2D` was somehow implicit and completely incorrect resulting in wrong conversion with undefined values. It's now been fixed by explicitly setting `vk::Rect2D::extent` to `scissor` specifically.	2021-10-26 20:08:53 +05:30
PixelyIon	a60f238479	Fix `GPU::DebugCallback` Type String Extraction The second parameter of `std::string_view::substr` was assumed to be an end position (similar to `std::span`) rather than `count` which it is. As a result of this, it was entirely broken but only held together by a constant factor being subtracted from it which was derived by trial and error. It's now been fixed by returning a count rather than the absolute position.	2021-10-26 20:08:35 +05:30
Billy Laws	70d1b4994c	Enable Wconversion and fix warnings produced	2021-10-26 11:41:24 +01:00
Billy Laws	1e7347bf72	Use semantic wrapping for nvdrv where appropriate	2021-10-26 10:46:36 +05:30
PixelyIon	830a800d9e	Consolidate `AddAttachment` Loops + Rename `Renderpass` -> `RenderPass`	2021-10-26 10:46:36 +05:30
PixelyIon	92a21ea616	Cleanup & Use C++ Concepts in `utils.h`	2021-10-26 10:46:36 +05:30
PixelyIon	ea2626bcc6	Address CR Comments	2021-10-26 10:46:36 +05:30
PixelyIon	1d532628cb	Null-Check Optional NACP before extracting application title Not doing this can lead to the NACP being filled with invalid data and led to crashes on homebrew titles like SpaceNX.	2021-10-26 10:46:36 +05:30
PixelyIon	c8821c7313	Update `nvdrv` perms to 11.0.0+ & Implement `nvdrv:a` service `nvdrv:a` (For Applets) is used by some older homebrew such as SpaceNX which don't fall back to `nvdrv` (For Applications).	2021-10-26 10:46:36 +05:30
PixelyIon	3b4bbd2b38	Switch to using exceptions for guest exiting Guest-driven exiting could cause objects left on the heap due to a `std::longjmp` from high up in the host call stack, this has been fixed by introducing `ExitException` which implicitly unrolls the stack with the exception handling mechanism.	2021-10-26 10:46:36 +05:30
PixelyIon	eff5711c49	Split monolithic `common.h` into smaller chunks * Resolves dependency cycles in some components * Allows for easier navigation of certain components like `span` which were especially large * Some imports have been moved from `common.h` into their own files due to their infrequency	2021-10-26 10:46:36 +05:30
PixelyIon	1d57bab08f	Revamp `LicenseDialog` + Update Licenses + Stop Bintray Usage * Update licenses for dependent projects * Add copyright notices (as provided) * Revamp styling for `LicenseDialog` * Fix invisible `PreferenceDialog` buttons in Settings * Consolidating color variables into `colorPrimary`, `backgroundColor` and `backgroundColorVariant` * Use `com.google.android.flexbox:flexbox:3.0.0` (Google Maven) rather than `com.google.android:flexbox:2.0.1` (Bintray)	2021-10-26 10:46:36 +05:30
PixelyIon	bbf28d1942	Improve Clean Exit + Audio Pausing + Improve System Language Setting * Clean Exiting was improved by implementing a robust system for when to abandon clean exiting and simply restart the process alongside moving clean exiting to the background when the application is quit by using the back button * Audio is now automatically paused whenever the application is moved to the background and automatically resumed when it's brought to the foreground * The system language setting had several errors and inconsistencies which have now been fixed, it's been brought more in line with HOS language (Albeit not entirely due to no region setting in Skyline) * Fix a bug with `ThreadLocal` where the atomic `list` pointer was uninitialized resulting in a `SEGFAULT` during the destructor	2021-10-26 10:46:36 +05:30
PixelyIon	a7548c79a0	Android 12 Support + Update Libraries + Include Khronos Validation Layer * Fix handling `SA_EXPOSE_TAGBITS` bit being set in Android 12 `sigaction` * Fix CMake bug using `CMAKE_INTERPROCEDURAL_OPTIMIZATION_RELEASE` when not supported causing `-fuse-ld=gold` to be emitted as a linker flag * Support using `VIBRATOR_MANAGER_SERVICE` rather than `VIBRATOR_SERVICE` on Android 12 * Optimize Imports for Kotlin code * Move away from deprecated APIs in Kotlin or explicitly mark where it's not possible * Update SDK, NDK and libraries * Enable Gradle Configuration Cache	2021-10-26 10:46:36 +05:30
Billy Laws	b7d0f2fafa	Implement support for pushbuffer methods split across multiple GpEntries These are used heavily in OpenGL games, which now, together with the previous syncpoint changes, work perfectly. The actual implementation is rather novel as rather than using a per-class state machine for all methods we only use it for those that are known to be split across GpEntry boundaries, as a result only a single bounds check is added to the hot path of contiguous method execution and the performance loss is negligible.	2021-10-16 12:13:30 +01:00
Billy Laws	fc017e1e95	Implement pre-wait and post-increment syncpoint operations in submit These are used by both OpenGL and Vulkan games as opposed to including the operations inside the main commandbuffer.	2021-10-16 12:13:30 +01:00
PixelyIon	9b9bf8d300	Introduce `ThreadLocal` Class + Fix Several GPU Bugs * Fix `AddClearColorSubpass` bug where it would not generate a `VkCmdNextSubpass` when an attachment clear was utilized * Fix `AddSubpass` bug where the Depth Stencil texture would not be synced * Respect `VkCommandPool` external synchronization requirements by making it thread-local with a custom RAII wrapper * Fix linear RT width calculation as it's provided in terms of bytes rather than format units * Fix `AllocateStagingBuffer` bug where it would not supply `eTransferDst` as a usage flag * Fix `AllocateMappedImage` where `VkMemoryPropertyFlags` were not respected resulting in non-`eHostVisible` memory being utilized * Change feature requirement in `AndroidManifest.xml` to Vulkan 1.1 from OGL 3.1 as this was incorrect	2021-10-16 12:13:30 +01:00
Billy Laws	eb25f60033	Implement multichannel support for GPU Allows the execution of multiple channels at the same time, with locking being performed on the host GPU scheduler layer, address spaces can be bound to one or more channels.	2021-10-16 12:13:30 +01:00
PixelyIon	b762d1df23	Introduce Texture Always Sync + Wait on GPU Execution + More RT Formats Infrastructure for always syncing textures has been introduced now, they will be synced prior to and after every execution. This does considerably reduce the performance alongside waiting on GPU execution to finish but it will be partially recouped once conditional syncing is performed.	2021-10-16 12:13:30 +01:00
PixelyIon	f8acc1e131	Improve Shared Fonts + Fix AM `PopLaunchParameter` & Choreographer Bug * Move Shared Font TTFs to AAsset storage + Support external shared font loading from `/data/data/skyline.emu/data/fonts` * Fix bug in `IApplicationFunctions::PopLaunchParameter` caused by ignoring `LaunchParameterKind` * Fix bug with Choreographer causing it to be awoken and exit prior to the destruction of `PresentationEngine` * Fix bug with `IDirectory::Read` where it used `inputBuf` for the output buffer rather than `outputBuf` * Improve `GetFunctionStackTrace` logs when `dli_sname` or `dli_fname` are missing * Support more RT Formats	2021-10-16 12:13:30 +01:00
PixelyIon	95a08627e5	Subpass Support + More RT Formats + Fix `FenceCycle` Cyclic Dependencies Support for subpasses was added by reworking attachment reuse code to account for preserved attachments and subpass dependencies. A lot of RT formats were also added to allow SMO to boot up entirely, it should be noted that it doesn't render anything. `FenceCycle` had a cyclic dependency which broke clean exit, we now utilize `std::weak_ptr<FenceCycle>` inside the `Texture` object. A minor fix for broken stack traces was also made caused by supplying a `nullptr` C-string to libfmt when a symbol was unresolved which caused an `abort` due to invocation of `strlen` with it.	2021-10-16 12:13:30 +01:00
PixelyIon	239d2625e2	Introduce `CommandExecutor` + Implement `ClearBuffers` + More RT Formats This commit introduces the `CommandExecutor` which is responsible for creating and orchestrating a Vulkan command graph comprising of `CommandNode`s that construct all the objects required for rendering. As a result of the infrastructure provided by `CommandExecutor`, `ClearBuffers` could be implemented and be appropriately utilized. A bug regarding scissors was also determined and fixed in the PR, the extent of them were previously inaccurate and this has now been fixed. Note: We don't synchronize any textures from the guest for now as this would override the contents on the host, this'll be fixed with the appropriate write tracking but it also results in a black screen for anything that writes to FB	2021-10-05 01:13:22 +05:30
PixelyIon	3879d573d5	Fix Command Buffer Allocation & `FenceCycle` This commit fixes a major issue with command buffer allocation which would result in only being able to utilize a command buffer slot on the 2nd attempt to use it after it's freed, this would lead to a significantly larger amount of command buffers being created than necessary. It also fixes an issue with the command buffers not being reset after they were utilized which results in UB eventually. Another issue was fixed with `FenceCycle` where all dependencies are only destroyed on destruction of the `FenceCycle` itself rather than the function where the `VkFence` was found to be signalled.	2021-10-05 01:13:22 +05:30
PixelyIon	bee28aaf0d	Validation Layer Filter + Fix `Texture`, GPU & `PresentationEngine` bugs This commit implements a filter by type for any validation layer output, this allows filtering out any logs which may be unnecessary and additionally triggering a breakpoint as required. An issue concerning the `NDEBUG` flag never being set was fixed, it's now supplied as a release compiler flag. The issue can manifest itself by always relying on a validation layer even though it shouldn't on release, this is why the validation layer was mistakenly disabled entirely previously by using `#ifndef` rather than `#ifdef`. An issue with the initial layout of a texture being supplied as neither `VK_IMAGE_LAYOUT_UNDEFINED` or `VK_IMAGE_LAYOUT_PREINITIALIZED` was fixed, these cases are now handled by transitioning to those layouts after creating the image rather than supplying it within `initialLayout`. Another issue was fixed regarding not maintaining a transformation after a surface has been destroyed and recreated existed and manifested itself when the user would go out of the app and come back in, they would see the surface having an identity transformation rather than the desired one.	2021-10-05 01:13:22 +05:30
PixelyIon	54908afc44	Texture GMMU Address Resolution + Refactor `Maxwell3D::CallMethod` Fixes bugs with the Texture Manager lookup, fixes `RenderTarget` address extraction (`low`/`high` were flipped prior), refactors `Maxwell3D::CallMethod` to utilize a case for the register being modified + preventing redundant method calls when no new value is being written to the register, and fixes the behavior of shadow RAM which was broken previously and would lead to incorrect arguments being utilized for methods.	2021-10-05 01:13:22 +05:30
PixelyIon	270f2db1d2	Initial Texture Manager Implementation + Maxwell3D Render Target Implement the groundwork for the texture manager to be able to report basic overlaps and be extended to support more in the future. The Maxwell3D registers `RenderTargetControl`, `RenderTarget` and a stub for `ClearBuffers` were implemented. A lot of changes were also made to `GuestTexture`/`Texture` for supporting mipmapping and multiple array layers alongside significant architectural changes to `GuestTexture` effectively disconnecting it from `Texture` with it no longer being a parent rather an object that can be used to create a `Texture` object. Note: Support for fragmented CPU mappings hasn't been added for texture synchronization yet	2021-10-05 01:13:22 +05:30
PixelyIon	8cba1edf6d	Introduce Boost as a submodule + Minor Fixes Utilize Boost Container's `small_vector` for optimizing allocations, fix certain implicit casting issues and make `ILogger` not output an additional newline in the log when the application supplies one at the end of the log	2021-10-05 01:13:22 +05:30
PixelyIon	190fde110f	Introduce `GraphicContext` and Implement Viewport Transform + Scissors	2021-10-05 01:13:22 +05:30
PixelyIon	bc378ad135	Fix `GraphicBufferProducer` Format Bug The format provided in `GraphicBuffer` can be misleading and is supplied as `None` by the Deko3D Swapchain, it instead supplies the real format in the `NvGraphicHandle` which we now utilize instead of the one in `GraphicBuffer`.	2021-10-05 01:13:22 +05:30
PixelyIon	18cf0b82d6	Fix KTransferMemory Destruction Behavior + Add Missing SVC Handle Logs KTransferMemory needs to be reset to RW on destruction unlike KSharedMemory which is simply freed on destruction, not emulating this behavior accurately leads to `deko_examples` from `switch-examples` to lead to a SEGFAULT on selecting an example as it expects the memory to be R/W while it ends up being freed instead	2021-10-05 01:13:22 +05:30
Billy Laws	192459f726	Adjust AS block end predecessors to successors after reuse as end	2021-10-04 20:29:02 +01:00
PixelyIon	d45193874e	Fix NvDrv `CloseDevice` Bug The unique pointer to a device in the map was simply reset rather than deleting the entry from the map, this resulted in the device not being properly closed and when the device was reopened then the `emplace` was a NOP as the entry already existed. This resulted in a `nullptr` dereference down the line when an application attempted to issue an IOCTL to a device that was previously closed and reopened. This is known to occur in Deko3D as it recreates the context when loading an example which includes closing and reopening devices.	2021-10-03 18:02:32 +05:30
Billy Laws	7a5ca19c0b	Fix improper VMM error check in unmap	2021-10-01 20:58:22 +01:00
Billy Laws	4eff515c55	Add support for IOCTL3 CtrlGpu calls missed in refactor	2021-10-01 20:57:59 +01:00
PixelyIon	02bcf98b8c	Fix NvDrv Bugs + NACP Parsing Bug A hotfix for a nvdrv bug where an IOCTL with no input/output buffers would crash and a major `nvmap` bug which broke the `FromId` IOCTL as it didn't write back the handle ID, another minor bug existed in `nvhost` where the `ZCullGetInfo` IOCTL was `INOUT` rather than `OUT`. A bug with NACPs was also fixed caused by incorrect padding for the `NacpData` structure which resulted in the `saveDataOwnerId` member being read incorrectly and as a result the save data folder being incorrect. Co-authored-by: artem8086 <artemsvyatoha@gmail.com>	2021-10-02 00:15:53 +05:30
lynxnb	afc3dda1ee	Reintroduce `LanguageCode` alias type	2021-09-30 02:19:19 +05:30
lynxnb	c6d91c552b	Address CR	2021-09-30 02:19:19 +05:30
lynxnb	0077833667	Support game icon selection based on language	2021-09-30 02:19:19 +05:30
lynxnb	01ce09183b	Rework common languages Swapped out the usages of frozen constexpr unordered maps for switch statements, which are very likely to be turned into jump tables given the nature of the enums used, resulting in better performance than a map	2021-09-30 02:19:19 +05:30
sspacelynx	4d20d7a4d0	Address CR + Code formatting	2021-09-30 02:19:19 +05:30
sspacelynx	ca97517d81	Rework `GetDesiredLanguage` + Remove `LanguageCode` alias type	2021-09-30 02:19:19 +05:30
sspacelynx	fdc7e1238a	Implement language support for rom loaders	2021-09-30 02:19:19 +05:30
sspacelynx	3c5509e52a	Update available languages to those post 10.1.0 Brazilian Portuguese has been added in firmware version 10.1.0	2021-09-30 02:19:19 +05:30
sspacelynx	3fffde3061	Rework OS to allow passing a language	2021-09-30 02:19:19 +05:30
Billy Laws	a4a6511177	Address feedback	2021-09-30 02:11:19 +05:30
Billy Laws	a18f1aa889	Address feedback	2021-09-30 02:11:19 +05:30
Billy Laws	8db2cf29f2	Implement AsGpu::FreeSpace and clean up locking and debug prints	2021-09-30 02:11:19 +05:30
Billy Laws	d3dc0bddc4	Cleanup IStorageAccessor validation code	2021-09-30 02:11:19 +05:30
Billy Laws	39faa739b9	Optimise GPFIFO command processing for higher throughput Using a u32 for the loop index prevents masking on all increments, giving a moderate performance increase. Passing methods as u32 parameters and stopping subChannel being passed gives quite a significant increase when combined with the inlining allowed by subchannel based engine selection.	2021-09-30 02:11:19 +05:30
Billy Laws	3d538a29da	Misc code cleanup and more comments in AS map	2021-09-30 02:11:19 +05:30
Billy Laws	30d41252f7	Use IPC raw data size to calculate C Buffer offset Some games don't have a u16 size array, the code for this is also kinda pointless when we can use the raw data size too.	2021-09-30 02:11:19 +05:30
Billy Laws	d03b288db6	NEEDS CLEANUP: Reimplement GPU VMM and rewrite nvdrv VM impl	2021-09-30 02:11:19 +05:30
Billy Laws	020aa0e43a	Add support for Ioctl2/3 and improve debug logging	2021-09-30 02:11:19 +05:30
Billy Laws	9bda574a4e	Move nvhost-ctrl-gpu to new device API	2021-09-30 02:11:19 +05:30
Billy Laws	39492d9365	Move nvhost-gpu to new device API The implementation of GPU channels and Host1X channels will be split up as it allows a much cleaner implementation and less undefined behaviour potential.	2021-09-30 02:11:19 +05:30
Billy Laws	3d4f7d9b44	Move nvhost-as-gpu to new device API	2021-09-30 02:11:19 +05:30
Billy Laws	1149158198	Make GraphicBufferProducer use the new nvmap API	2021-09-30 02:11:19 +05:30
Billy Laws	c97f5a9315	Entirely rewrite nvmap to separate global and per device state. This will be required later for NVDEC/SMMU support and fixes many significant issues in the previous implementation. Based off of my 2.0.0/12.0.0 nvdrv REs.	2021-09-30 02:11:19 +05:30
Billy Laws	78356fa789	Migrate syncpoint management over to the new device API The syncpoint manager has beeen given convinience functions for fences which remove the need to access the raw id/threshold most of the time and various accuracy fixes and cleanups to match HOS 12.0.0 have also been done.	2021-09-30 02:11:19 +05:30
Billy Laws	a19bf973b1	Rework the nvdrv core for accuracy and support new deserialisation Devices will need to be moved over in order to support this new interface, IOCTL2/3 support will be added in a later commit.	2021-09-30 02:11:19 +05:30
Billy Laws	db2abe2290	Use concepts for Backing::Read over enable_if	2021-09-30 02:11:19 +05:30
Billy Laws	3340c74332	Implement efficient template based IOCTL deserialisation This moves IOCTLs from frozen to a template based parser, this allows IOCTL lookup to be performed much more optimally and some IOCTLs to be inlined.	2021-09-30 02:11:19 +05:30
Billy Laws	75a67dcfa5	Allow supplying a custom ResultValue result type and optimise span This allows PosixResultValue to be created easily.	2021-09-30 02:11:19 +05:30
Billy Laws	3d3c13f90c	Add common PosixResult and PosixResultValue types Many services like bsd and nvdrv use these to represent the result values of functions so add a common type for these as an alternative to the macros.	2021-09-30 02:11:19 +05:30
sspacelynx	14dc42b991	Add `IntegerFor` template class + Update `GenerateUuidV4`	2021-09-22 06:24:05 +05:30
sspacelynx	ea3c7301b1	Make `UUID::GenerateUuidV4` use `util::GetRandomBytes`	2021-09-22 06:24:05 +05:30
sspacelynx	10d43c88c9	csrng service implementation Implements the 'csrng:' service using C++ <random>'s Mersenne Twister, this does make it insecure for cryptographic purposes but it is pointless to attempt to do this regardless as we cannot ensure that the guest will run in a secure environment which cannot be mutated by an attacker. Used by Prison Princess, Pokemon Cafe Mix, Paint your Pet and more.	2021-09-22 06:24:05 +05:30
sspacelynx	06674b3d07	hwopus decoder service implementation	2021-07-19 02:07:12 +05:30
PixelyIon	b533801bbe	Fix Scheduler Core Migration Deadlock Encountered in 不如帰大乱 when `HOS-3` is awoken at the same time as `HOS-0` called `SvcSetThreadCoreMask` resulting in a deadlock where `HOS-0` owns `HOS-3`'s `coreMigrationMutex` while `HOS-3` owns the core mutex with the both of them attempting to lock the other mutex	2021-07-12 21:27:49 +05:30
PixelyIon	dba99ce542	Implement `IApplicationDisplayService::ConvertScalingMode` Utilized by アイリス魔法学園	2021-07-12 21:27:49 +05:30
PixelyIon	1ad5ea6867	Move Mbed TLS to submodule + Update Java Libraries We've moved from using an AAR for Mbed TLS to a submodule as the AAR was packaged manually and used from a local repository which ended up being very hacky and resulted in Linter errors, it could also not be updated with ease as it would need to be repackaged. All of these issues have been solved by moving to a git submodule tied to the official Mbed TLS GitHub repository.	2021-07-12 21:27:49 +05:30
PixelyIon	ec1da1b3f5	Address CR Comments + Increase `minSdkVersion` to 29 (Android 10)	2021-07-12 21:27:49 +05:30
PixelyIon	19be9f4b66	Support Sticky Transforms + Minor Formatting Fixes Sticky transforms have been stubbed, as they are on HOS/Android. Certain titles like Xenoblade Chronicles end up setting the sticky transform even if it doesn't do anything, as a result of this we cannot throw an exception for it and stub it without an exception (Aside from the cases where the value isn't recognized).	2021-07-12 21:27:49 +05:30
PixelyIon	9f967862bd	Implement additional IGraphicBufferProducer transactions The following GraphicBufferProducer transactions were implemented: * `SetBufferCount` * `DetachBuffer` * `DetachNextBuffer` * `AttachBuffer` It should be noted that `preallocatedBufferCount` (previously `hasBufferCount`) and `activeSlotCount` were adapted accordingly with how they were effectively the same value as all active buffers were preallocated prior but now there can be a non-preallocated active slot. Additionally, a bug has been fixed where `SetPreallocatedBuffer` has the graphic buffer as an optional argument whereas it was treated as a mandatory argument prior and could lead to a SEGFAULT if an application were to not pass in a buffer.	2021-07-12 21:27:49 +05:30
PixelyIon	0d6d90c4cd	Implement SvcUnmapSharedMemory + Remove Redundant [[unlikely]]	2021-07-12 21:27:49 +05:30
PixelyIon	f1433ad0d9	Rework VI + IHOSBinder VI/IHOSBinder suffered from major inaccuracies in their function due to being quickly thrown together initially with little concern for accuracy, this has now been fixed with them being substantially more accurate now.	2021-07-12 21:27:49 +05:30
PixelyIon	ca4744c603	Improve `ENUM_STRING` + Fix Host1X Syncpoints `ENUM_STRING` now has a unified implementation in <common/macros.h> with a documented format and can be used throughout the codebase. A major performance regression was added in the Host1X Syncpoint revamp as it did a syscall if there were any waiters during `Increment` even if they would just be woken up and go back to sleep as the threshold wasn't hit. It has now been optimized to only do a wake if any waiting thread needs to be awoken. There was also a bug concerning increment where it would perform actions corresponding to the previous increment rather than the current one which has also been fixed.	2021-07-12 21:27:49 +05:30
PixelyIon	6595670a5d	Rework Performance Statistics We used instantaneous values for FPS previously which led to a lot of variation in it and the inability to determine a proper FPS value due to constant fluctuations. All FPS values are now averaged to allow reading out a stable value and a deviation statistic has been added for the frame-time to judge judder and frame-pacing which allows for a significantly better measure of overall performance. The formatting for all the floating-point numbers is now fixed-point to prevent shifting of position due to decimal digits becoming 0.	2021-07-12 21:27:49 +05:30
PixelyIon	ce804254ab	Support Swap Interval + Cropping + Transforms and more for QueueBuffer Support for the following parameters was added to `QueueBuffer`: * Earliest Present Timestamp * Swap Interval * Crop * Scaling Mode * Transform * Frame ID (Not returned to guest yet) We utilize ANativeWindow APIs directly to achieve all of this in an efficient manner since HWC will be used directly for it, we do plan to introduce Vulkan equivalents for all of these operations later down the line for a port to non-Android platforms.	2021-07-12 21:27:49 +05:30
PixelyIon	2e8ce2cbaf	Update C++ Libraries	2021-07-12 21:27:49 +05:30
PixelyIon	6854e07356	Replace LogActivity with "Share Logs" + Add Timestamp to Logger	2021-07-12 21:27:49 +05:30
PixelyIon	e511e158e3	Implement Display Settings + Attach Texture to FenceCycle	2021-07-12 21:27:49 +05:30
PixelyIon	b2bb855a02	Revamp Host1X Syncpoint + Address Review Comments	2021-07-12 21:27:49 +05:30
PixelyIon	216e5cee81	Separate Guest and Host Presentation + AChoreographer V-Sync Event We had issues when combining host and guest presentation since certain configurations in guest presentation such as double buffering were very unoptimal for the host and would significantly affect the FPS. As a result of this, we've now made host presentation have its own presentation textures which are copied into from the guest at presentation time, allowing us to change parameters of the host presentation independently of the guest. We've implemented the infrastructure for this which includes being able to create images from host GPU memory using VMA, an optimized linear texture sync and a method to do on-GPU texture-to-texture copies. We've also moved to driving the V-Sync event using AChoreographer on its on thread in this PR, which more accurately encapsulates HOS behavior and allows games such as ARMS to boot as they depend on the V-Sync event being signalled even when the game isn't presenting.	2021-07-12 21:27:49 +05:30
PixelyIon	36547cd5dc	Redesign Texture Class + Improve Presentation Engine This commit reworks the `Texture` class to include a Vulkan Image backing that can be optionally owning or non-owning and swapped in with consideration for Vulkan image layout, it also adds CPU-sided synchronization for the texture objects with FenceCycle. It also makes the appropriate changes to `PresentationEngine` and `GraphicBufferProducer` to work with the new `Texture` class while setting the groundwork for supporting swapchain recreation. It also fixes a log in `IpcResponse` and improves the display mode selection algorithm by further weighing refresh rate.	2021-07-12 21:27:49 +05:30
PixelyIon	b2132fd7aa	Implement Fence Cycle, Memory Manager and Command Scheduler Implements a wrapper over fences to track a single cycle of activation, implement a Vulkan memory manager that wraps the Vulkan-Memory-Allocator library and a command scheduler for scheduling Vulkan command buffers	2021-07-12 21:27:49 +05:30
PixelyIon	d8025e7178	Rework GraphicBufferProducer This commit makes GraphicBufferProducer significantly more accurate by matching the behavior of AOSP alongside mirroring the tweaks made by Nintendo. It eliminates a lot of the magic structures and enumerations used prior and replaces them with the correct values from AOSP or HOS. There was a lot of functional inaccuracy as well which was fixed, we emulate the exact subset of HOS behavior that we need to. A lot of the intermediate layers such as GraphicBufferConsumer or Gralloc/Sync are not emulated as they're pointless abstractions here.	2021-07-12 21:27:49 +05:30
PixelyIon	795472004e	Presentation Engine Vulkan Surface + Swapchain Initialization This commit adds in VkSurface/VkSwapchain initialization and recreation. It also adapts GraphicsBuffferProducer and Texture to fit in with those changes but it doesn't yet implement presenting those buffers nor uploading guest buffers onto the host.	2021-07-12 21:27:49 +05:30
PixelyIon	9a09a4e815	Extend VkQueue Allocation Code The VkQueue allocation code is rewritten to iterate through all queue families to look for an appropriate queue now	2021-07-12 21:27:49 +05:30
PixelyIon	2143b43068	Vulkan PhysicalDevice + Device Initialization + Report Accurate Version Vulkan Device initialization is handled now, it supports required extensions but support for optional extensions/features/properties will come in later when we require those. In addition, we now correctly report the version of Skyline to Vulkan which can be accessed from debugging tools. There's also a minor change regarding the search pattern for `SkylineLibraries` which now only searches in headers of libraries and it also explicitly excludes the redundant `vulkan.hpp` from the `Vulkan-Headers` repository.	2021-07-12 21:27:49 +05:30
PixelyIon	8ceed74371	Vulkan Instance + Validation Layer + Debug Report Initialization The GPU class has been extended in this for Vulkan initialization, this is done to the point of initializing the instance alongside loading in `VK_LAYER_KHRONOS_validation` which is also now packed into all Debug APKs for Skyline. In addition, `VK_EXT_debug_report` is also initialized and it's output is piped directly into the Logger. A minor change regarding the type of the `Fps` and `Frametime` globals was changed to `skyline::i32`s which is a more suitable type due to those having a smaller chance of overflowing while being signed as Java doesn't have unsigned integral types.	2021-07-12 21:27:49 +05:30
Billy Laws	5be7860cf7	Fixup validity check in mmnv IRequest::Get Fixes a random crash in SMO when launching the game for the first time.	2021-06-27 14:47:52 +01:00
Billy Laws	100cff7692	Address feedback	2021-06-20 14:18:40 +01:00
Billy Laws	b87f451746	Handle empty core queues in Scheduler::UpdatePriority If the queue is empty then begin() == end() leading to ArmPreemptionTimer being called on an unscheduled thread.	2021-06-20 14:18:40 +01:00
Billy Laws	d57883705d	Avoid aligning BSS and .data individually and instead align as one As both of these are in the same memory segment they have no individual alignment requirements, this created a bug in にゃんらぶ～私の恋の見つけ方～ where the data segment would be larger than the game expected and invalid command line arguments would be read.	2021-06-20 14:18:40 +01:00
Billy Laws	e1ca24d35f	Stub IParentalControlService::Initialize This is used in newer SDKs in favour of initialising on creation with IParentalControlServiceFactory::CreateServiceOld.	2021-06-20 14:18:40 +01:00
Billy Laws	af5ee96b9a	Ensure that threads are ready before their preemption timer can be armed. It was discovered during testing of 'Hatsune Miku Project DIVA: Mega Mix' that if a thread was starting while preemption was being enabled a NULL pointer dereference could occur in the timer_settime call as timer_create may not have been called yet.	2021-06-20 14:18:40 +01:00
Billy Laws	a7dc69223b	Implement mmnv::IRequest for media module clock control This is used by games before calling into nvdec in order to clock up the HW module, it can also be used to request a RAM frequency. Since we obviously don't emulate the hardware down to this level a basic stub that provides the correct reponses is enough. Fixes a crash on first level of Super Mario Odyssey.	2021-06-20 14:18:40 +01:00
Billy Laws	54d39fbaa7	Fix backtrace format string so it properly handles SVC names This was missed when we moved over to printing SVC names rather than numbers.	2021-06-20 14:18:40 +01:00
Billy Laws	99a839d669	Check the mutex tag in MutexLock if the given handle isn't found Makes sure the correct error is reported as HOS only reports InvalidHandle if the tag matches.	2021-06-20 14:18:40 +01:00
Billy Laws	476bb65f37	Stub GetAccumulatedSuspendedTickValue and fix call misnumbering This is used by some more recent games like Hatsune Miku: Project DIVA MegaMix!	2021-06-20 14:18:40 +01:00
Billy Laws	22c83006a3	Stub application copyright image AM calls These are used on HOS to control the screenshot overlay image for developer copyrights.	2021-06-20 14:18:40 +01:00
Billy Laws	6f9189e8f9	Reimplement GetPseudoDeviceId to be more accurate Now returns a valid V5 UUID and uses the game's seed from control.nacp for generation.	2021-06-20 14:18:40 +01:00
Billy Laws	d4b13c34ee	Fixup broken SetTimeslice nvdrv ioctl that was missed in conversion	2021-06-20 14:18:40 +01:00
PixelyIon	0c29f982d5	Replace skyline-emu/VkHpp with KhronosGroup/Vulkan-Hpp We used a custom version of Vulkan-Hpp which split the files a lot prior to avoid any developers needing to manually set IDE settings for IntelliJ to work but this wasn't practical due to how it required modifications to Vulkan-Hpp's generator which would make maintenance extremely difficult. It was determined that we should just add the requirement for changing the IDE settings and use Vulkan-Hpp directly.	2021-06-17 20:30:22 +05:30
PixelyIon	8056b80073	Address CR Comments (#151 ) + Fix SvcWaitSynchronization Trace An RAII scoped trace was used for SvcWaitSynchronization but it was placed within a condition scope which led to an incorrect lifetime for the traces. Minor changes regarding the CR not affecting functionality were made aside from that.	2021-03-25 22:57:26 +05:30
PixelyIon	3f7373209a	Move Guest GPU into SoC Directory We decided to restructure Skyline to draw a layer of separation between guest and host GPU. We're reserving the `gpu` namespace and directory for purely host GPU and creating a new `soc` directory and namespace for emulation of parts of the X1 SoC which is currently limited to guest GPU but will be expanded to contain components like the audio DSP down the line.	2021-03-25 22:57:26 +05:30
PixelyIon	0ea6d9bee5	Address CR Comments (#150 ) Correct a single grammatical error	2021-03-23 02:40:02 +05:30
PixelyIon	f983b701c8	Fix BT Audio Stuttering Issues This fixes audio stuttering which occurred on certain BT audio devices by requesting an exclusive stream from Oboe alongside a low-latency stream. Co-authored-by: Billy Laws <blaws05@gmail.com>	2021-03-23 02:40:02 +05:30
PixelyIon	ae68009f9b	Extend Perfetto Tracing Add Tracing for SVCs, Services, NVDRV, and Synchronization Primitives. In addition, fix `TRACE_EVENT_END("guest")` being emitted when a signal is received while being in the guest rather than host which would cause an exception. This commit also disables warnings for the Perfetto library as we do not control fixing them.	2021-03-23 02:40:02 +05:30
PixelyIon	df3427a8c0	Extend SVC table with names This extend a descriptor table for the SVCs with names for every SVC alongside their function pointer. The names are then used for logging and eventually tracing.	2021-03-23 02:40:02 +05:30
PixelyIon	6eea38cca9	Optimize NvDevice Function Lookup This essentially optimizes NvDevice in the same way services were, refer to the last commit for details	2021-03-23 02:40:02 +05:30
PixelyIon	33f1a3e1b3	Optimize Service Function Lookup This moves from using std::function with a this pointer binding (which would likely cause a heap allocation) to returning the this pointer in a structure which implements operator() to do the call with it. It also moves to using const char* for strings from std::string_view which was pointless in this scenario due to it's usage being limited to being a C-string for the most part, it also integrates the class name directly into the string which allows us to avoid runtime string concatenation in libfmt and RTTI for finding the class name.	2021-03-23 02:40:02 +05:30
PixelyIon	2b4a094cd8	Logger Function Prefix Support	2021-03-23 02:40:02 +05:30
Billy Laws	6c6e665569	Implement Perfetto Tracing This commit implements tracing Skyline using https://perfetto.dev/ for SVCs, IPC, Scheduler and Presentation	2021-03-23 02:40:02 +05:30
PixelyIon	1155394d58	Improve KMemory + Fix Service Implementation Warning * Improve KMemory Comments * Add parameter prefix 'p-' to `KPrivateMemory::UpdatePermission` * Fix the missing trailing double quote in missing service prints, this was due to `stringName` being padded with extra 0s	2021-03-23 02:40:02 +05:30
Billy Laws	d8d4b4c9e0	Address review feedback	2021-03-21 18:56:31 +05:30
Billy Laws	74cd0bc2c3	Various timesrv cleanups and comments Mainly just adapts the rest of time to add some things missed in the initial commit as they required TZ, everything else is just renames from switchbrew and comments.	2021-03-21 18:56:31 +05:30
Billy Laws	ba418976b0	Implement full timezone service support This serves as an extension to the initial time commit and combined they provide a complete implementation of everything application facing in time. psc:ITimeZoneService and glue:ITimeZoneService are used to convert between POSIX and calendar times according to the device location. Timezone binaries are used during the conversion, details of them can be read about in the previous commit. This is based off my own glue RE and Thog's time RE.	2021-03-21 18:56:31 +05:30
Billy Laws	34cb0b49e8	Implement the entirety of time services This reimplements our time backend to be significantly more accurate to the real PSC and provides complete implementations for every time IPC allowing many newer games to work properly. Time is unique in its use of glue services, the core sysmodule is fully isolated and doesn't interface with any other services. Glue is instead used where that is needed (e.g. for fetching settings), this distinction is also present in our implementation. Another unique feature of time is its global state, as time is calibrated from the start of the service its state cannot be lost as that would result in the application offsetting time incorrectly whenever it closed a session. A large proportion of this is based off of Thog's 9.0.0 PSC reversing.	2021-03-21 18:56:31 +05:30
Billy Laws	63c54207d2	Add AndroidAsset{Backing,FileSystem} for accessing built-in assets These only implement the subset of VFS needed for time, implementing more is difficult due to some issues in the AAsset API which make support quite ugly. The abstract asset filesystem can be accessed by services through the OS class allowing other implementations to be used in the future.	2021-03-21 18:56:31 +05:30
Billy Laws	a0f6cc161c	Add a UUID struct for holding and generating RFC4122 UUIDs Time uses UUIDs for clock identification.	2021-03-21 18:56:31 +05:30
Billy Laws	937e3d6f68	Add endiannes swapping helper functions Avoid using compiler functions manually to create a cleaner interface, and allow swapping the endianness of arbritary lengths.	2021-03-21 18:56:31 +05:30
Billy Laws	def9f96b02	Add a ResultValue helper class to simplify result handling This is a very common usecase for result handling and allows much cleaner code.	2021-03-21 18:56:31 +05:30
PixelyIon	e90dd672af	Fix Clock Rescaling Bug (Introduced in #135 ) There was a mistake in the code-style refactor where the signature in the instruction encoding of `MRS` was set to `0xD54` instead of `0xD53` which would cause a SIGILL (Illegal Instruction) for devices which had their HW timer frequency equivalent to the Switch (19.2MHz) as a modified `MRS` would be deployed there. This issue should not affect devices which perform clock rescaling as the `MRS` instruction there is encoded by the assembler.	2021-03-10 15:13:57 +05:30
sspacelynx	75b769ca1d	Add NSP ticket extraction support	2021-03-07 23:16:36 +05:30
sspacelynx	77d8d1bee1	Add support for XCI titlekey decryption	2021-03-07 22:55:59 +05:30
sspacelynx	666df1eb43	Add XCI file format support	2021-03-07 22:55:59 +05:30
Billy Laws	a8dafc10e0	Fix backing read size check I buggered this one up and broke all reads during the refactor, so fix that.	2021-03-06 20:29:55 +05:30
Billy Laws	6390561f0f	Fixup VFS error checking Many users of VFS didn't check for nullptr or 0 results leading to various potential issues, to mitigate this introduce error checking to VFS by default. The original variants can still be used through the *Unchecked family of functions.	2021-03-06 18:58:04 +05:30
Billy Laws	00d0586d1f	Correctly set size in CtrEncryptedBacking This bug caused crashes with the improved error checking.	2021-03-06 18:58:04 +05:30
Billy Laws	5f942e2dff	Split VFS implementations from API This allows better validation and simplified default argument handling. Could also be useful in the future when we switch to proper VFS error reporting.	2021-03-06 18:58:04 +05:30
Billy Laws	48acb6d369	Address a bunch of issues detected by clang-tidy	2021-03-06 18:58:04 +05:30
Billy Laws	9d5138bef1	Support Ioctl3 without the inline output buffer This is used by Project Diva for GET_VA_REGIONS	2021-03-05 23:54:32 +05:30
Billy Laws	c489da610e	Add locking to GPU VMM and fix a few codestyle issues As VMM can be accessed by nvdrv and the GPFIFO thread at the same time locking is needed to prevent races.	2021-03-05 23:54:32 +05:30
Billy Laws	c1aec00ed1	Rework GPFIFO pushing to optimise performance and accuracy * Pushbuffer data is now stored in a member buffer to avoid reallocating it for each pushbuffer which hampered performance before. * Don't prefetch pushbuffers as it puts unnecessary load on the guest thread that is better suited for the GPFIFO thread. * Clean up some misc code to avoid pointless casts of a 4 byte object and handle GPFIFO control opcodes.	2021-03-05 23:54:32 +05:30
Billy Laws	78cdb1eeb4	Add locking to nvhost-ctrl syncpoint events and sync with switchbrew NvHostEvents were renamed to SyncpointEvents which is a much clearer name that more accurately describes them. Locking is needed as IOCTLs can be called asynchronously and so event registration and signalling can race.	2021-03-05 23:54:32 +05:30
PixelyIon	c282276b74	Address CR Comments 2 (#132 ) + Fix Several Scheduler Bugs The following scheduler bugs were fixed: * It was assumed that all non-cooperative `Rotate` calls were from a preemptive yield and changed the state of `KThread::isPreempted` incorrectly which could lead to UB, an example of a scenario with it would be: * * Preemptive thread A gets a signal to yield from cooperative thread B due to it being ready to schedule and higher priority * * A complies with this request but there's an assumption that the signal was actually from it's preemption timer therefore it doesn't reset it (As it isn't required if the timer was responsible for the signal) * * A receives the actual preemption signal a while later, causing UB as the signal handler is invoked twice * `Scheduler::UpdatePriority` * * A check for `currentIt == core->queue.begin()` existed which caused an incorrect early return * * The preemption timer was armed correctly when a priority transition from cooperative priority -> preemption priority occurred but not disarmed when a transition from preemption priority -> cooperative priority occurred * * The timer was unnecessarily disarmed in the case of updating the priority of a non-running thread, this isn't as much a bug as it is just pointless * Priority inheritance in `KProcess::MutexLock` is fundamentally broken as it performs UB with `waitThread` being accessed prior to being assigned * When a thread sets its own priority using `SvcSetThreadCoreMask` and its current core is no longer in the affinity mask, it wouldn't actually move to the new thread until the next time the thread is load balanced	2021-03-05 14:55:34 +05:30
PixelyIon	fe5061a8e0	Address CR Comments (#132 ) + Change Core Migration API This addresses all CR comments including more codebase-wide changes arising from certain review comments like proper usage of its/it's and consistent contraction of it is into it's. An overhaul was made to the presentation and formatting of `KThread.h` and `LoadBalance` works has been superseded by `GetOptimalCoreForThread` which can be used alongside `InsertThread` or `MigrateToCore`. It makes the API far more atomic and neater. This was a major point of contention for the design prior, it's simplified some code and potentially improved performance.	2021-03-05 14:55:34 +05:30
PixelyIon	0ea02f2d56	Fix Non-Cooperative Core Migration + Fix `yieldWithCoreMigration` + Improve Mutex Locking in `ConditionalVariableWait` The case of a thread not being in the core queue during a non-cooperative core affinity change would break things as the thread was non-conditionally removed and inserted, this has been fixed by adding a check to see if the thread exists in the core's queue prior to migration. In addition, `yieldWithCoreMigration` was broken by the previous commit as the fallthrough was intentional and removing it cause core migration without a yield which led to breakage in certain circumstances. The mutex locking logic was also improved in `ConditionalVariableWait` to use atomics in a more effective manner with less atomic operations being performed overall.	2021-03-05 14:55:34 +05:30
PixelyIon	0233916489	Increase Code Region Size to 4GiB The code region's size was previously set at the same value as it is for 36-bit ASes, this value is inadequate for certain larger games and needed to be expanded. We've chosen 4GiB as the new value which should easily encompass all Switch games.	2021-03-05 14:55:34 +05:30
PixelyIon	11c5f50d37	Use likelihood attributes in NCE + Fix System Version + SVC improvements The SVCs improvements are as follows: * Make SVC logs more concise for: * * `SleepThread` * * `ClearEvent` * * `CloseHandle` * * `ResetSignal` * * `WaitSynchronization` (Special case for single handle) * * `ArbitrateLock` * * `ArbitrateUnlock` * * `WaitProcessWideKeyAtomic` * * `SignalProcessWideKey` * Fix unintentional fallthrough into `yieldWithoutCoreMigration` from `yieldWithCoreMigration` in `SleepThread` * Return `result::InvalidState` when an unsignalled handle is reset in `ResetSignal` * Return `Result{}` (Success) in `CancelSynchronization` * Do not return `result::InvalidCurrentMemory` in `ArbitrateLock` as it's not a failure condition * Make `count` in `WaitProcessWideKeyAtomic` a `i32` from a `u32`, zero and all negative values result in waking all waiters	2021-03-05 14:55:34 +05:30
PixelyIon	1884d98163	Implement Address Arbiter The entirety of the address arbiter is implemented in this commit, all three arbitration types: `WaitIfLessThan`, `DecrementAndWaitIfLessThan` and `WaitIfEqual`, and all three signal types: `Signal`, `SignalAndIncrementIfEqual` and `SignalAndModifyBasedOnWaitingThreadCountIfEqual` have been implemented. This allows any application which uses levent (Light Events) to function which includes titles such as ARMS.	2021-03-05 14:55:34 +05:30
PixelyIon	20bdda6a63	Support Core Migration for Running External Thread We did not support migration of threads which were running in a non-cooperative manner, this was partially due to the dependence on per-core conditional variables rather than per-thread which made this harder to do programmatically. This has been fixed by moving to per-thread cvars and therefore the limitation can be removed, this feature is used by Unity games.	2021-03-05 14:55:34 +05:30
◱ PixelyIon	861d7e9eb2	Fix SvcClearEvent resetting behaviour SvcClearEvent previously set the `signalled` flag directly rather than calling `ResetSignal`, which skipped the locking necessary to make it globally visible. Switch it to use `ResetSignal` to fix this.	2021-03-05 14:55:34 +05:30
◱ PixelyIon	a621408b9c	Move to using ASCII separators for Logger We've moved to using RS and GS from ASCII as delimiters rather than '\n' and '\|', this allows more robust parsing and increases the readability of the log files	2021-03-05 14:55:34 +05:30
◱ PixelyIon	31db70f1d4	Lock decryption in CtrEncryptedBacking This prevents a race where two threads could read at the same time and end up using the wrong IV leading to garbage data being read. This caused crashes in several games including Celeste.	2021-03-05 14:55:34 +05:30
Billy Laws	ce0e032255	Avoid constantly signalling audren DSP done event This was causing a significant amount of sched thrashing and pinning a core to 100% as games constantly updated audren, now change it to only signal on buffer release.	2021-03-05 14:55:34 +05:30

... 3 4 5 6 7 ...

739 Commits