virglrenderer

Commit Graph

Author	SHA1	Message	Date
Dave Airlie	083d97fff5	renderer: add shader_storage_buffer_object support. (v4) This pulls the code out from the gles31 development, and modifies the caps to support two different limits (so far I've only found fs/cs vs everyone else limits differ) v2: fix buffer creation paths, limit maximums, handle indirect (don't pass -1 into gl funcs when we don't need to). v3: free ssbo locs v4: use two caps fields Co-authors: Gurchetan Singh <gurchetansingh@chromium.org> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	6 years ago
Gert Wollny	2846dcf565	vrend: If available use glCopyImageSubData to execute memcopy like blits When the host is gles >= 3.2, gl >= 4.3, or when the extension GL_(ARB\|EXT\|OES)_copy_image is available, memcopy like blitting and region copying can be done for many color format combinations by using glCopyImageSubData. This fixes a number of tests from the subset dEQP-GLES31.functional.copy_image.non_compressed.viewclass_* v2: - Clean list of canonical formats (Gurchetan Singh) - Use size of canonical formats to decide whether they can be copied via gCopyImageSubData - Also honour the render state when deciding whether glCopyImageSubData will do, or whether we need to do a blit. v3: - replace format size check by compatibility check (Gurchetan Singh) but keep the check seperate because we need to add logic for compressed texture later Reviewed-by: Gurchetan Singh <gurchetansingh at chromium.org> Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Jakob Bornecrantz <jakob@collabora.com>	6 years ago
Dave Airlie	29853b7456	vrend_decode: use uints for sampler view decode The protocol will never send negative numbers, so use uints to avoid having to compare to 0 and other warnings. Reviewed-by: Po-Hsien Wang <pwang@chromium.org>	6 years ago
Erik Faye-Lund	d24ac12d7b	implement VIRGL_CCMD_SET_MIN_SAMPLES This is required to implement glMinSampleShading(). Sadly, we've been setting has_sample_shading for a while, even though this is needed. So we need to set a capability so mesa will know that it's safe to emit this command. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	6 years ago
Gert Wollny	c0e0274e8c	virgl: Add method to query supported MSAA samples and positions Query the number of supported samples and the sample position and store these to the caps.v2 structure. We support only up to 16 samples. This implementation requires a GL host backend. v2: - glTexImage2Dmultisample is not available on a gles 3.1 host and trying to call it crashed qemu (Jakob Bornecrantz) Use glTexStorage2DMultisample instead and delete texture each round because the texture becomes immutable. - move call to get sample positions only when caps v2 needs to be filled. v3: - rebase against master - take care of nits (Dave) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	6 years ago
Gert Wollny	04f2080d89	vrend: store offsets into backing ivo for mipmaps and use it when reading data back (v3) In the copy fallback, when a texture can not be rendered, the data that resides in the backing iovec needs to be used. For the non-zero levels of mip-map textures the data is located at an offset. This patch adds storing this offset and using it when data is read from the backing iovec and updating the dst iov. We limit the mip-map levels for which this is done to 1-17, which is enough to cover 32kx32k textures. The patch also fixes the stride when accessing mip-map levels. Fixes: dEQP-GLES3.functional.texture.specification.teximage3d_depth.depth_component24_2d_array dEQP-GLES3.functional.texture.specification.texsubimage3d_depth.depth_component32f_2d_array dEQP-GLES3.functional.texture.specification.texsubimage3d_depth.depth_component24_2d_array dEQP-GLES3.functional.texture.specification.texsubimage3d_depth.depth_component16_2d_array dEQP-GLES3.functional.texture.specification.texsubimage3d_depth.depth32f_stencil8_2d_array dEQP-GLES3.functional.texture.specification.texsubimage3d_depth.depth24_stencil8_2d_array v2: * rebase and remove unused variables * also correct offset when writing to the destination backing iovec v3: * follow mesa/virgl notation and range for storing the mip-map offsets Suggested-by: Gurchetan Singh <gurchetansingh@chromium.org> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org> Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Jakob Bornecrantz <jakob@collabora.com>	6 years ago
Gert Wollny	50d7d36733	vrend_renderer.h: include epoxy/gl.h because it is actually needed here (v2) v2: With epoxy GL/gl.h is not directly included (Dave Airlie). Instead move the include of epoxy/gl.h from vrend_renderer.c to vrend_renderer.h Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	6 years ago
David Riley	77a42bc8f0	virglrenderer: Add method to import EGLImageKHRs as resources. Allow resources created externally (eg gbm created buffers as dma bufs) to be used. As an example, crosvm (https://chromium.googlesource.com/chromiumos/platform/crosvm/) will intercept resource creation to use minigbm to allocate buffers that its compositor is able to properly handle since it only supports compositing with buffers allocated via minigbm. This patch allows direct rendering to those buffers without requiring an extra copy. v2: Handle missing extension better. v3: Update commit message with more details on usage. Signed-off-by: David Riley <davidriley@chromium.org> Reviewed-by: Dave Airlie <airlied@redhat.com>	6 years ago
Dave Airlie	05554838b0	tessellation: add protocol support for set tess state. (v2) This passes the default tessellation factors from the guest to the host. v2: fix warnings Tested-by: Elie Tournier <elie.tournier@collabora.com> Reviewed-by: Elie Tournier <elie.tournier@collabora.com> Tested-by: Jakob Bornecrantz <jakob@collabora.com>	6 years ago
Tomeu Vizoso	b27fa082de	vrend: Add support for PIPE_FORMAT_A4B4G4R4_UNORM and make sure that on GLES it it chosen for GL_RGBA4 over VIRGL_FORMAT_B4G4R4A4_UNORM, by removing support for the latter. This is needed because on GLES3 GL_BGRA isn't a supported format to pass to glTexImage3D. Fixes the test dEQP-GLES3.functional.texture.format.sized.3d.rgba4_pot on GLES hosts. v2: * Make more explicit the GL/GLES split (Gert Wollny) Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Suggested-by: Jakob Bornecrantz <jakob@collabora.com> Reviewed-By: Gert Wollny <gert.wollny@collabora.com> Tested-by: Jakob Bornecrantz <jakob@collabora.com>	7 years ago
Dave Airlie	792b99d9db	vrend: make texture buffer objects more robust This allows the tbo code to properly detect if we are using a buffer as a texture or not, instead of relying on GL_TEXTURE_BUFFER being used. We also don't need to special case generate the tbo texture id until sampler bind time. Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	7 years ago
Alexandros Frantzis	732e68e48d	vrend: Use source swizzle when blitting with GL Use the source format swizzle information to set the GL_TEXTURE_SWIZZLE_* parameters for the GL blit operation. This also removes the need for the emulated alpha special case, since when using emulated alpha the source format already has proper swizzle information. Signed-off-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Signed-off-by: Jakob Bornecrantz <jakob@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org> Tested-by: Gurchetan Singh <gurchetansingh@chromium.org>	7 years ago
Alexandros Frantzis	84f1a4c8ce	vrend: Add swizzle to format descriptions Explicitly describe the swizzle of all supported formats in the format table. In this commit all format swizzles are set to NO_SWIZZLE, but future commits will update some format/swizzle combinations to improve support for the corresponding virgl formats. Signed-off-by: Alexandros Frantzis <alexandros.frantzis@collabora.com> Signed-off-by: Jakob Bornecrantz <jakob@collabora.com> Reviewed-by: Gurchetan Singh <gurchetansingh@chromium.org> Tested-by: Gurchetan Singh <gurchetansingh@chromium.org>	7 years ago
Dave Airlie	12f25462c2	virglrenderer: introduce a second capability set to workaround bugs in first. This introduces a second capability set exposing a larger struct size. The kernel ioctl has some bugs that necessitated this change.	7 years ago
Gurchetan Singh	fa835b0f88	vrend: don't hardcode context version Currently, we always try to create an OpenGL 3.1 context. Some dEQP tests require an OpenGL 3.2 context (specifically, ones that use glGetInteger64v). Let's try to create the highest version context we can, and iterate to lower versions, i.e: https://developer.android.com/guide/topics/graphics/opengl.html#version-check The return code for (*create_gl_context) is a little unclear. This patch assumes NULL is returned on failure. This should work for GLX and EGL. GLX: "On failure glXCreateContextAttribsARB returns NULL and generates an X error with extended error information" https://www.khronos.org/registry/OpenGL/extensions/ARB/GLX_ARB_create_context.txt EGL: "#define EGL_NO_CONTEXT ((EGLContext)0)" https://www.khronos.org/registry/EGL/api/1.1/EGL/egl.h The semantics of rcbs->create_gl_context may be different, though. Fixes: dEQP-GLES3.functional.state_query.integers.max_vertex_output_components_getinteger64 dEQP-GLES3.functional.state_query.integers.max_vertex_output_components_getfloat Signed-off-by: Dave Airlie <airlied@redhat.com>	7 years ago
Jakob Bornecrantz	2447355893	vrend: Support BGRX & BGRA formats v2 These two formats are required by DRI in the guest and as such Wayland, X11, GBM or any API built on top if DRI. The format GL_BGRA_EXT is not supported on Desktop OpenGL. v2: Better documentation. Signed-off-by: Jakob Bornecrantz <jakob.bornecrantz@collabora.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	7 years ago
Dave Airlie	9485f7ed36	vrend: add support for indirect draws. These are needed for ARB_draw_indirect and GL4.0 This enables support and turns in the cap when support is present. This also enhances the draw packets to cover future features, it doesn't enable or show these yet, since other work is required in the shaders. Signed-off-by: Dave Airlie <airlied@redhat.com>	7 years ago
Li Qiang	48f67f6096	renderer: fix NULL pointer deref in vrend_clear In vrend clear dispatch function, the 'buffers' is read from guest. A malicious guest can specify a bad 'buffers' to make a the function call util_format_is_pure_uint() even the 'ctx->sub->surf[i]' is NULL. This can cause a NULL pointer deref. Make a sanity check to avoid this. [airlied: use a define] Signed-off-by: Li Qiang <liq3ea@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	8 years ago
Dave Airlie	20916f13f6	vrend: fix text in stellarium. If we are blitting to an emulated alpha we should rewrite inside the shader, not in the incoming texture swizzle.	9 years ago
Marc-André Lureau	aa48af986c	Fix void* casting warnings Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	9 years ago
Dave Airlie	7b776ec866	renderer: remove double declaration of function.	9 years ago
Marc-André Lureau	ec24bd2211	decode: fix set_scissor_state bounds Do not accept negative values for num & start. Fix found thanks to american fuzzy lop. Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>	9 years ago
Marc-André Lureau	bfa6cd741d	renderer: prevent out of bound vps access Fix found thanks to american fuzzy lop. Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>	9 years ago
Marc-André Lureau	3767dbf18c	renderer: use a uint32_t for shader type That way an value if (type > PIPE_SHADER_GEOMETRY) guard will actually work for all values. Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>	9 years ago
Marc-André Lureau	89aea798b6	renderer: use a thread to block for fences. Instead of polling the fences regularly, have a thread that blocks for a single fence using a separate shared context, then uses eventfd to wake up the main thread when something happens. Inside the guest, glmark2 typicially runs twice as fast with the thread sync. Although in general, the performances seems to be about +30%. The benefits is mostly for CPU-bounds tasks (when main the thread hits 100%) A naive perf stat of the vtest renderer with glmark2 "build" test with a fixed number of frames (500) results in the following stats data: (do not value timing related informations, since the renderer is ran and stopped manually) without thread: 3032.282265 task-clock (msec) # 0.420 CPUs utilized 4,277 context-switches # 0.001 M/sec 102 cpu-migrations # 0.034 K/sec 9,020 page-faults # 0.003 M/sec 7,884,098,254 cycles # 2.600 GHz 4,440,126,451 stalled-cycles-frontend # 56.32% frontend cycles idle <not supported> stalled-cycles-backend 11,024,091,578 instructions # 1.40 insns per cycle # 0.40 stalled # cycles per insn 1,091,831,588 branches # 360.069 M/sec 5,426,846 branch-misses # 0.50% of all branches with thread: 3403.592921 task-clock (msec) # 0.452 CPUs utilized 7,145 context-switches # 0.002 M/sec 410 cpu-migrations # 0.120 K/sec 6,191 page-faults # 0.002 M/sec 7,475,038,064 cycles # 2.196 GHz 4,487,043,071 stalled-cycles-frontend # 60.03% frontend cycles idle <not supported> stalled-cycles-backend 9,925,205,494 instructions # 1.33 insns per cycle # 0.45 stalled # cycles per insn 834,375,503 branches # 245.146 M/sec 4,919,995 branch-misses # 0.59% of all branches Signed-off-by: Marc-André Lureau <marcandre.lureau@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	9 years ago
Marc-André Lureau	0738199d41	renderer: return an int in vrend_renderer_init This is an internal API.	9 years ago
Dave Airlie	85602b31bd	renderer: fix regression in shader binding made a mistake in the shader binding code, not good, time for brown paper bag.	9 years ago
Dave Airlie	92b00c978b	renderer: CLEANUP - remove TABs I didn't do a good enough job last time at purging these.	9 years ago
Dave Airlie	81b741a05c	virgl/shaders: handle large shaders. the protocol failed to handle larger shaders, this allow the renderer to reassemble large shaders and recombined the chunks before passing them to the GLSL translation. This also enhances the renderer protocol to allow for some more info in the shader object, and removes the separate vs/gs/fs variants in favour of a type field in the shader.	9 years ago
Dave Airlie	fd8116476b	virgl: add query index to top 16-bits of query type. This is an ABI valid change, we won't get passed indices unless we advertise later GLSL versions.	9 years ago
Dave Airlie	4385520930	renderer: fix compressed transfer gets This code ended up in the other file and really wasn't necessary there. Remove the transfer code from virglrenderer.c, move into main renderer file, and match it with the corresponding transfer reader. This should at least fix the crash in compressed textures with ARB_get_texture_sub_image	9 years ago
Dave Airlie	16990471e7	renderer: misc cleanups move some static decls around, drop useless struct	10 years ago
Dave Airlie	38f01a3daf	renderer: reorder some structs to remove holes	10 years ago
Dave Airlie	7e85c2f114	renderer: CLEANUP: whitespace and reindent this uses the mesa coding style, pray I never have to do this again. strip all trailing whitespace as much as possible	10 years ago
Dave Airlie	ca23e98b22	renderer: CLEANUP: boolean/GLboolean -> bool use stdbool.h as much as possible some of the gallium code imported uses boolean so leave it alone for now	10 years ago
Dave Airlie	9259bc768e	renderer: handle transform feedback 2 and 3 extensions This fixes a number of issues with how transform feedback works it does requires ARB_transform_feedback3 to work at all, but hopefully this extension is widespread enough, if not we can revisit later. It uses transform feedback objects to store the stream out state.	10 years ago
Dave Airlie	b1109ee664	renderer: move program id into context no point in having this in global state anymore	10 years ago
Dave Airlie	c2d68e9995	renderer: move some global state into contexts. This stuff isn't required since the renderer uses contexts now	10 years ago
Dave Airlie	3f4e3e1cad	renderer: add support for ARB_viewport_array	10 years ago
Dave Airlie	0ff22a06b7	renderer: ABI break: overhaul viewport/scissor state add support for multiple viewports, and reduce viewport size.	10 years ago
Dave Airlie	e13ebc57e1	renderer: move away from pipe bind flag definitions.	10 years ago
Marc-André Lureau	cd69deebad	Remove INLINE, use inline instead Similar to MESA recent change	10 years ago
Dave Airlie	aea7785887	renderer: handle resource inline writes using common code inline resource writes should use common code for transfers	10 years ago
Dave Airlie	cccbc3b5e4	renderer: return values from submit_cmd/decode block makes easier to write unit tests.	10 years ago
Dave Airlie	2258a66f61	renderer: drop ctx query handling code also drop active_hw flag	10 years ago
Dave Airlie	83d7fbb0d7	renderer: overhaul transfer code a bit This merges the error/bounds checking on the transfer code, but keeps the same API, it also uses a struct to pass through the transfer info. this also passes a return value out to make testing easier.	10 years ago
Dave Airlie	3d91ff730e	renderer: cleanup context create errors and context destroy	10 years ago
Dave Airlie	96e1b0b693	renderer: reset decode contexts in two stages reset the non-0 contexts first, then kill resource table, then nuke the 0 context.	10 years ago
Dave Airlie	aef2ec2e78	renderer: cleanup dangling resources. if a context goes away we should cleanup any resources left in its resource hash table.	10 years ago
Dave Airlie	03e3116a75	import latest renderer code	10 years ago

1 2

100 Commits (e3683f0fbd5a58afb5e6b5b7068e3354487d0242)