~airlied/mesa - Dave's mesa hackery

Age	Commit message (Collapse)	Author	Files	Lines
2016-04-21	i965/surface_state: Use libisl functions for image format lowering	Jason Ekstrand	3	-120/+12
	This lets us delete some redundant code and keep all of the image_load_store format lowering logic in one place: libisl. Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	i965/fs_surface_builder: Use isl instead of mesa for format info	Jason Ekstrand	1	-66/+52
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	i965/fs_surface_builder: Add a helper for converting GL to ISL formats	Jason Ekstrand	1	-0/+55
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	i965/fs_surface_builder: Explicitly handle FORMAT_NONE in num_image_coordinates	Jason Ekstrand	1	-0/+1
	Previously, we were relying on has_matching_typed_format returning true for MESA_FORMAT_NONE which, in turn, relied on _mesa_get_format_bytes returning 1 for MESA_FORMAT_NONE. When we switch to ISL, this behaviour will no longer be something we can rely on. Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	i965/fs_surface_builder: Take a GL format enum instead of mesa_format	Jason Ekstrand	3	-9/+10
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl/format: Add a get_num_channels helper	Jason Ekstrand	2	-0/+17
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl/format: Add more isl_format_has_type_channel functions	Jason Ekstrand	2	-4/+43
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl/format: Break the guts of has_[us]int_channel into a helper	Jason Ekstrand	1	-18/+16
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	anv/image: Use the has_matching_typed_storage_image_format helper from isl	Jason Ekstrand	1	-12/+3
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl: Add a helper for determining when a typed load/store can be used	Jason Ekstrand	2	-0/+20
	Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl: Take a devinfo in lower_storage_image_format instead of an isl_device	Jason Ekstrand	4	-18/+19
	We want to call this function from the shader compiler and having a full isl_device available at that point isn't practical. Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl: Don't use designated initializers in the header	Jason Ekstrand	1	-13/+24
	C++ doesn't support designated initializers and g++ in particular doesn't handle them when the struct gets complicated, i.e. has a union. Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	isl: Include c99_compat.h	Jason Ekstrand	1	-0/+1
	We need the restrict keyword in isl.h Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	i965: Add a dependency on libisl	Jason Ekstrand	4	-10/+17
	To avoid build issues, ensure that you're running `make' at the top level and/or you've executed `make clean' beforehand. Reviewed-by: Chad Versace <chad.versace@intel.com>
2016-04-21	radeon: handle query buffer allocation and mapping failures	Nicolai Hähnle	2	-9/+33
	Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94984 Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2016-04-21	radeon: wire end_query return value to sw/hw_end	Nicolai Hähnle	2	-7/+10
	Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2016-04-21	st/mesa: check return value of begin/end_query	Nicolai Hähnle	1	-22/+33
	They can only indicate out of memory conditions, since the other error conditions are caught earlier. v2: fix error message in EndQuery Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
2016-04-21	gallium: add bool return to pipe_context::end_query	Nicolai Hähnle	19	-26/+49
	Even when begin_query succeeds, there can still be failures in query handling. For example for radeon, additional buffers may have to be allocated when queries span multiple command buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>
2016-04-21	i965: Always use Y-tiled buffers on SKL+	Ben Widawsky	4	-8/+30
	Starting with Skylake, the display engine is capable of scanning out from Y-tiled buffers. As such, we can and should use Y-tiling for better efficiency. This also has the added benefit of being able to fast clear the winsys buffer. Note that the buffer allocation done for mipmaps will already never allocate an X-tiled buffer for GEN9. This has an almost universal positive impact on benchmarks, some improving by as much as 20%. Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
2016-04-22	softpipe: fix a warning due to an incorrect enum comparison	Marek Olšák	1	-1/+1
	no change in behavior, because both are defined the same Acked-by: Jose Fonseca <jfonseca@vmware.com>
2016-04-22	gallium: remove helpers converting to/from TGSI_PROCESSOR_*	Marek Olšák	4	-37/+3
	Acked-by: Jose Fonseca <jfonseca@vmware.com>
2016-04-22	gallium: use PIPE_SHADER_* everywhere, remove TGSI_PROCESSOR_*	Marek Olšák	48	-369/+360
	Acked-by: Jose Fonseca <jfonseca@vmware.com>
2016-04-22	gallium: merge PIPE_SWIZZLE_* and UTIL_FORMAT_SWIZZLE_*	Marek Olšák	63	-499/+489
	Use PIPE_SWIZZLE_* everywhere. Use X/Y/Z/W/0/1 instead of RED, GREEN, BLUE, ALPHA, ZERO, ONE. The new enum is called pipe_swizzle. Acked-by: Jose Fonseca <jfonseca@vmware.com>
2016-04-22	gallium: use enums in p_shader_tokens.h (v2)	Marek Olšák	1	-139/+164
	Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> (v1) Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) Acked-by: Jose Fonseca <jfonseca@vmware.com> (v1) v2: name enums
2016-04-22	gallium: use enums in p_defines.h (v2)	Marek Olšák	1	-173/+205
	and remove number assignments which are consecutive Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> (v1) Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) Acked-by: Jose Fonseca <jfonseca@vmware.com> (v1) v2: name enums
2016-04-22	radeonsi: remove the shader parameter from si_set_ring_buffer	Marek Olšák	3	-15/+11
	not used anymore this is a follow-up to the RW buffer cleanup. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
2016-04-22	radeonsi: decrease GS copy shader user SGPRs to 2	Marek Olšák	2	-3/+3
	const buffers are no longer used since the clip plane const buffer was moved to RW buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: shorten slot masks to 32 bits	Marek Olšák	4	-63/+61
	Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: clean up shader resource limit definitions	Marek Olšák	3	-23/+12
	Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: move default tess level constant buffer to RW buffers	Marek Olšák	5	-10/+35
	Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: move sample positions constant buffer to RW buffers	Marek Olšák	3	-4/+5
	Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: move clip plane constant buffer to RW buffers	Marek Olšák	4	-14/+12
	Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: rework polygon stippling to use constant buffer instead of texture	Marek Olšák	6	-101/+55
	add it to the RW_BUFFERS descriptor array now the slot masks don't have to have 64 bits Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: generalize si_set_constant_buffer	Marek Olšák	1	-10/+17
	this will be used in the next commit Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: make RW buffer descriptor array global, not per shader stage	Marek Olšák	2	-51/+43
	v2: also simplify invalidation of RW buffer bindings (squashed) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-22	radeonsi: rename and rearrange RW buffer slots	Marek Olšák	4	-30/+39
	- use an enum - use a unique slot number regardless of the shader stage (the per-stage slots will go away for RW buffers) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>
2016-04-21	gallivm: fix bogus argument order to lp_build_sample_mipmap function	Roland Scheidegger	1	-2/+2
	Screwed up since 0753b135f6e83b171d8a1b08aea967374f3542bc. (Only an issue with different min/mag filters, and then only in some cases, which is probably why it went unnoticed for quite a while. The effect should have simply been nearest mip filter instead of linear, iff min was nearest, mag was linear, and all pixels hit the mignifying path.) Fixes a bunch of dEQP failures. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org>
2016-04-21	i965: Fix clear code for ignoring colormask for XRGB formats on Gen9+.	Kenneth Graunke	1	-0/+1
	In commit cda886a4851ab767fba40e8474d6fa8190347e4f, Neil made us stop advertising RGBX formats on Gen9+, as the hardware apparently no longer has working fast clear support for those formats. Instead, we just fall back to RGBA formats, and use SCS to override alpha to 1.0. This is fine, but had one unintended side effect: it made us fall back to slow clears when the color mask disables alpha. Normally, we ignore the color mask for non-existent channels. This includes alpha for XRGB formats as writing garbage to the X channel is harmless. But, now that we use RGBA, we think there's a real alpha channel, and can't do the optimization. To hack around this, check if _BaseFormat is GL_RGB and ignore alpha. Improves WebGL Aquarium performance on Skylake GT3e by about 50% by letting it use repclears instead of slow clears. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
2016-04-21	i965/blorp: Improve precission of blitting coordinates when clipping	Iago Toral Quiroga	1	-61/+163
	We do this in two steps: first we clip the dst rect and adjust the src rect accordingly. Then we do it the other way around. In both passes the adjustment part involves multiplying by a scale factor that can lead to a small precision loss. This is breaking a few dEQP tests. Specifically, the problem happens when we need to clip the same coordinate twice. For example, if srcX0 and dstX0 need both to be clipped we want to avoid the situation where we clip srcX0 first, then adjust dstX0 accordingly but then we realize that the resulting dstX0 still needs to be clipped, so we clip dstX0 and adjust srcX0 again. Each of these two passes can lead to precission loss. What we want to do here is detect the rect that leads to the largest clip (accounting for the scale factor involved), clip that rect and adjust the other one. With this we ensure that the adjusted coordinate does not need to be clipped again and we can skip a second pass, improving precision. Fixes the following 4 dEQP tests: dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_src_x_nearest dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_src_x_linear dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_dst_x_nearest dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_dst_x_linear Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Mark Janes <mark.a.janes@intel.com>
2016-04-21	radeonsi: Add config parameter to si_shader_apply_scratch_relocs.	Bas Nieuwenhuizen	4	-3/+5
	shader->config is not updated for compute kernels. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
2016-04-21	glsl: Relax GLSL 1.10 float suffix error to a warning.	Matt Turner	1	-2/+2
	Float suffixes are allowed in all subsequent GLSL specifications, and it's obvious what the user meant if they specify one. Accept it with a warning to avoid breaking applications, like Planeshift (although it looks like between 0.6.1 and 0.6.3 they might have removed the suffixes from their shaders). Reviewed-by: Lars Hamre <chemecse@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
2016-04-21	i965/fs: Readd opt_drop_redundant_mov_to_flags().	Matt Turner	2	-0/+41
	This reverts commit b449366587b5f3f64c6fb45fe22c39e4bc8a4309. I removed the pass thinking that it was now not useful, but that was not true. I believe I ran shader-db on HSW and saw no results, but HSW does not use the unlit centroid workaround code and as a result does not emit redundant MOV_DISPATCH_TO_FLAGS instructions. On IVB, the shader-db results are: total instructions in shared programs: 6650806 -> 6646303 (-0.07%) instructions in affected programs: 106893 -> 102390 (-4.21%) helped: 793 total cycles in shared programs: 56195538 -> 56103720 (-0.16%) cycles in affected programs: 873048 -> 781230 (-10.52%) helped: 553 HURT: 209 On SNB, the shader-db results are: total instructions in shared programs: 7173074 -> 7168541 (-0.06%) instructions in affected programs: 119757 -> 115224 (-3.79%) helped: 799 total cycles in shared programs: 98128032 -> 98072938 (-0.06%) cycles in affected programs: 1437104 -> 1382010 (-3.83%) helped: 454 HURT: 237 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
2016-04-21	i965/blorp: Do not emit pma stall on gen9+	Topi Pohjolainen	1	-1/+3
	This was left out from the original gen8 upload introduction. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
2016-04-21	swr: add PIPE_CAP_FRAMEBUFFER_NO_ATTACHMENT to get_param	Tim Rowley	1	-0/+1
	Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>
2016-04-21	i965: automake: remove gratuitous "+" during variable assignment	Emil Velikov	1	-2/+2
	There is not initial assignment, thus appending to it does not work. Fixes: b27c85c4c08 "i965: add build rule for brw_nir_trig_workarounds.c" Signed-off-by: Emil Velikov <emil.velikov@collabora.com>
2016-04-21	gbm: add GBM_FORMAT_XBGR8888 format support	Rob Herring	1	-0/+7
	Add GBM_FORMAT_XBGR8888/__DRI_IMAGE_FORMAT_XBGR8888 format support which is needed for Android. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>
2016-04-21	st/dri: add 32-bit RGBX/RGBA formats	Rob Herring	2	-0/+10
	Add support for 32-bit RGBX/RGBA formats which are preferred for Android. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>
2016-04-21	dri/common: add MESA_FORMAT_R8G8B8{A8, X8}_UNORM formats as supported configs	Rob Herring	1	-0/+10
	Add MESA_FORMAT_R8G8B8A8_UNORM and MESA_FORMAT_R8G8B8X8_UNORM formats as these are the preferred formats for Android. Signed-off-by: Rob Herring <robh@kernel.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Anholt <eric@anholt.net>
2016-04-21	i965: add build rule for brw_nir_trig_workarounds.c on Android	Rob Herring	4	-2/+49
	Commit bfd17c76c126 ("i965: Port INTEL_PRECISE_TRIG=1 to NIR.") added a generated file brw_nir_trig_workarounds.c which broke the Android build. Add the necessary makefiles to the Android build. Cc: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Rob Herring <robh@kernel.org> Tested-by: Chih-Wei Huang <cwhuang@linux.org.tw> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>
2016-04-21	glsl: android: add back missing generated glcpp include path	Rob Herring	1	-0/+1
	Commit 4db8f15a2576 ("glsl: move the android build scripts a level up") dropped a generated include path for glcpp. Add it back adjusting for the new location. Signed-off-by: Rob Herring <robh@kernel.org> Tested-by: Chih-Wei Huang <cwhuang@linux.org.tw> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>