Commit Graph

20682 Commits (c00272d80af16f19a04d72d30ee998933cdb4c1c)

Author SHA1 Message Date
Joe Groff c00272d80a cpu.x86.features: add popcnt? test 2010-05-15 13:26:14 -07:00
Joe Groff d5b7ec3348 cpu.x86.assembler: SETcc instructions 2010-05-15 13:14:27 -07:00
Joe Groff 986ed057d8 cpu.x86.assembler: BT family instructions 2010-05-15 13:08:22 -07:00
Joe Groff 010b5069bd opengl.textures: borrow get-texture-float and get-texture-int helper words from gpu.textures 2010-05-15 12:09:50 -07:00
Joe Groff 22e853ecb0 cuda: more API cleanups:
- remove useless with-cuda and with-cuda-program combinators
	- eliminate redundant cuda-device, cuda-context variables
	- rearrange arguments of with-*cuda-context to ( device flags quot -- )
	- don't pass context to with-cuda-context quot
	- add context-device word to ask for current device
2010-05-15 11:47:19 -07:00
Slava Pestov 5c0c87fcaa compiler.cfg.value-numbering.comparisons: ##test-imm rewrite rule must check that the immediate fits 2010-05-15 03:19:24 -04:00
Joe Groff 20305432a9 gpu.buffers: "with-mapped-buffer-array" helper word that wraps a mapped GL buffer in a specialized-array for easy inspection 2010-05-14 16:54:06 -07:00
Joe Groff e32e0f1423 cuda.gl: add free-resource word to release interop handles 2010-05-14 16:17:03 -07:00
Joe Groff f9a8e90c41 cuda: add "cuda.gl" vocab with words for cuda/opengl/gpu interop 2010-05-14 16:00:27 -07:00
Joe Groff afdced0183 cuda: in with-cuda-context, sync context when cleaning up so that destroying context doesn't fail due to asynchronous errors 2010-05-14 16:00:27 -07:00
Joe Groff 51c3e8a1b9 gpu.render: allow uchar-array, ushort-array, and uint-array to be passed directly to render as element arrays 2010-05-14 16:00:27 -07:00
Slava Pestov ad7636045f compiler: fix bad unit test 2010-05-14 18:37:11 -04:00
Slava Pestov 62eadda2aa cpu.x86.bootstrap: use TEST instruction 2010-05-14 18:37:10 -04:00
Slava Pestov 5ecd2b61c7 compiler.cfg: more silly optimizations 2010-05-14 18:37:09 -04:00
Slava Pestov 1a61c50896 compiler.cfg.instructions: change vreg-insn from a mixin into a superclass 2010-05-14 18:37:09 -04:00
Slava Pestov 6d30bb8bf4 compiler.cfg.linear-scan: pointless optimizations 2010-05-14 18:37:08 -04:00
Slava Pestov 291543c300 compiler.cfg.ssa.live-ranges: clean up 2010-05-14 18:37:08 -04:00
Slava Pestov 2a137bed0e bootstrap.compiler.timing: small update 2010-05-14 18:37:07 -04:00
Slava Pestov d1e422c981 compiler.cfg.alias-analysis: simplify and speed up 2010-05-14 18:37:07 -04:00
Slava Pestov 5a67711bfd compiler.cfg: use x86 TEST instruction to optimize 'bitand 0 =' 2010-05-14 18:37:06 -04:00
Joe Groff 22625469bc sequences.cords: specialize vshuffle-elements to use vshuffle2-elements on cord components 2010-05-14 13:29:37 -07:00
Joe Groff 8f1751cd54 math.vectors.simd.intrinsics: fix scalar fallback for (simd-vshuffle2-elements) 2010-05-14 02:47:39 -07:00
Joe Groff 2744816209 math.vectors, math.vectors.simd: add user-facing vshuffle2 word 2010-05-14 02:47:05 -07:00
Joe Groff 465a1673eb compiler.cfg.intrinsics.simd: mod shuffle indices for shuffle-2-vectors-imm intrinsic so they wrap like a real instruction would 2010-05-14 01:20:05 -07:00
Joe Groff 9f9e145819 math.vectors.simd.intrinsics: (simd-vshuffle2-elements) intrinsic that creates a vector by selecting elements from two input vectors. use ##shuffle-vector-halves-imm to implement for double-2s with SSE 2010-05-14 01:16:29 -07:00
Joe Groff 3766abd65e compiler.cfg.instructions: ##shuffle-vector-halves-imm insn to map to SSE's two-input SHUFPS/SHUFPD 2010-05-14 00:20:21 -07:00
Slava Pestov fe30355ec7 math.vectors.simd.cords: implement new-sequence and like methods on cords to make cross product work 2010-05-13 21:55:19 -04:00
Slava Pestov 4a98278720 cpu.x86.32: check for SSE2 on bootstrap since Factor now requires it 2010-05-13 18:25:46 -04:00
Joe Groff eb89311bcd Merge branch 'master' of factorcode.org:/git/factor 2010-05-13 14:41:40 -07:00
Joe Groff 1196087991 math: declare quotation stack effects on each-integer, times, find-integer, all-integers? 2010-05-13 14:30:19 -07:00
Slava Pestov bb87d124c1 compiler: small fixes and cleanups 2010-05-13 02:48:20 -04:00
Joe Groff 8b9f33e040 x11: convert XSupportsLocale return value to factor bool before testing (bug reported by ceninan) 2010-05-12 18:07:33 -07:00
Slava Pestov db2db6a1a6 compiler.cfg.builder.alien: fix for x86-32 2010-05-12 18:07:11 -04:00
Joe Groff 4eb55a5c3b Merge branch 'master' of factorcode.org:/git/factor 2010-05-12 13:49:02 -07:00
Joe Groff 860e871fe6 new cuda.types vocab containing CUDA vector types (int2, float4, etc.) with CUDA alignment 2010-05-12 13:48:44 -07:00
Slava Pestov 42b0d456cd Refactor x86-32 stack cleanup logic 2010-05-12 02:09:11 -04:00
Slava Pestov f89b85db7b Temporary fixes for x86-32 until FFI boxing is rewritten 2010-05-12 01:41:49 -04:00
Slava Pestov c2558e6a66 vm: fix longlong accessors 2010-05-11 23:24:48 -04:00
Slava Pestov 0cde5c8fb5 Eliminate compiler.alien 2010-05-11 23:24:47 -04:00
Slava Pestov 1c76c87c5c cpu.x86.64: fix typo 2010-05-11 22:29:46 -04:00
Slava Pestov ee0640f176 Move flatten-c-type to death row so that it can be executed in part 5 2010-05-11 22:26:31 -04:00
Slava Pestov 7d62376e24 vm: fix compile error 2010-05-11 19:29:50 -04:00
Slava Pestov eb802208d1 FFI rewrite part 4: parameter and return value unboxing redesign 2010-05-11 19:11:44 -04:00
Slava Pestov d1e2554ebf cuda.utils: fix load error 2010-05-10 20:50:28 -04:00
Slava Pestov 423f9c607a cpu.ppc: fixes 2010-05-10 17:50:10 -05:00
Joe Groff cc2cee1ae3 Merge branch 'master' of factorcode.org:/git/factor 2010-05-10 15:07:00 -07:00
Joe Groff 6d30ce485c cuda.utils: add "sync-context" word 2010-05-10 15:06:15 -07:00
Slava Pestov 9321f9378a cpu.ppc: update for recent changes -- untested 2010-05-10 02:21:23 -04:00
Slava Pestov 45689dbfe6 compiler: simplify ##unbox-small/large-struct by emitting an ##unbox-any-c-ptr first 2010-05-10 01:38:34 -04:00
Slava Pestov 67b82a2509 sequences: remove unused 2reverse-each word 2010-05-10 00:46:29 -04:00