Joe Groff
|
104c29aabc
|
simd intrinsic implementation for v*high, v*hs+, vavg, and vsad
|
2009-12-05 14:52:18 -08:00 |
Joe Groff
|
75d04922b9
|
Merge branch 'master' into simd-cleanup
Conflicts:
basis/math/vectors/simd/functor/functor.factor
|
2009-11-05 11:27:08 -06:00 |
Joe Groff
|
f6643a1c72
|
change ##horizontal-add-vector insn to better match what the HADD SSE instructions do (add adjacent pairs, pack results)
|
2009-11-04 12:18:01 -06:00 |
Slava Pestov
|
51e9a891a8
|
cpu.x86: update %box-displaced-alien for introduction of address field
|
2009-11-03 03:17:43 -06:00 |
Slava Pestov
|
e4ad642134
|
vm: 4 bit tags, new representation of alien objects makes unbox-any-c-ptr more efficient (work in progress)
|
2009-11-02 04:25:54 -06:00 |
Joe Groff
|
b858860a67
|
add ##shl-vector-imm and ##shr-vector-imm insn variants. use merge/shr instead of compare/merge to do signed unpacks
|
2009-10-30 00:41:19 -05:00 |
Doug Coleman
|
bd13e018dd
|
memq? -> member-eq?, sorted-memq? -> sorted-member-eq?
|
2009-10-28 15:02:00 -05:00 |
Slava Pestov
|
1ce39963fd
|
Working on adding support for the new write barrier to optimized code
|
2009-10-14 02:06:01 -05:00 |
Joe Groff
|
3bc097f6ff
|
rename ##shuffle-vector to ##shuffle-vector-imm, and add a new ##shuffle-vector for dynamic shuffles. have vshuffle use ##shuffle-vector to do word and byte shuffles on x86
|
2009-10-09 21:26:27 -05:00 |
Joe Groff
|
dd691a61e8
|
break vector compare intrinsics into %compare, %or, and %not instructions that map directly to cpu instructions
|
2009-10-07 15:27:03 -05:00 |
Joe Groff
|
f2c9eb79e2
|
decompose %unpack-vector-head/tail into %compare-vector/%merge-vector-head/tail or %tail>head-vector/%unpack-vector-head insns when there isn't an actual unpack insn; get rid of fake x86 implementations
|
2009-10-07 14:09:46 -05:00 |
Joe Groff
|
34def34481
|
don't generate a ##not-vector instruction if the cpu doesn't have one; instead, fall back to a ##fill-vector/##xor-vector combo. get rid of pretend %not-vector in cpu.x86
|
2009-10-07 11:59:36 -05:00 |
Joe Groff
|
879fe9a665
|
fix x86 %unpack-vector insns
|
2009-10-06 20:38:51 -05:00 |
Joe Groff
|
529c444e32
|
add insns for vector pack, unpack, integer>float, and float>integer
|
2009-10-05 22:34:14 -05:00 |
Slava Pestov
|
b4e36608da
|
compiler.cfg: remove _gc instruction, it doesn't need to exist, and change GC checks to ensure that the right amount of space is available instead of blindly checking for 1Kb
|
2009-10-05 05:27:49 -05:00 |
Joe Groff
|
0c9c3d4859
|
add %merge-vector-head and %merge-vector-tail instructions to back vmerge
|
2009-10-03 21:48:53 -05:00 |
Joe Groff
|
4e024cbbc2
|
extend x86 %compare-vector to cover all comparison codes, sometimes stupidly for now
|
2009-10-02 23:19:56 -05:00 |
Joe Groff
|
38f413a8a6
|
add intrinsic for vnot/vbitnot
|
2009-10-02 20:04:28 -05:00 |
Slava Pestov
|
bcbdc9e0c8
|
Merge branch 'master' of git://factorcode.org/git/factor into improved-aa
|
2009-10-02 00:21:44 -05:00 |
Joe Groff
|
53b265f682
|
Merge branch 'master' of git://factorcode.org/git/factor
Conflicts:
basis/compiler/codegen/codegen.factor
|
2009-10-01 23:14:16 -05:00 |
Slava Pestov
|
31f39ce32f
|
compiler.cfg.alias-analysis: optimize ##vm-field-ptr and ##alien-global instructions, and optimize out ##compare between values of different alias classes; this optimizes '[ [ >float ] bi@ [ + ] [ - ] 2bi eq? ]' down to an o-op and removes boxing from '[ [ >float ] bi@ [ + ] [ - ] 2bi = ]'
|
2009-10-01 23:03:17 -05:00 |
Joe Groff
|
f9695951a0
|
fold test-vector/branch sequences into a test-vector-branch instruction
|
2009-10-01 19:53:30 -05:00 |
Slava Pestov
|
b9577e2b0e
|
compiler.cfg.instructions: remove ##box-float, ##unbox-float, ##box-vector, ##unbox-vector since they can be expressed in terms of ##alien-double, ##set-alien-double, ##alien-vector, ##set-alien-vector, and ##allot
|
2009-10-01 18:07:50 -05:00 |
Joe Groff
|
d14f150b58
|
%test-vector instruction for vany?, vall?, vnone?
|
2009-10-01 15:35:38 -05:00 |
Joe Groff
|
987ced4070
|
%compare-vector instruction (only does v= for now)
|
2009-10-01 14:31:37 -05:00 |
Joe Groff
|
a93f8f66f9
|
Revert "add a %blend-vector intrinsic for v?"
This reverts commit 21e4b28b67 .
|
2009-09-30 23:40:37 -05:00 |
Joe Groff
|
67cc45235d
|
Merge branch 'master' of git://factorcode.org/git/factor
|
2009-09-30 23:04:04 -05:00 |
Joe Groff
|
7db7b63552
|
add a %blend-vector intrinsic for v?
|
2009-09-30 23:03:59 -05:00 |
Slava Pestov
|
2384b630b2
|
math.vectors.simd: use fallbacks for hlshift, hrshift, vshuffle if parameter is not a literal;al; element access in int-4 on x86-64 now sign-extends the value; don't throw error at compile time if parameter for vshuffle does not have enough elements
|
2009-09-30 20:04:37 -05:00 |
Slava Pestov
|
cdc7b7e2c7
|
Various minor compiler tweaks: Combine address calculation with dereferencing in alien accessors; convert SIMD XOR of a vector with itself into an XOR of the destination with itself; convert SIMD unbox of zero vector into XOR of the destination with itself; fix SIMD indexing on x86-64
|
2009-09-30 05:00:36 -05:00 |
Slava Pestov
|
1da18d06b1
|
compiler.cfg.value-numbering: add some more rewrite rules, neg/neg, not/not, and a few for SIMD
|
2009-09-30 02:18:29 -05:00 |
Slava Pestov
|
f395d83379
|
math.vectors.simd: add fast intrinsic for 'nth', replace broadcast primitive with shuffles
|
2009-09-29 04:48:11 -05:00 |
Slava Pestov
|
a8ea929ad9
|
Work in progress
|
2009-09-28 17:31:34 -05:00 |
Slava Pestov
|
b2ea3afd84
|
math.vectors.simd: add hlshift, hrshift (128-bit shift), vbitandn intrinsics
|
2009-09-28 02:17:46 -05:00 |
Slava Pestov
|
59fbe85c9b
|
compiler.cfg: nuke ##bignum>integer and ##integer>bignum since they were unused
|
2009-09-27 20:36:05 -05:00 |
Slava Pestov
|
66071f45a3
|
Add a couple more def-is-use instructions
|
2009-09-27 20:34:20 -05:00 |
Slava Pestov
|
f757b454cc
|
Merge branch 'master' into more_aggressive_coalescing
|
2009-09-27 19:29:50 -05:00 |
Slava Pestov
|
705b4ab5c3
|
compiler.cfg.linear-scan: fix partial sync point logic in case where dst == src, and clean up spilling code
|
2009-09-27 19:28:20 -05:00 |
Slava Pestov
|
1e841e5086
|
compiler.cfg.ssa.destruction: more aggressive coalescing work in progress
|
2009-09-27 17:17:26 -05:00 |
sheeple
|
01a4047126
|
Merge branch 'slots' of git://factorcode.org/git/factor into slots
Conflicts:
basis/cpu/x86/x86.factor
|
2009-09-26 03:12:42 -05:00 |
Daniel Ehrenberg
|
7bd330cfd5
|
Making ##slot and ##set-slot not have a temporary parameter
|
2009-09-26 00:28:14 -05:00 |
Slava Pestov
|
cb497f009b
|
Merge branch 'vm_cleanup' of git://github.com/phildawes/factor
|
2009-09-24 04:31:55 -05:00 |
Slava Pestov
|
7b6128dd03
|
math.vectors.simd: add v<< and v>> intrinsics for bitwise shifts on elements
|
2009-09-24 03:32:39 -05:00 |
Phil Dawes
|
84faf2431f
|
cleaned up vm-field-ptr compiler code
|
2009-09-24 08:16:57 +01:00 |
Slava Pestov
|
43fa252af5
|
math.vectors.simd: new operations: vabs vsqrt vbitand vbitor vbitxor
|
2009-09-23 02:47:14 -05:00 |
Slava Pestov
|
9d90bdd439
|
Fix conflict
|
2009-09-20 23:18:07 -05:00 |
Slava Pestov
|
ea44ea3522
|
math.vectors.simd: add saturated arithmetic operations
|
2009-09-20 23:16:02 -05:00 |
Slava Pestov
|
acea55c692
|
math.vectors: add v+- word which is accelerated by SSE3
|
2009-09-20 17:43:16 -05:00 |
Phil Dawes
|
ff8f2b10ec
|
fixed up getenv compiler intrinsic to use vm struct userenv
|
2009-09-16 08:16:32 +01:00 |
Slava Pestov
|
7e2f0e5dbf
|
compiler: separate ##save-context instruction from ##alien-invoke, generate a ##save-context for libm calls, and add a pass to combine multiple context saves within a basic block. Fixes crashes with FP traps thrown by libm functions on x86-32
|
2009-09-08 21:50:55 -05:00 |