a0e1af5891 
								
							
								 
							
						 
						
							
							
								
								Merge branch 'master' of git://factorcode.org/git/factor  
							
							
							
						 
						
							2009-09-08 14:22:52 -05:00  
				
					
						
							
							
								 
						
							
							
								1e5bc3b376 
								
							
								 
							
						 
						
							
							
								
								inline alien-vector and set-alien-vector if SIMD is not available for a small speedup  
							
							
							
						 
						
							2009-09-08 13:56:17 -05:00  
				
					
						
							
							
								 
						
							
							
								942fb7b048 
								
							
								 
							
						 
						
							
							
								
								Merge branch 'master' of git://factorcode.org/git/factor  
							
							
							
						 
						
							2009-09-08 13:22:52 -05:00  
				
					
						
							
							
								 
						
							
							
								bbca00e2ae 
								
							
								 
							
						 
						
							
							
								
								Fix conflicts  
							
							
							
						 
						
							2009-09-07 23:51:25 -05:00  
				
					
						
							
							
								 
						
							
							
								9f9b6bca01 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation: type check inputs to unsafe foldable words manually, so that stuff like [ "Hi" { } fixnum+fast ] doesn't crash in the compiler  
							
							
							
						 
						
							2009-09-07 23:40:23 -05:00  
				
					
						
							
							
								 
						
							
							
								a1ae209f81 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation.call-effect: stronger call( inlining; now can inline 'a [ b ] curry call(' where 'a' is literal, [ b ] doesn't infer, but [ a b ] does infer. This simplifies classes.struct:memory>struct  
							
							
							
						 
						
							2009-09-07 17:45:03 -05:00  
				
					
						
							
							
								 
						
							
							
								e36a0d7ef4 
								
							
								 
							
						 
						
							
							
								
								compiler: clean up code generation for alien boxing/unboxing a bit  
							
							
							
						 
						
							2009-09-03 21:22:43 -05:00  
				
					
						
							
							
								 
						
							
							
								4d5a4222b6 
								
							
								 
							
						 
						
							
							
								
								More SIMD work  
							
							... 
							
							
							
							- Rename SIMD types and register representations: <type>-<count> rather than <count><type>-array
- Make a functor to define 256-bit vector types, use it to define float-8 type
- Make SIMD instructions pure-insns so that they participate in value numbering 
							
						 
						
							2009-09-03 20:58:56 -05:00  
				
					
						
							
							
								 
						
							
							
								bf81cb4259 
								
							
								 
							
						 
						
							
							
								
								math.vectors.simd: split off intrinsics into a sub-vocabulary, to avoid loading most of the SIMD code on bootstrap  
							
							
							
						 
						
							2009-09-03 03:43:43 -05:00  
				
					
						
							
							
								 
						
							
							
								ae051e0c9a 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation: type function for clone had an issue, sometimes clone would get optimized out because of incorrect constant folding  
							
							
							
						 
						
							2009-09-03 02:40:18 -05:00  
				
					
						
							
							
								 
						
							
							
								ff8c70dbe0 
								
							
								 
							
						 
						
							
							
								
								Initial implementation of SSE vector intrinsics:  
							
							... 
							
							
							
							- cpu.architecture: add SSE vector representations
- compiler.cfg.intrinsics.alien: remove an attempt at optimization that value numbering handles now
- compiler.cfg.representations: support instructions where the representation is set in the 'rep' slot, and support conversions between single and double floats
- alien-float, set-alien-float now use the single float representation, and the conversion is implicit; this fixes a long-standing bug where a register could get clobbered because of how %set-alien-float was defined on x86
- math.vectors.specialization: add support for SIMD specialization (where the vector word's body is replaced by another quotation), also specialize the 'sum' word
- math.vectors.simd: 4float-array, 2double-array, 4double-array types, and specializers for the math.vectors words 
							
						 
						
							2009-09-03 02:33:07 -05:00  
				
					
						
							
							
								 
						
							
							
								2bb6293217 
								
							
								 
							
						 
						
							
							
								
								compiler: add fixnum-min/max intrinsics; ~10% speedup on benchmark.yuv-to-rgb  
							
							
							
						 
						
							2009-08-28 19:02:59 -05:00  
				
					
						
							
							
								 
						
							
							
								d957ae4e44 
								
							
								 
							
						 
						
							
							
								
								Performance improvements to make struct-arrays benchmark faster  
							
							... 
							
							
							
							- improved optimization of ##unbox-any-c-ptr on ##box-displaced-alien; convert it to ##unbox-c-ptr where possible using class info stored in the ##bda instruction
- make fcos, fsin, etc inline again; everything in math.libm inline again, except for fsqrt which is an intrinsic
- convert min and max on floats to float-min and float-max
- make min and max not inline, so that the above can work
- struct-arrays: rice a bit so that more fixnums come up 
							
						 
						
							2009-08-28 05:21:16 -05:00  
				
					
						
							
							
								 
						
							
							
								ba0f3a9911 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation.transforms: don't fail to compile if 'at' called on something that's not an assoc  
							
							
							
						 
						
							2009-08-27 18:57:56 -05:00  
				
					
						
							
							
								 
						
							
							
								0df8aadce2 
								
							
								 
							
						 
						
							
							
								
								cpu.x86: use SQRTSD instruction for math.libm:fsqrt word  
							
							
							
						 
						
							2009-08-25 23:22:15 -05:00  
				
					
						
							
							
								 
						
							
							
								49bd2228ec 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.modular-arithmetic: fix regression; set-alien-*-1 was not always open-coded  
							
							
							
						 
						
							2009-08-20 17:56:49 -05:00  
				
					
						
							
							
								 
						
							
							
								9ef8f6c81d 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.modular-arithmetic: eliminate >bignum calls where possible, convert fixnum-shift to fixnum-shift-fast if shift count is positive, don't run if there are no modular values  
							
							
							
						 
						
							2009-08-20 03:47:45 -05:00  
				
					
						
							
							
								 
						
							
							
								f01f7ad6eb 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation: bitand custom inlining was wrong if the second input was a bignum  
							
							
							
						 
						
							2009-08-20 03:47:07 -05:00  
				
					
						
							
							
								 
						
							
							
								2dc99ea05f 
								
							
								 
							
						 
						
							
							
								
								Fix interval inference of abs, absq when input is a complex number  
							
							
							
						 
						
							2009-08-19 16:06:37 -05:00  
				
					
						
							
							
								 
						
							
							
								829107902e 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation: improved interval inference for absq eliminates a conditional from math.vectors:distance. Type inference for rational math also a bit sharper now  
							
							
							
						 
						
							2009-08-19 02:33:41 -05:00  
				
					
						
							
							
								 
						
							
							
								d7594c3381 
								
							
								 
							
						 
						
							
							
								
								compiler: inline singleton predicates, and optimize predicate engines, reduces terrain demo deployed size by ~20kb  
							
							
							
						 
						
							2009-08-17 23:59:24 -05:00  
				
					
						
							
							
								 
						
							
							
								3047d4a451 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation: remove method inlining heuristic  
							
							
							
						 
						
							2009-08-17 22:29:05 -05:00  
				
					
						
							
							
								 
						
							
							
								bf57d78b09 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.modular-arithmetic: >fixnum elimination and value info annotations were too aggressive  
							
							
							
						 
						
							2009-08-17 01:20:25 -05:00  
				
					
						
							
							
								 
						
							
							
								5297be3e19 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.modular-arithmetic: stronger optimization handles > 1 usages case as well as values defined and used in loops. Eliminates 5 out of 8 >fixnum calls in benchmark.yuv-to-rgb  
							
							
							
						 
						
							2009-08-15 18:42:41 -05:00  
				
					
						
							
							
								 
						
							
							
								7c92ab1ea5 
								
							
								 
							
						 
						
							
							
								
								move if-zero etc to math, remove 1-/1+ from math  
							
							
							
						 
						
							2009-08-14 14:27:23 -05:00  
				
					
						
							
							
								 
						
							
							
								3f3d57032b 
								
							
								 
							
						 
						
							
							
								
								Delete empty unit tests files, remove 1- and 1+, reorder IN: lines in a lot of places, minor refactoring  
							
							
							
						 
						
							2009-08-13 19:21:44 -05:00  
				
					
						
							
							
								 
						
							
							
								4ecf5a904a 
								
							
								 
							
						 
						
							
							
								
								More accurate wrap-interval in compiler.tree.propagation.info fixes test regression; constructing an interval with endpoints at infinity now outputs full-interval  
							
							
							
						 
						
							2009-08-12 03:25:53 -05:00  
				
					
						
							
							
								 
						
							
							
								422dd24bad 
								
							
								 
							
						 
						
							
							
								
								alien.complex: fix tests  
							
							
							
						 
						
							2009-08-11 14:07:33 -05:00  
				
					
						
							
							
								 
						
							
							
								26859a45eb 
								
							
								 
							
						 
						
							
							
								
								various doc fixes  
							
							
							
						 
						
							2009-08-11 16:58:47 +02:00  
				
					
						
							
							
								 
						
							
							
								7e35723db0 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation: be more careful with intervals, ensuring that the inferred interval of a value is a subset of the value class's interval. This improves accuracy, for example [ >fixnum 1 + >fixnum most-positive-fixnum <= ] constant-folds to true  
							
							
							
						 
						
							2009-08-10 01:16:49 -05:00  
				
					
						
							
							
								 
						
							
							
								8a9c15ab0b 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.escape-analysis: if the output of an #introduce node has an immutable tuple class type declaration, and it is not passed to any subroutine calls, or returned from the word, then unbox it. This speeds up vector arithmetic words on specialized arrays, because the specialized array is unboxed up-front, eliminating an indirection on every loop iteration  
							
							
							
						 
						
							2009-08-09 16:29:21 -05:00  
				
					
						
							
							
								 
						
							
							
								e400d80d8b 
								
							
								 
							
						 
						
							
							
								
								More accurate interval inference for mod, rem, and propagation can now infer intervals in the case where a value might be f. so, [ [ 127 bitand ] [ drop f ] if dup [ 0 >= ] [ not ] if ] now constant-folds down to 't'!  
							
							
							
						 
						
							2009-08-08 23:03:45 -05:00  
				
					
						
							
							
								 
						
							
							
								61ea749bb6 
								
							
								 
							
						 
						
							
							
								
								More accurate interval-mod and interval-rem  
							
							
							
						 
						
							2009-08-08 22:01:12 -05:00  
				
					
						
							
							
								 
						
							
							
								97ec3ea1b4 
								
							
								 
							
						 
						
							
							
								
								compiler.graphviz: add high-level IR call graph rendering  
							
							
							
						 
						
							2009-08-05 03:33:06 -05:00  
				
					
						
							
							
								 
						
							
							
								fb03a93763 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.recursive: have to check tail call flag of call tree edges too  
							
							
							
						 
						
							2009-08-05 02:14:49 -05:00  
				
					
						
							
							
								 
						
							
							
								c3d60e5899 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.recursive: more accurate loop detection  
							
							
							
						 
						
							2009-08-04 19:18:40 -05:00  
				
					
						
							
							
								 
						
							
							
								eed4f4dcfc 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.recursive: add some more more loop detection tests  
							
							
							
						 
						
							2009-08-04 16:23:14 -05:00  
				
					
						
							
							
								 
						
							
							
								82c1106945 
								
							
								 
							
						 
						
							
							
								
								compiler.cfg.ssa.destruction: new implementation: simpler and more correct  
							
							
							
						 
						
							2009-08-02 10:26:52 -05:00  
				
					
						
							
							
								 
						
							
							
								e01fc93812 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation.call-effect: fix case where quotation inlining could enter an infinite loop: [ dup call( quot -- ) ] dup call( quot -- ) etc  
							
							
							
						 
						
							2009-08-01 23:34:14 -05:00  
				
					
						
							
							
								 
						
							
							
								501629cf75 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation.info: fix load error  
							
							
							
						 
						
							2009-07-27 00:30:24 -05:00  
				
					
						
							
							
								 
						
							
							
								fc41140720 
								
							
								 
							
						 
						
							
							
								
								compiler.cfg.utilities: move value-info utilities to compiler.tree.propagation.info  
							
							
							
						 
						
							2009-07-26 21:10:33 -05:00  
				
					
						
							
							
								 
						
							
							
								343151fd62 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.finalization: undo a change that resulted in a compile time regression  
							
							
							
						 
						
							2009-07-18 03:33:24 -05:00  
				
					
						
							
							
								 
						
							
							
								5061db2a48 
								
							
								 
							
						 
						
							
							
								
								compiler.cfg: Move tail call optimization to the end, compiler.tree: simplify finalization pass because of this  
							
							
							
						 
						
							2009-07-17 22:57:28 -05:00  
				
					
						
							
							
								 
						
							
							
								1d735bc32d 
								
							
								 
							
						 
						
							
							
								
								compiler.tree.propagation.transforms: fix shift transform  
							
							
							
						 
						
							2009-07-17 00:02:57 -05:00  
				
					
						
							
							
								 
						
							
							
								3fb4fc1bde 
								
							
								 
							
						 
						
							
							
								
								Improve code generation for shift word: add intrinsics for fixnum-shift-fast in the case where the shift count is not constant, transform 1 swap shift into a more overflow check with open-coded fast case, transform bitand into fixnum-bitand in more cases  
							
							
							
						 
						
							2009-07-16 23:50:48 -05:00  
				
					
						
							
							
								 
						
							
							
								e76dce8aff 
								
							
								 
							
						 
						
							
							
								
								Overflowing fixnum intrinsics now expand into several CFG nodes. This speeds up the common case since only the uncommon case is now a stack syncpoint  
							
							
							
						 
						
							2009-07-16 18:29:40 -05:00  
				
					
						
							
							
								 
						
							
							
								062e33f8fb 
								
							
								 
							
						 
						
							
							
								
								fixing stupid bug in propagation  
							
							
							
						 
						
							2009-07-16 00:43:54 -05:00  
				
					
						
							
							
								 
						
							
							
								ee3e84a1f8 
								
							
								 
							
						 
						
							
							
								
								define-partial-eval framework in propagation pass makes it easy to add transforms; moving some transforms from stack checker to propagation, making them stronger  
							
							
							
						 
						
							2009-07-16 00:34:50 -05:00  
				
					
						
							
							
								 
						
							
							
								bb06facb01 
								
							
								 
							
						 
						
							
							
								
								new is inlined in the propagation pass when the class is known  
							
							
							
						 
						
							2009-07-14 14:16:39 -05:00  
				
					
						
							
							
								 
						
							
							
								47500fad06 
								
							
								 
							
						 
						
							
							
								
								call( and execute( inline known quotations/words in the propagation pass  
							
							
							
						 
						
							2009-07-14 01:12:45 -05:00