Commits · a62ba58f2bdd7b7ca743d58b59546c803901aace · Boxiang Sun / Pyston

28 Jul, 2015 9 commits

Templatize runtimeCall for different exception styles · a62ba58f

Kevin Modzelewski authored Jul 28, 2015

Which also involves templatizing most of the things that it
calls as well.

CompiledFunction's now have an additional parameter "exception_style",
and we will try to call a version of the function with the requested
style.  All of our runtime functions are written to the CXX style,
and there's not a whole lot of benefit until we templatize the runtime
functions individually.

a62ba58f

Merge pull request #768 from kmod/exceptions2 · 75e203a2
Kevin Modzelewski authored Jul 27, 2015
```
start templatizing the runtime to be able to choose exception styles
```
75e203a2
Merge pull request #759 from Daetalus/test_format · 52a94c1d
Kevin Modzelewski authored Jul 27, 2015
```
Fix the errors that report in `test format` and re-enable `test format`.
```
52a94c1d
Add noexcept() specifiers · 8170d8ae
Kevin Modzelewski authored Jul 28, 2015
```
Apparently they can do compile-time evaluations, which is cool.
```
8170d8ae

Templatize getitemInternal · ed14dd77

Kevin Modzelewski authored Jul 27, 2015

For use of PyObject_GetItem

django_template3 ends up calling this a fair amount via unicode_translate
(ie it checks to see if certain entries are in the translation table).

ed14dd77

Merge pull request #760 from undingen/bjit_missing_core · 971519a0
Kevin Modzelewski authored Jul 27, 2015
```
bjit: add support for most common missing nodes and don't JIT compile cold blocks
```
971519a0
Merge pull request #763 from Daetalus/old_style_class · f7367f15
Kevin Modzelewski authored Jul 27, 2015
```
Numeric binary operator support for old style class
```
f7367f15
Fix exception handling in lookup_maybe · 729feee3
Kevin Modzelewski authored Jul 27, 2015

729feee3

Partially-templatize len · 4a114aaa

Kevin Modzelewski authored Jul 27, 2015

The exceptions thrown by len itself can now be either style,
though any exceptions thrown by any called functions (ex __len__)
will still get thrown as C++ exceptions and converted if needed.

Helps in a common case of "try calling len but don't worry if no
len was defined".

4a114aaa

27 Jul, 2015 20 commits
- Templatize getattrInternal · dfeff147
  Kevin Modzelewski authored Jul 27, 2015
  
  dfeff147
- copy PyNumber_Long and related function from CPython · a2c06e4c
  Boxiang Sun authored Jul 28, 2015
  
  a2c06e4c
- bjit: don't jit cold blocks after doing a OSR JIT compilation · d18ff95a
  Marius Wachtler authored Jul 27, 2015
```
Previously after doing a OSR JIT compilation we continued to JIT every block outside of the loop.
This doesn't show up as a perf change but reduces the number of JITed code / makes it slightly smaller.
```
  d18ff95a
- bjit: add support for make function and lambda nodes · aa5aaca4
  Marius Wachtler authored Jul 27, 2015
  
  aa5aaca4
- copy instance_pow and instance_ipow with related function from CPython · d4ad2f78
  Boxiang Sun authored Jul 27, 2015
  
  d4ad2f78
- Merge pull request #764 from corona10/issue689 · eb8e5c4d
  Kevin Modzelewski authored Jul 27, 2015
```
list recursive printing
```
  eb8e5c4d
- bjit: support some delete statements · 3d146ce6
  Marius Wachtler authored Jul 27, 2015
  
  3d146ce6
- bjit: add support for the import nodes · 8e8c910f
  Marius Wachtler authored Jul 24, 2015
  
  8e8c910f
- Merge pull request #761 from kmod/perf2 · 0fdbc574
  Kevin Modzelewski authored Jul 27, 2015
```
optimize some misc runtime functions
```
  0fdbc574
- add test for testing recursive print · 87a906df
  Dong-hee,Na authored Jul 27, 2015
  
  87a906df
- fix issue 689 · 87957462
  Dong-hee,Na authored Jul 27, 2015
  
  87957462
- Add some more exceptions stats · bc73a3ab
  Kevin Modzelewski authored Jul 26, 2015
  
  bc73a3ab
- Speed up a number of runtime functions · 63ec21c9
  Kevin Modzelewski authored Jul 24, 2015
```
callable(), str(), repr(), PySequence_GetItem(),
and PyObject_HasAttrString()

Mostly by bringing in the CPython versions.
```
  63ec21c9
- Add django_tiny microbenchmark · a6a326e5
  Kevin Modzelewski authored Jul 24, 2015
  
  a6a326e5
- Optimize calling str() and unicode() · 0afbff14
  Kevin Modzelewski authored Jul 24, 2015
```
They are tricky since these are types, which means they invoke
the relatively-complicated constructor logic.  ie str() doesn't
just call __str__ on the argument: if the result is a subclass
of str, it calls result.__init__().  Similarly for unicode, except
unicode is even trickier since it takes some more arguments, one
of which is "encoding" which will have non-type-based dynamic
behavior.

I didn't realize that at first and optimized unicode() by exposing an
inner version of it that takes its arguments in registers, which we
can take advantage of using our jit-arg-rearrangement capability.
This means we have to do parts of PyArg_ParseTuple ourselves, so I
added a PyArg_ParseSingle that runs a single object through the
arg-conversion code.  PyArg_ParseSingle could be further optimized if
we want to.  Or rather, if we have functions of the form
PyArg_ParseSingle_s (which corresponds to the "s" format code) we
could skip some more of the overhead.

I had to disable most of that once I realized the encoding issue, but
I left it in since hopefully we will be able to use it again once
we have some "do some guards after mutations if we know how to resume
after a failed guard" rewriter support.
```
  0afbff14
- add more power tests for oldstyle class, including new-old mixded calling · 954cd76e
  Boxiang Sun authored Jul 27, 2015
  
  954cd76e
- re-enable test_aguassign · 5bb0f9ba
  Boxiang Sun authored Jul 27, 2015
  
  5bb0f9ba
- add other binary operators to old style class except pow, ipow, rpow · a8f91c23
  Boxiang Sun authored Jul 24, 2015
  
  a8f91c23
- Merge pull request #762 from tjhance/test_tuple_cpython · c42d3395
  Kevin Modzelewski authored Jul 26, 2015
```
get cpython/test_tuple.py to pass
```
  c42d3395
- Merge pull request #715 from tjhance/test_iter_cpython · cb0939c6
  Kevin Modzelewski authored Jul 26, 2015
```
Get test_iter.py to work
```
  cb0939c6
26 Jul, 2015 4 commits
- revese the pyston changes in PyString_Formatlong · b4a5816d
  Boxiang Sun authored Jul 26, 2015
  
  b4a5816d
- if long is zero, do not add prefix when with oct · 0ae78250
  Boxiang Sun authored Jul 26, 2015
  
  0ae78250
- check the whether has __long__ function in object, try to call it. · 057004c1
  Boxiang Sun authored Jul 26, 2015
  
  057004c1
- reenable test_format · f094057e
  Boxiang Sun authored Jul 26, 2015
  
  f094057e
24 Jul, 2015 7 commits

Merge pull request #739 from Daetalus/min_max · 4cd23054
Kevin Modzelewski authored Jul 24, 2015
```
Min max
```
4cd23054
Merge pull request #736 from undingen/interp_vregs4 · 22759d1a
Kevin Modzelewski authored Jul 24, 2015
```
assign fixed slots (vregs) to the symbols.
```
22759d1a
Merge pull request #757 from undingen/threadlock · b5341641
Kevin Modzelewski authored Jul 24, 2015
```
Use cpythons lock implementation
```
b5341641

Use cpythons lock implementation · d0044180

Marius Wachtler authored Jul 24, 2015

This switches the thread lock implementation to use a semaphore instead of a mutex.
I hope this gets rid of the threading_local.py error on travis-ci.

d0044180

interpreter: Assign fixed slots (vregs) to symbols with fast or closure scope. · 16eed354

Marius Wachtler authored Jul 17, 2015

This removes a bottleneck of the interpreter/bjit:
most var accesses introduced a DenseMap lookup, with this change we use a fixed offset per var.
The bjit stores the pointer to the vregs array inside r14 for fast accesses.

16eed354

Allocate all ASTInterpreter instances on the stack and remove the interpreter map. · 1c1dcdb9

Marius Wachtler authored Jul 16, 2015

Not having the ASTInterpreter GC allocated improves performance.
I had to add a small asm function in order to produce a special stack frame where we can easily retrieve the ASTInterpreter*,
to replace s_interpreterMaps job. This also make sure that this function really does not get inlined.
The s_interpreterMap was hard to understand and produced several times problems (duplicate entries,...)

This patch contains a hack which limits the number of variables inside a function to 512.
Because we have to make sure the are all on the stack and can't dynamically add more space.
An upcoming patch will remove this limitation and replace it with a stack alloca of the size of the actual number of variables the function uses.

1c1dcdb9

Use cpythons lock implementation · a0b5c48a

Marius Wachtler authored Jul 24, 2015

This switches the thread lock implementation to use a semaphore instead of a mutex.
I hope this gets rid of the threading_local.py error on travis-ci.

a0b5c48a