- 04 Feb, 2015 4 commits
-
Marius Wachtler authored
-
Marius Wachtler authored
Speeds up the interpreter by about 10-15% when the higher tiers are disabled
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
Most importantly, intern all the strings we put into the AST* nodes (the AST_Module* owns them). This should save us some memory, but it also improves performance pretty substantially, since we can now do string comparisons very cheaply. Performance of the interpreter tier is up by something like 30%, and JIT-compilation times are down as well (though not by as much as I was hoping). The overall effect on perf is more muted since we tier out of the interpreter pretty quickly; to see more benefit, we'll have to retune the OSR/reopt thresholds.

For better or worse (mostly better IMO), the interned-ness is encoded in the type system, and nothing will automatically convert between an InternedString and a std::string. That makes this diff quite large, but it also makes it much clearer where we are making string copies or have other room for optimization.
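A rough, self-contained sketch of the idea (illustrative only, not Pyston's actual InternedString/InternedStringPool): the pool owns one canonical copy of each distinct string, so comparisons reduce to pointer comparisons, and the wrapper type refuses to convert implicitly to or from std::string.

```cpp
// Illustrative sketch only -- not Pyston's actual InternedString implementation.
#include <string>
#include <unordered_set>

class InternedString {
private:
    const std::string* str;  // points at the pool's canonical copy
    explicit InternedString(const std::string* s) : str(s) {}
    friend class InternedStringPool;

public:
    // Equal strings share one pool entry, so equality is a pointer comparison.
    bool operator==(InternedString rhs) const { return str == rhs.str; }
    // No implicit conversion to/from std::string; copies have to be explicit.
    const std::string& s() const { return *str; }
};

class InternedStringPool {
private:
    std::unordered_set<std::string> pool;  // conceptually owned by the module

public:
    InternedString get(const std::string& s) {
        auto it = pool.insert(s).first;  // element addresses are stable in a node-based set
        return InternedString(&*it);
    }
};

int main() {
    InternedStringPool pool;
    InternedString a = pool.get("foo");
    InternedString b = pool.get("foo");
    return a == b ? 0 : 1;  // same interned entry, so this compares equal cheaply
}
```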
-
- 03 Feb, 2015 4 commits
-
Kevin Modzelewski authored
In certain cases we didn't handle it well when we were sure that a type error would occur (e.g. indexing into something we know is None) -- we would error out in codegen instead of generating the code to throw the error at runtime. (Sneak in another travis.yml attempt.)
-
Kevin Modzelewski authored
I'm sure there's a better way to test the travis build than committing to master, but why bother when this time will obviously work!
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
Our previous Travis build steps had a circular dependency between cmake and LLVM: we need to run cmake to update LLVM to our picked revision, but we need to be on that specific LLVM revision in order to run cmake (newer LLVMs are incompatible with our build scripts). Break the dependency by manually calling git_svn_gotorev.py. Hopefully this syntax works.
-
- 02 Feb, 2015 7 commits
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
The goal is to avoid continually calling functions that deopt every time, since deopting is expensive. Right now the threshold is simple: if a function deopts 4 (configurable) times, mark that function version as invalid and force a recompilation on the next call.
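A minimal sketch of that kind of threshold policy (the names and structure here are assumptions, not Pyston's actual bookkeeping):

```cpp
// Minimal sketch of a deopt-count threshold; names here are illustrative only.
#include <cstdio>

// Configurable threshold: after this many deopts, stop reusing this compiled version.
static const int MAX_DEOPTS_PER_VERSION = 4;

struct CompiledVersion {
    int times_deopted = 0;
    bool valid = true;
};

// Called whenever a compiled version hits its deopt path.
void onDeopt(CompiledVersion& v) {
    if (++v.times_deopted >= MAX_DEOPTS_PER_VERSION) {
        // Invalidate this version; the next call falls back and triggers a recompile.
        v.valid = false;
    }
}

int main() {
    CompiledVersion v;
    for (int i = 0; i < 5; i++)
        onDeopt(v);
    std::printf("valid after 5 deopts: %s\n", v.valid ? "yes" : "no");
}
```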
-
Kevin Modzelewski authored
Old deopt worked by compiling two copies of every BB, one with speculations and one without, and stitching the two together. This has a number of issues:
- doubles the amount of code LLVM has to jit
- can't ever get back on the optimized path
- doesn't support 'deopt if branch taken'
- horrifically complex
- doesn't support deopt from within try blocks

We actually ran into that last issue (see the test from the previous commit). So rather than wade in and try to fix old-deopt, just start switching to new-deopt. New deopt works by using the frame introspection features, gathering up all the locals, and passing them to the interpreter.
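Roughly, the flow being described might be sketched like this (every name below is invented for illustration; this is not Pyston's actual deopt entry point):

```cpp
// Toy illustration of the new deopt flow (all types/functions here are invented):
// gather the live locals via frame introspection, then let the interpreter
// continue from the same point, instead of jumping to a second compiled copy.
#include <map>
#include <string>

struct Value { long n; };                          // stand-in for a boxed value
using FrameLocals = std::map<std::string, Value>;  // name -> current value

FrameLocals introspectCurrentFrame() {             // pretend: walk the JITed frame
    FrameLocals locals;
    locals["x"] = Value{1};
    locals["y"] = Value{2};
    return locals;
}

Value interpretFrom(int resume_point, const FrameLocals& locals) {
    (void)resume_point;                            // pretend: the interpreter resumes here
    return locals.at("x");
}

Value deopt(int resume_point) {
    FrameLocals locals = introspectCurrentFrame();
    return interpretFrom(resume_point, locals);
}
```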
-
Kevin Modzelewski authored
We currently can't deopt from inside an exception block.
-
Kevin Modzelewski authored
You can imagine what happens if the variable is undefined and we try to return it.
-
Kevin Modzelewski authored
Mark enumerate_cls as safe for type call rewriting
-
- 29 Jan, 2015 11 commits
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
The macros would cast their argument to PyObject*, but our functions just take a PyObject* directly -- which is an issue if the argument is something else (like a PyListObject*). We need to have wrapper macros that cast and then call the underlying function.
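A self-contained sketch of that cast-and-call wrapper pattern (the MyListObject / MyList_Size names are made up purely to mimic the C-API style):

```cpp
// Sketch of the cast-and-call wrapper pattern; MyListObject / MyList_Size are
// invented names that only mimic the C-API style.
#include <cstdio>

struct PyObjectLike { int ob_refcnt; };
struct MyListObject { PyObjectLike base; int length; };

// The underlying function only accepts the base type...
static long MyList_SizeImpl(PyObjectLike* op) {
    return ((MyListObject*)op)->length;
}

// ...so the public macro casts first, letting callers pass a MyListObject*
// (or any other concrete object pointer) without a compile error.
#define MyList_Size(op) MyList_SizeImpl((PyObjectLike*)(op))

int main() {
    MyListObject list = {{1}, 3};
    std::printf("%ld\n", MyList_Size(&list));  // &list is a MyListObject*, not a PyObjectLike*
    return 0;
}
```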
-
Kevin Modzelewski authored
Many of them originally failed due to a missing language feature, but were now failing for simple reasons, like printing the repr of an object that doesn't define __repr__ and getting something like "<C object at 0x1234567890>" (i.e. nondeterministic output).
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
More gc perf changes
-
Kevin Modzelewski authored
-
- 28 Jan, 2015 14 commits
-
Chris Toshok authored
Add a PRECISE and a HIDDEN_CLASS GCKind, with special visit behavior (visitRange for PRECISE, and HiddenClass::gc_visit for HIDDEN_CLASS). Don't scan HiddenClasses or HCAttrs conservatively.
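A simplified illustration of what kind-based visiting looks like (the enum values and visitor API below are stand-ins, not Pyston's actual GC interfaces):

```cpp
// Simplified illustration of kind-based GC visiting; the enum values and
// visitor API are stand-ins, not Pyston's actual GC code.
#include <cstddef>
#include <cstdint>
#include <vector>

enum class GCKind : uint8_t {
    CONSERVATIVE,  // scan the whole allocation word-by-word for possible pointers
    PRECISE,       // every word of the allocation is known to be a GC pointer
    HIDDEN_CLASS,  // the object visits its own fields via a method
};

struct GCVisitor {
    std::vector<void*> worklist;
    void visit(void* p) { if (p) worklist.push_back(p); }
    void visitRange(void** start, void** end) {
        for (void** cur = start; cur < end; cur++) visit(*cur);
    }
    void visitPotentialRange(void** start, void** end) {
        // Conservative scan: anything that might be a heap pointer gets treated as one.
        for (void** cur = start; cur < end; cur++) visit(*cur);
    }
};

struct HiddenClassLike {
    void* parent = nullptr;
    void gc_visit(GCVisitor* v) { v->visit(parent); }  // precise, field-by-field
};

void visitAllocation(GCVisitor* v, GCKind kind, void* data, std::size_t nbytes) {
    void** start = (void**)data;
    void** end = (void**)((char*)data + nbytes);
    switch (kind) {
    case GCKind::PRECISE:
        v->visitRange(start, end);              // no conservative guessing needed
        break;
    case GCKind::HIDDEN_CLASS:
        ((HiddenClassLike*)data)->gc_visit(v);  // object-specific visiting
        break;
    case GCKind::CONSERVATIVE:
        v->visitPotentialRange(start, end);
        break;
    }
}
```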
-
Chris Toshok authored
Store frequently used values in the Block. Also give scanForNext an early out so it can avoid entering the loop at all.
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
Use a vector of chunks for the TraceStack, instead of a vector of individual pointers.
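One way such a chunked stack could look (a sketch under assumed sizes and names, not the actual TraceStack):

```cpp
// Sketch of a chunk-based trace/mark stack; sizes and names are illustrative.
#include <cstdlib>
#include <vector>

static const int CHUNK_SIZE = 256;  // pointers per chunk

struct ChunkedTraceStack {
    std::vector<void**> chunks;  // all chunks currently in use, oldest first
    void** cur = nullptr;        // next free slot in the newest chunk
    void** end = nullptr;        // one past the newest chunk's last slot

    void push(void* p) {
        if (cur == end) {
            // Newest chunk is full (or we have none yet): grab a whole new chunk,
            // rather than growing one big vector of individual pointers.
            void** chunk = (void**)malloc(sizeof(void*) * CHUNK_SIZE);
            chunks.push_back(chunk);
            cur = chunk;
            end = chunk + CHUNK_SIZE;
        }
        *cur++ = p;
    }

    void* pop() {
        if (chunks.empty())
            return nullptr;  // stack is empty
        if (cur == chunks.back()) {
            // Newest chunk has been drained: retire it and resume in the previous one.
            free(chunks.back());
            chunks.pop_back();
            if (chunks.empty()) {
                cur = end = nullptr;
                return nullptr;
            }
            cur = end = chunks.back() + CHUNK_SIZE;
        }
        return *--cur;
    }

    ~ChunkedTraceStack() {
        for (void** c : chunks)
            free(c);
    }
};
```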
-
Kevin Modzelewski authored
My theory is that this is because it overflows a signed int in 32-bit builds. This should hopefully fix #272.
-
Travis Hance authored
Str just funcs
-
Travis Hance authored
-
Chris Toshok authored
Also keep a free list of chunks around to make subsequent collections faster. This results in TraceStack::pop and ::push being inlined and disappearing from the perf report (::push was at 3.99% before), and also drops aggregate GC times for ray trace by ~5%.
before: gc_collections_us: 2151827
after: gc_collections_us: 2023809
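The free-list idea could be layered on top of a chunked stack like the one sketched above, along these lines (again illustrative only; getChunk/releaseChunk are invented names):

```cpp
// Sketch of reusing retired chunks between collections; names are illustrative.
#include <cstdlib>
#include <vector>

static const int CHUNK_SIZE = 256;

// Chunks that collections have finished with, kept around for reuse.
static std::vector<void**> free_chunks;

void** getChunk() {
    if (!free_chunks.empty()) {
        void** chunk = free_chunks.back();  // fast path: reuse a retired chunk
        free_chunks.pop_back();
        return chunk;
    }
    return (void**)malloc(sizeof(void*) * CHUNK_SIZE);
}

void releaseChunk(void** chunk) {
    // Don't free(): park the chunk so the next collection's pushes skip malloc.
    free_chunks.push_back(chunk);
}
```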
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
Had to modify tempfile to not depend on io, which is a big module (due to importing _io). tempfile seems to barely even use it, and I think I was able to replace its usage with a simpler os.write call.
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-
Kevin Modzelewski authored
-