1. 19 Jan, 2016 2 commits
    • 
      blob/auth: Cache auth backend reply for 30s · fb57599d
      Kirill Smelkov authored
      In the previous patch we added code to serve blob content by running `git
      cat-file ...` directly, but every such request still triggers a request to
      the slow RoR-based auth backend, which is bad for performance.
      
      Let's cache the auth backend reply for a small period of time, e.g. 30 seconds,
      which changes the situation dramatically:
      
      If we have a lot of requests to the same repository, we query the auth backend
      only for every Nth request. At e.g. 100 raw blob requests/s and a 30-second
      cache, N = 100 × 30 = 3000, which means the previous load on the RoR code
      essentially goes away.
      
      On the other hand, since we still query the auth backend once in a while and
      refresh the cache, we will not miss changes in project settings for long: a
      potential delay of e.g. 25 seconds for a project to become public, or vice
      versa to become private, does no real harm.
      
      The cache is designed so that the read-side codepath can execute in parallel
      and is not blocked by eventual cache updates.
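
      The idea can be sketched roughly as follows. This is a simplified
      illustration, not the actual gitlab-workhorse code: the type and function
      names are hypothetical, and a plain RWMutex stands in for whatever the real
      implementation uses. Lookups take only a read lock, so concurrent readers
      proceed in parallel, and the backend is consulted only when an entry is
      missing or older than the TTL:

      ```go
      package main

      import (
      	"fmt"
      	"sync"
      	"time"
      )

      // authEntry caches one auth backend reply together with its fetch time.
      type authEntry struct {
      	allowed bool
      	fetched time.Time
      }

      // authCache lets readers run in parallel: lookups take only a read
      // lock; a missing or stale entry is refreshed by whichever request
      // notices it, without holding up the other readers for long.
      type authCache struct {
      	mu  sync.RWMutex
      	ttl time.Duration
      	m   map[string]authEntry
      }

      func newAuthCache(ttl time.Duration) *authCache {
      	return &authCache{ttl: ttl, m: make(map[string]authEntry)}
      }

      // verify returns the cached reply if it is fresh, and queries the
      // (hypothetical) backend callback only otherwise.
      func (c *authCache) verify(project string, backend func(string) bool) bool {
      	c.mu.RLock()
      	e, ok := c.m[project]
      	c.mu.RUnlock()
      	if ok && time.Since(e.fetched) < c.ttl {
      		return e.allowed // fast path: no backend round-trip
      	}
      	allowed := backend(project) // slow path: ask the RoR auth backend
      	c.mu.Lock()
      	c.m[project] = authEntry{allowed: allowed, fetched: time.Now()}
      	c.mu.Unlock()
      	return allowed
      }

      func main() {
      	calls := 0
      	backend := func(project string) bool { calls++; return true }

      	c := newAuthCache(30 * time.Second)
      	for i := 0; i < 1000; i++ {
      		c.verify("root/slapos", backend)
      	}
      	if calls != 1 {
      		panic("expected a single backend call")
      	}
      	fmt.Println("backend calls:", calls)
      }
      ```

      With this shape, 1000 requests within the TTL cost one backend round-trip,
      matching the N = 3000 arithmetic above for 100 req/s over 30 seconds.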
      
      Overall this improves performance a lot:
      
        (on a 8-CPU i7-3770S with 16GB of RAM, 2001:67c:1254:e:89::fa34 is on localhost)
      
        # request is handled by gitlab-workhorse, but without auth caching
        $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
        Running 10s test @ http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
          1 threads and 40 connections
          Thread Stats   Avg      Stdev     Max   +/- Stdev
            Latency   622.99ms  200.08ms   1.40s    77.03%
            Req/Sec    62.65     22.37   120.00     55.00%
          Latency Distribution
             50%  589.51ms
             75%  726.88ms
             90%  896.09ms
             99%    1.18s
          626 requests in 10.01s, 1.11MB read
        Requests/sec:     62.55
        Transfer/sec:    113.73KB
      
        # request goes to gitlab-workhorse with auth caching (this patch)
        $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
        Running 10s test @ http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
          1 threads and 40 connections
          Thread Stats   Avg      Stdev     Max   +/- Stdev
            Latency    36.62ms   25.39ms 351.14ms   72.02%
            Req/Sec     1.16k    93.73     1.36k    77.00%
          Latency Distribution
             50%   36.30ms
             75%   47.02ms
             90%   66.36ms
             99%  122.46ms
          11580 requests in 10.01s, 20.56MB read
        Requests/sec:   1156.85
        Transfer/sec:      2.05MB
      
      i.e. an ~18x improvement (1156.85 vs 62.55 requests/sec).
    • 
      Teach gitlab-workhorse to serve requests to get raw blobs · 277d5067
      Kirill Smelkov authored
      Currently GitLab serves requests to get raw blobs via Ruby-on-Rails code and
      Unicorn. Because RoR/Unicorn is relatively heavyweight, in an environment
      with a lot of simultaneous raw blob requests this works very slowly and the
      server is constantly overloaded.
      
      On the other hand, to get raw blob content we do not need anything from the
      RoR framework: we only need access to the project's git repository on the
      filesystem, plus a way to know whether access to its data should be granted.
      That means it is possible to handle '.../raw/...' requests directly in the
      more lightweight and performant gitlab-workhorse.
      
      As gitlab-workhorse is written in Go, and Go has good concurrency/parallelism
      support and is generally much faster than Ruby, moving the raw blob serving
      task there makes sense and should be a net win.
      
      In this patch we add the infrastructure to process GET requests for '/raw/...':
      
      - extract project, ref and path from the URL
      - query the auth backend for whether download access should be granted
      - emit the blob content by spawning an external `git cat-file`
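
      The three steps above could be sketched like this. This is only an
      illustration under assumptions: the regular expression is a simplified stand-in
      for the real URL routing, and the repository root path is invented:

      ```go
      package main

      import (
      	"fmt"
      	"os/exec"
      	"regexp"
      )

      // rawRe splits /<namespace>/<project>/raw/<ref>/<path> into its parts
      // (simplified; the real routing is more involved).
      var rawRe = regexp.MustCompile(`^/([^/]+/[^/]+)/raw/([^/]+)/(.+)$`)

      func main() {
      	url := "/root/slapos/raw/master/software/wendelin/software.cfg"
      	m := rawRe.FindStringSubmatch(url)
      	if m == nil {
      		panic("not a raw blob URL")
      	}
      	project, ref, path := m[1], m[2], m[3]
      	fmt.Println(project, ref, path)

      	// After the auth backend grants download access, blob content is
      	// emitted by spawning git cat-file against the bare repository
      	// (repository root path is an assumption for this sketch):
      	cmd := exec.Command("git", "cat-file", "blob", ref+":"+path)
      	cmd.Dir = "/var/opt/gitlab/git-data/repositories/" + project + ".git"
      	_ = cmd // not run here: requires a real repository on disk
      }
      ```

      Streaming the command's stdout straight into the HTTP response keeps memory
      usage flat regardless of blob size.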
      
      I've tried to make the output mimic the one emitted by the RoR code as
      closely as possible, so that for users the change is transparent.
      
      As in this patch we query the auth backend for every blob request, the RoR
      code is still under heavy load, so essentially there is no speedup yet:
      
        (on a 8-CPU i7-3770S with 16GB of RAM, 2001:67c:1254:e:89::fa34 is on localhost)
      
        # without patch: request eventually goes to unicorn  (9 unicorn workers)
        $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
        Running 10s test @ http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
          1 threads and 40 connections
          Thread Stats   Avg      Stdev     Max   +/- Stdev
            Latency   609.34ms  156.92ms   1.18s    79.60%
            Req/Sec    64.22     19.90   120.00     67.00%
          Latency Distribution
             50%  596.50ms
             75%  678.23ms
             90%  805.72ms
             99%    1.04s
          642 requests in 10.01s, 1.24MB read
        Requests/sec:     64.16
        Transfer/sec:    127.00KB
      
        # with this patch: request handled by gitlab-workhorse
        $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
        Running 10s test @ http://[2001:67c:1254:e:89::fa34]:7777/root/slapos/raw/master/software/wendelin/software.cfg
          1 threads and 40 connections
          Thread Stats   Avg      Stdev     Max   +/- Stdev
            Latency   622.99ms  200.08ms   1.40s    77.03%
            Req/Sec    62.65     22.37   120.00     55.00%
          Latency Distribution
             50%  589.51ms
             75%  726.88ms
             90%  896.09ms
             99%    1.18s
          626 requests in 10.01s, 1.11MB read
        Requests/sec:     62.55
        Transfer/sec:    113.73KB
      
      In the next patch we'll cache replies from the auth backend, which will
      improve performance dramatically.
  2. 18 Jan, 2016 2 commits
  3. 15 Jan, 2016 6 commits
  4. 14 Jan, 2016 3 commits
  5. 13 Jan, 2016 11 commits
  6. 12 Jan, 2016 5 commits
  7. 11 Jan, 2016 3 commits
  8. 08 Jan, 2016 8 commits