Commits · a79fd7ad3ef41939933a2496478f7f2fa6ba9939 · iv / gitlab-workhorse

20 Jan, 2017 1 commit
- fixup! NXD blob/auth: Teach it to handle HTTP Basic Auth too · a79fd7ad
  iv authored Jan 20, 2017
```
needed after change for GOPATH compatibility (commit 646de543)
```
  a79fd7ad
12 Jan, 2017 7 commits

NXD blob/auth: Teach it to handle HTTP Basic Auth too · 4e9e1ba6

Kirill Smelkov authored Mar 14, 2016

[ Not sent upstream.

  The patch was not sent upstream because the previous 2 raw blob patches
  were not accepted (see details there).

  OTOH it is very handy in the SlapOS environment to use CI token auth for
  raw downloading, so we just carry it with us as NXD. ]

There are cases when using user:password for /raw/... access is handy:

- when using a query parameter for auth (private_token) is not convenient
  for some reason (e.g. client processing software does not handle queries
  well when generating URLs)

- when we do not want to create many artificial users and use their
  tokens, but instead just use the per-project, automatically set up

    gitlab-ci-token : <ci-token>

  artificial user & "password", which are already handled by the auth
  backend for `git fetch` requests.

Handling is easy: if the main auth backend rejects access and there is
user:password in the original request, we retry, asking the auth backend the
same way `git fetch` would.

Access is granted if either of the two ways of asking the auth backend
succeeds. This way both private tokens / cookies and HTTP auth are supported.
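
A minimal sketch of the fallback described above, not the code from the patch.
The `authFunc` type, the two stub backends and the `example-token` password are
hypothetical and only stand in for the real round trips to the auth backend:

```go
package main

import "net/http"

// authFunc asks an auth backend whether the request may read the blob.
// It is a hypothetical stand-in for the real backend round trip.
type authFunc func(r *http.Request) bool

// allowRaw grants access if either way of asking the auth backend succeeds.
func allowRaw(r *http.Request, mainAuth, gitFetchAuth authFunc) bool {
	if mainAuth(r) {
		return true // private token or cookie was accepted
	}
	// Retry only when the original request carries user:password.
	if _, _, ok := r.BasicAuth(); !ok {
		return false
	}
	return gitFetchAuth(r)
}

// rejectAll is a stub main backend that never grants access.
func rejectAll(*http.Request) bool { return false }

// acceptCIToken is a stub git-fetch-style backend that accepts the
// per-project CI user with a made-up token.
func acceptCIToken(r *http.Request) bool {
	user, pass, _ := r.BasicAuth()
	return user == "gitlab-ci-token" && pass == "example-token"
}

// rawRequest builds a GET request for a raw blob, optionally with Basic Auth.
func rawRequest(user, pass string) *http.Request {
	r, err := http.NewRequest("GET", "/nexedi/slapos/raw/master/software/wendelin/software.cfg", nil)
	if err != nil {
		panic(err)
	}
	if user != "" {
		r.SetBasicAuth(user, pass)
	}
	return r
}
```

With these stubs, `allowRaw(rawRequest("gitlab-ci-token", "example-token"),
rejectAll, acceptCIToken)` grants access, while a request without credentials
is rejected without a second backend query.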

4e9e1ba6

NXD blob/auth: Cache auth backend reply for 30s · dc924dc3

Kirill Smelkov authored Dec 09, 2015

[ Sent upstream: https://gitlab.com/gitlab-org/gitlab-workhorse/merge_requests/17

  This patch was sent upstream but was not accepted because of the
  "complexity" of the auth cache, even though it provides more than an order
  of magnitude speedup. We just carry it with us as NXD. ]

In the previous patch we added code to serve blob content by running `git cat-file
...` directly, but for every such request a request to the slow RoR-based auth
backend is made, which is bad for performance.

Let's cache the auth backend reply for a small period of time, e.g. 30 seconds,
which changes the situation dramatically:

If we have a lot of requests to the same repository, we query the auth backend
only for every Nth request. With e.g. 100 raw blob requests/s and a 30 second
cache lifetime, N = 100 * 30 = 3000, which means that the previous load on the
RoR code essentially goes away.

On the other hand, as we still query the auth backend once in a while and
refresh the cache, we will not miss changes in project settings. A potential
delay of e.g. 25 seconds for a project to become public, or vice versa to
become private, does no real harm.

The cache is designed so that the read-side codepath can execute in parallel
and is not blocked by cache updates.
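
A minimal sketch of such a cache, assuming a plain map guarded by
`sync.RWMutex`; this is a swapped-in simplification and not necessarily how the
patch implements it. All names (`authCache`, `cachedReply`, `countingBackend`)
are hypothetical:

```go
package main

import (
	"sync"
	"time"
)

// authCacheTTL is how long a reply is trusted; 30s is the value from the text.
const authCacheTTL = 30 * time.Second

// cachedReply is a hypothetical, simplified auth backend answer.
type cachedReply struct {
	allowed bool
	fetched time.Time
}

// authCache remembers auth backend replies per project for a short time.
type authCache struct {
	mu      sync.RWMutex
	ttl     time.Duration
	entries map[string]cachedReply
	ask     func(project string) bool // the slow auth backend call
}

func newAuthCache(ttl time.Duration, ask func(project string) bool) *authCache {
	return &authCache{ttl: ttl, entries: make(map[string]cachedReply), ask: ask}
}

// allowed returns the cached reply while it is fresh; otherwise it queries
// the backend and refreshes the entry.
func (c *authCache) allowed(project string) bool {
	c.mu.RLock() // readers share the lock, so cache hits run in parallel
	e, hit := c.entries[project]
	c.mu.RUnlock()
	if hit && time.Since(e.fetched) < c.ttl {
		return e.allowed
	}

	reply := c.ask(project) // slow call, made without holding any lock
	c.mu.Lock()             // writers hold the lock only for the map update
	c.entries[project] = cachedReply{allowed: reply, fetched: time.Now()}
	c.mu.Unlock()
	return reply
}

// countingBackend is a stub backend that records how often it is asked.
type countingBackend struct{ calls int }

func (b *countingBackend) ask(project string) bool {
	b.calls++
	return true
}
```

In this sketch several requests that miss at the same moment may each query the
backend once; that keeps the read side simple at the cost of a few duplicate
queries per refresh.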

Overall this improves performance a lot:

  (on an 8-CPU i7-3770S with 16GB of RAM, 2001:67c:1254:e:8b::c776 is on localhost)

  # request is handled by gitlab-workhorse, but without auth caching
  $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
  Running 10s test @ http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
    1 threads and 40 connections
    Thread Stats   Avg      Stdev     Max   +/- Stdev
      Latency   458.42ms   66.26ms 766.12ms   84.76%
      Req/Sec    85.38     16.59   120.00     82.00%
    Latency Distribution
       50%  459.26ms
       75%  490.09ms
       90%  523.95ms
       99%  611.33ms
    853 requests in 10.01s, 1.51MB read
  Requests/sec:     85.18
  Transfer/sec:    154.90KB

  # request goes to gitlab-workhorse with auth caching (this patch)
  $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
  Running 10s test @ http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
    1 threads and 40 connections
    Thread Stats   Avg      Stdev     Max   +/- Stdev
      Latency    34.52ms   19.28ms 288.63ms   74.74%
      Req/Sec     1.20k   127.21     1.39k    85.00%
    Latency Distribution
       50%   32.67ms
       75%   42.73ms
       90%   56.26ms
       99%   99.86ms
    11961 requests in 10.01s, 21.24MB read
  Requests/sec:   1194.51
  Transfer/sec:      2.12MB

i.e. it is a ~14x improvement in throughput (1194.51 vs 85.18 requests/s).

dc924dc3

NXD Teach gitlab-workhorse to serve requests to get raw blobs · 45bbe40c

Kirill Smelkov authored Dec 09, 2015

[ Sent upstream: https://gitlab.com/gitlab-org/gitlab-workhorse/merge_requests/17

  This patch was sent upstream but was not accepted because of the
  "complexity" of the auth cache (next patch), even though it provides more
  than an order of magnitude speedup. We just carry it with us as NXD. ]

Currently GitLab serves requests to get raw blobs via Ruby-on-Rails code and
Unicorn. Because RoR/Unicorn is relatively heavyweight, in an environment with
a lot of simultaneous requests to get raw blobs, this works very slowly and the
server is constantly overloaded.

On the other hand, to get raw blob content we do not need anything from the RoR
framework: we only need access to the project git repository on the filesystem
and to know whether access to the data there should be granted or not. That
means it is possible to handle '.../raw/....' requests directly in the more
lightweight and performant gitlab-workhorse.

As gitlab-workhorse is written in Go, and Go has good concurrency/parallelism
support and is generally much faster than Ruby, moving the raw blob serving
task to it makes sense and should be a net win.

In this patch we add infrastructure to process GET requests for '/raw/...':

- extract project / ref and path from URL
- query auth backend for whether download access should be granted or not
- emit blob content via spawning external `git cat-file`
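
The first and last steps above can be sketched as follows. This is an
illustration under stated assumptions, not the code from the patch:
`blobRequest`, `parseRawURL`, `catFileCmd` and the `gitDir` argument are
hypothetical names, and the ref is assumed to be a single path segment:

```go
package main

import (
	"errors"
	"os/exec"
	"strings"
)

// blobRequest is what a '/raw/...' URL is split into.
type blobRequest struct {
	project string // e.g. "nexedi/slapos"
	ref     string // e.g. "master"
	path    string // e.g. "software/wendelin/software.cfg"
}

// parseRawURL splits "/<project>/raw/<ref>/<path>".
// Assumption: the ref is a single path segment; refs that contain "/"
// would have to be resolved against the repository instead.
func parseRawURL(urlPath string) (blobRequest, error) {
	project, rest, ok := strings.Cut(strings.TrimPrefix(urlPath, "/"), "/raw/")
	if !ok || project == "" {
		return blobRequest{}, errors.New("not a raw blob URL")
	}
	ref, path, ok := strings.Cut(rest, "/")
	if !ok || ref == "" || path == "" {
		return blobRequest{}, errors.New("missing ref or path")
	}
	return blobRequest{project: project, ref: ref, path: path}, nil
}

// catFileCmd prepares `git cat-file blob <ref>:<path>` for the project's
// repository. gitDir is a hypothetical on-disk location of that repository.
// The command is only built here; the caller decides when to run it.
func catFileCmd(gitDir string, b blobRequest) *exec.Cmd {
	return exec.Command("git", "--git-dir="+gitDir, "cat-file", "blob", b.ref+":"+b.path)
}
```

A handler would call `parseRawURL`, ask the auth backend about the project, and
only then connect the command's stdout to the HTTP response and run it.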

I've tried to make the output as close as possible to the one emitted by the
RoR code, with the idea that for users the change should be transparent.

As in this patch we still query the auth backend for every request to get a
blob, the RoR code is still heavily loaded, so essentially there is no speedup
yet:

  (on an 8-CPU i7-3770S with 16GB of RAM, 2001:67c:1254:e:8b::c776 is on localhost)

  # without patch: request eventually goes to unicorn  (9 unicorn workers)
  $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
  Running 10s test @ http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
    1 threads and 40 connections
    Thread Stats   Avg      Stdev     Max   +/- Stdev
      Latency   461.16ms   63.44ms 809.80ms   84.18%
      Req/Sec    84.84     17.02   131.00     80.00%
    Latency Distribution
       50%  460.21ms
       75%  492.83ms
       90%  524.67ms
       99%  636.49ms
    847 requests in 10.01s, 1.57MB read
  Requests/sec:     84.64
  Transfer/sec:    161.10KB

  # with this patch: request handled by gitlab-workhorse
  $ ./wrk -c40 -d10 -t1 --latency http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
  Running 10s test @ http://[2001:67c:1254:e:8b::c776]:7777/nexedi/slapos/raw/master/software/wendelin/software.cfg
    1 threads and 40 connections
    Thread Stats   Avg      Stdev     Max   +/- Stdev
      Latency   458.42ms   66.26ms 766.12ms   84.76%
      Req/Sec    85.38     16.59   120.00     82.00%
    Latency Distribution
       50%  459.26ms
       75%  490.09ms
       90%  523.95ms
       99%  611.33ms
    853 requests in 10.01s, 1.51MB read
  Requests/sec:     85.18
  Transfer/sec:    154.90KB

In the next patch we'll cache requests to auth backend and that will improve
performance dramatically.

NOTE 20160228: there is internal/git/blob.go, which tries to serve raw data
    via gitlab-workhorse but still asks Unicorn about the blob->sha1 mapping
    etc. That work started in

        86aaa133 (Prototype blobs via workhorse, @jacobvosmaer)

    and was inspired by this patch. It diverges from what we can do if we
    serve all blob data by gitlab-workhorse alone (see next patch), so we
    just avoid git/blob.go, put our stuff into git/xblob.go and tweak the
    routes, essentially deactivating the git/blob.go code.

45bbe40c

Version 0.7.5 · 27c9eb8c
Jacob Vosmaer authored Jun 03, 2016

27c9eb8c
Incorporate feedback · 9680c1a2
ZJ van de Weg authored May 30, 2016

9680c1a2
Tests for sending diffs · 6782117d
ZJ van de Weg authored May 19, 2016

6782117d

Diffs served by workhorse · 51ad013f

Zeger-Jan van de Weg authored May 08, 2016

The purpose of this commit is to trigger CI and hope it doesn't get the
'importing internal packages not allowed' error I'm getting while
building.

51ad013f

27 May, 2016 1 commit
- Version 0.7.4 · dc9b6c39
  Jacob Vosmaer authored May 27, 2016
  
  dc9b6c39
25 May, 2016 2 commits
- Merge branch 'unicorn-tming' into 'master' · c3447a94
  Jacob Vosmaer (GitLab) authored May 25, 2016
```
Send timestamps when proxying

For https://gitlab.com/gitlab-com/operations/issues/264

Companion to https://gitlab.com/gitlab-org/gitlab-ce/merge_requests/4278

See merge request !47
```
  c3447a94
- Test for presence of Gitlab-Workhorse headers · 4704ce33
  Jacob Vosmaer authored May 25, 2016
  
  4704ce33
24 May, 2016 1 commit
- Send timestamps when proxying · 504bd633
  Jacob Vosmaer authored May 24, 2016
  
  504bd633
23 May, 2016 3 commits
- Merge branch 'golang-1.6.2' into 'master' · 74cf6dfc
  Jacob Vosmaer (GitLab) authored May 23, 2016
```
Use Go 1.6.2



See merge request !46
```
  74cf6dfc
- Use Go 1.6.2 · df2b1bd8
  Jacob Vosmaer authored May 23, 2016
  
  df2b1bd8
- Version 0.7.3 · 9d5752c4
  Jacob Vosmaer authored May 23, 2016
  
  9d5752c4
02 May, 2016 1 commit
- Add functional test for shallow git clone · d80945d7
  Jacob Vosmaer authored May 02, 2016
  
  d80945d7
29 Apr, 2016 1 commit
- Do not restrict listen socket access by default · 12c8f495
  Jacob Vosmaer authored Apr 29, 2016
```
Closes https://gitlab.com/gitlab-org/gitlab-workhorse/issues/33
```
  12c8f495
28 Apr, 2016 1 commit

Revert "Delay HTTP headers for Git HTTP responses" · 347b9987

Jacob Vosmaer authored Apr 28, 2016

This reverts commit 1fa69c8d. It
turned out that the new behavior broke 'shallow git clone'.

Closes https://gitlab.com/gitlab-org/gitlab-workhorse/issues/36

347b9987

12 Apr, 2016 1 commit
- Nobody should use 0.7.2 · 768967cc
  Jacob Vosmaer authored Apr 12, 2016
  
  768967cc
06 Apr, 2016 1 commit
- Version 0.7.2 · 7a2c97cb
  Jacob Vosmaer authored Apr 06, 2016
  
  7a2c97cb
04 Apr, 2016 4 commits
- Merge branch 'use-gopath' into 'master' · ff480401
  Jacob Vosmaer authored Apr 04, 2016
```
Be compatible with GOPATH



See merge request !43
```
  ff480401
- Newline · fece6a3b
  Jacob Vosmaer authored Apr 04, 2016
  
  fece6a3b
- Fix typo · 721e0c84
  Jacob Vosmaer authored Apr 04, 2016
  
  721e0c84
- Export GOPATH in the Makefile · ad6899b0
  Jacob Vosmaer authored Apr 04, 2016
  
  ad6899b0
23 Mar, 2016 2 commits
- More separating imports · 405504cc
  Jacob Vosmaer authored Mar 23, 2016
  
  405504cc
- Separate imports · d44febe6
  Jacob Vosmaer authored Mar 23, 2016
  
  d44febe6
21 Mar, 2016 3 commits
- Changelog for 0.7.1 · b04210fe
  Jacob Vosmaer authored Mar 21, 2016
  
  b04210fe
- No need to split the line · 85692df5
  Jacob Vosmaer authored Mar 21, 2016
  
  85692df5
- Fix tar exclude · cee86a22
  Jacob Vosmaer authored Mar 21, 2016
  
  cee86a22
19 Mar, 2016 1 commit
- Be compatible with GOPATH · 646de543
  Jacob Vosmaer authored Mar 19, 2016
  
  646de543
10 Mar, 2016 1 commit

Merge branch 'delayed-responsewriter' into 'master' · 285f47a7

Jacob Vosmaer authored Mar 10, 2016

Delay HTTP headers for Git HTTP responses

Alternative to https://gitlab.com/gitlab-org/gitlab-workhorse/merge_requests/38
and https://gitlab.com/gitlab-org/gitlab-workhorse/merge_requests/40 .

See merge request !42

285f47a7

09 Mar, 2016 4 commits
- Forget the buffer early · 56d92dc1
  Jacob Vosmaer authored Mar 09, 2016
  
  56d92dc1
- Change "-" to "+" · b9589afe
  Jacob Vosmaer authored Mar 09, 2016
  
  b9589afe
- Merge branch 'test-guidelines' into 'master' · 8b9f31be
  Jacob Vosmaer authored Mar 09, 2016
```
Add test guidelines



See merge request !39
```
  8b9f31be
- Delay HTTP headers for Git HTTP responses · 1fa69c8d
  Jacob Vosmaer authored Mar 09, 2016
  
  1fa69c8d
08 Mar, 2016 2 commits
- Merge branch 'master' of https://gitlab.com/gitlab-org/gitlab-workhorse · 1500542c
  Jacob Vosmaer authored Mar 08, 2016
  
  1500542c
- Always remove the senddata header · 162424b9
  Jacob Vosmaer authored Mar 08, 2016
  
  162424b9
07 Mar, 2016 1 commit

Merge branch 'content-length-on-raw-blobs' into 'master' · 74efa43f

Jacob Vosmaer authored Mar 07, 2016

Set content-length when sending git blobs

This gives the HTTP client receiving the blob a way to detect some
transmission failures.

See merge request !41

74efa43f

04 Mar, 2016 1 commit
- No 'v' in VERSION · cca59c98
  Jacob Vosmaer authored Mar 04, 2016
  
  cca59c98
03 Mar, 2016 1 commit
- Set content-length when sending git blobs · 1c60885c
  Jacob Vosmaer authored Mar 03, 2016
```
This gives the HTTP client receiving the blob a way to detect some
transmission failures.
```
  1c60885c
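
A minimal sketch of the idea behind setting Content-Length for blobs, assuming
the blob size is known before streaming starts (for example from
`git cat-file -s`). `sendBlob` and `tryOut` are hypothetical names, not the
functions from the commit:

```go
package main

import (
	"io"
	"net/http"
	"net/http/httptest"
	"strconv"
	"strings"
)

// sendBlob announces the blob size before streaming the blob, so a client
// can notice when it received fewer bytes than promised.
func sendBlob(w http.ResponseWriter, size int64, blob io.Reader) error {
	w.Header().Set("Content-Length", strconv.FormatInt(size, 10))
	_, err := io.Copy(w, blob)
	return err
}

// tryOut runs sendBlob against an in-memory recorder and reports the
// Content-Length header and the body it produced.
func tryOut(blob string) (contentLength, body string) {
	rec := httptest.NewRecorder()
	if err := sendBlob(rec, int64(len(blob)), strings.NewReader(blob)); err != nil {
		panic(err)
	}
	return rec.Header().Get("Content-Length"), rec.Body.String()
}
```

A client that reads fewer bytes than the announced length before the
connection ends can treat the download as failed instead of silently accepting
a truncated blob.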