1. 31 Oct, 2017 22 commits
    • Cherry Zhang's avatar
      cmd/compile: don't fold address of global into load/store on PPC64 · 622cfd88
      Cherry Zhang authored
      On PPC64 (and a few other architectures), accessing global
      requires multiple instructions and use of temp register.
      The compiler emits a single MOV prog, and the assembler
      expands it to multiple instructions. If globals are accessed
      multiple times, each time it generates a reload of the temp
      register. As this is done by the assembler, the compiler
      cannot optimize it.
      
      This CL makes the compiler not fold address of global into load
      and store. If a global is accessed multiple times, or multiple
      fields of a struct are accessed, the compiler can CSE the
      address. Currently, this doesn't help the case where different
      globals are accessed, even though they may be close to each
      other in the address space (which we don't know at compile time).
      
      It helps a little bit in go1 benchmark:
      
      name                     old time/op    new time/op    delta
      BinaryTree17-2              4.84s ± 1%     4.84s ± 1%    ~     (p=0.796 n=10+10)
      Fannkuch11-2                4.10s ± 0%     4.08s ± 0%  -0.58%  (p=0.000 n=9+8)
      FmtFprintfEmpty-2          97.9ns ± 1%    96.8ns ± 1%  -1.08%  (p=0.000 n=10+10)
      FmtFprintfString-2          147ns ± 0%     147ns ± 1%    ~     (p=0.129 n=9+10)
      FmtFprintfInt-2             152ns ± 0%     152ns ± 0%    ~     (p=0.294 n=10+8)
      FmtFprintfIntInt-2          218ns ± 1%     217ns ± 0%  -0.64%  (p=0.000 n=10+8)
      FmtFprintfPrefixedInt-2     263ns ± 1%     256ns ± 0%  -2.77%  (p=0.000 n=10+8)
      FmtFprintfFloat-2           375ns ± 1%     368ns ± 0%  -1.95%  (p=0.000 n=10+7)
      FmtManyArgs-2               849ns ± 0%     850ns ± 0%    ~     (p=0.621 n=8+9)
      GobDecode-2                12.3ms ± 1%    12.2ms ± 1%  -0.94%  (p=0.003 n=10+10)
      GobEncode-2                10.3ms ± 1%    10.5ms ± 1%  +2.03%  (p=0.000 n=10+10)
      Gzip-2                      414ms ± 1%     414ms ± 0%    ~     (p=0.842 n=9+10)
      Gunzip-2                   66.3ms ± 0%    66.4ms ± 0%    ~     (p=0.077 n=9+9)
      HTTPClientServer-2         66.3µs ± 5%    66.4µs ± 1%    ~     (p=0.661 n=10+9)
      JSONEncode-2               23.9ms ± 1%    23.9ms ± 1%    ~     (p=0.905 n=10+9)
      JSONDecode-2                119ms ± 1%     116ms ± 0%  -2.65%  (p=0.000 n=10+10)
      Mandelbrot200-2            5.11ms ± 0%    4.92ms ± 0%  -3.71%  (p=0.000 n=10+10)
      GoParse-2                  5.81ms ± 1%    5.84ms ± 1%    ~     (p=0.052 n=10+10)
      RegexpMatchEasy0_32-2       315ns ± 0%     317ns ± 0%  +0.67%  (p=0.000 n=10+10)
      RegexpMatchEasy0_1K-2       658ns ± 0%     638ns ± 0%  -3.01%  (p=0.000 n=9+9)
      RegexpMatchEasy1_32-2       315ns ± 1%     317ns ± 0%  +0.56%  (p=0.000 n=9+9)
      RegexpMatchEasy1_1K-2       935ns ± 0%     926ns ± 0%  -0.96%  (p=0.000 n=9+9)
      RegexpMatchMedium_32-2      394ns ± 0%     396ns ± 1%  +0.46%  (p=0.001 n=10+10)
      RegexpMatchMedium_1K-2     65.1µs ± 0%    64.5µs ± 0%  -0.90%  (p=0.000 n=9+9)
      RegexpMatchHard_32-2       3.16µs ± 0%    3.17µs ± 0%  +0.35%  (p=0.000 n=10+9)
      RegexpMatchHard_1K-2       89.4µs ± 0%    89.3µs ± 0%    ~     (p=0.136 n=9+9)
      Revcomp-2                   703ms ± 2%     694ms ± 2%  -1.41%  (p=0.009 n=10+10)
      Template-2                  107ms ± 1%     107ms ± 1%    ~     (p=0.053 n=9+10)
      TimeParse-2                 526ns ± 0%     524ns ± 0%  -0.34%  (p=0.002 n=9+9)
      TimeFormat-2                534ns ± 0%     504ns ± 1%  -5.51%  (p=0.000 n=10+10)
      [Geo mean]                 93.8µs         93.1µs       -0.70%
      
      It also helps in the case mentioned in issue #17110, main.main
      in package math's test. Now it generates 4 loads of R31 instead
      of 10, for the same piece of code.
      
      This causes a slight increase of binary size: cmd/go increases
      0.66%.
      
      If this is a good idea, we should do it on other architectures
      where accessing global is expensive.
      
      Updates #17110.
      
      Change-Id: I2687af6eafc04f2a57c19781ec300c33567094b6
      Reviewed-on: https://go-review.googlesource.com/68250
      Run-TryBot: Cherry Zhang <cherryyz@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarLynn Boger <laboger@linux.vnet.ibm.com>
      622cfd88
    • Michael Munday's avatar
      cmd/compile: intrinsify math.RoundToEven on s390x · 4745604b
      Michael Munday authored
      The new RoundToEven function can be implemented as a single FIDBR
      instruction on s390x.
      
      name         old time/op  new time/op  delta
      RoundToEven  5.32ns ± 1%  0.86ns ± 1%  -83.86%  (p=0.000 n=10+10)
      
      Change-Id: Iaf597e57a0d1085961701e3c75ff4f6f6dcebb5f
      Reviewed-on: https://go-review.googlesource.com/74350
      Run-TryBot: Michael Munday <mike.munday@ibm.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarKeith Randall <khr@golang.org>
      4745604b
    • Russ Cox's avatar
      cmd/dist: avoid darwin_amd64 assumption in debug prints · 51daa25c
      Russ Cox authored
      Noted in CL 73212 review by crawshaw.
      Neglected to update CL 73212 before submitting.
      
      Also fix printing of target goos/goarch for cross-compile build.
      
      Change-Id: If702f23071a4456810f1de6abb9115b38933c5c1
      Reviewed-on: https://go-review.googlesource.com/74631
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarDavid Crawshaw <crawshaw@golang.org>
      51daa25c
    • Russ Cox's avatar
      cmd/vet: tighten printf format error messages · fc768da8
      Russ Cox authored
      Every time I see an error that begins `missing argument for Fprintf("%s")`
      my mental type-checker goes off, since obviously "%s" is not a valid first
      argument to Fprintf. Writing Printf("%s") to report an error in Printf("hello %s")
      is almost as confusing.
      
      This CL rewords the errors reported by vet's printf check to be more
      consistent with each other, avoid placing context like "in printf call"
      in the middle of the message, and to avoid the imprecisions above by
      not quoting the format string at all.
      
      Before:
      
      	bad.go:9: no formatting directive in Printf call
      	bad.go:10: missing argument for Printf("%s"): format reads arg 1, have only 0 args
      	bad.go:11: wrong number of args for format in Printf call: 1 needed but 2 args
      	bad.go:12: bad syntax for printf argument index: [1]
      	bad.go:13: index value [0] for Printf("%[0]s"); indexes start at 1
      	bad.go:14: missing argument for Printf("%[2]s"): format reads arg 2, have only 1 args
      	bad.go:15: bad syntax for printf argument index: [abc]
      	bad.go:16: unrecognized printf verb 'z'
      	bad.go:17: arg "hello" for * in printf format not of type int
      	bad.go:18: arg fmt.Sprint in printf call is a function value, not a function call
      	bad.go:19: arg fmt.Sprint in Print call is a function value, not a function call
      	bad.go:20: arg "world" for printf verb %d of wrong type: string
      	bad.go:21: missing argument for Printf("%q"): format reads arg 2, have only 1 args
      	bad.go:22: first argument to Print is os.Stderr
      	bad.go:23: Println call ends with newline
      	bad.go:32: arg r in Sprint call causes recursive call to String method
      	bad.go:34: arg r for printf causes recursive call to String method
      
      After:
      
      	bad.go:9: Printf call has arguments but no formatting directives
      	bad.go:10: Printf format %s reads arg #1, but have only 0 args
      	bad.go:11: Printf call needs 1 args but has 2 args
      	bad.go:12: Printf format %[1 is missing closing ]
      	bad.go:13: Printf format has invalid argument index [0]
      	bad.go:14: Printf format has invalid argument index [2]
      	bad.go:15: Printf format has invalid argument index [abc]
      	bad.go:16: Printf format %.234z has unknown verb z
      	bad.go:17: Printf format %.*s uses non-int "hello" as argument of *
      	bad.go:18: Printf format %s arg fmt.Sprint is a func value, not called
      	bad.go:19: Print arg fmt.Sprint is a func value, not called
      	bad.go:20: Printf format %d has arg "world" of wrong type string
      	bad.go:21: Printf format %q reads arg #2, but have only 1 args
      	bad.go:22: Print does not take io.Writer but has first arg os.Stderr
      	bad.go:23: Println args end with redundant newline
      	bad.go:32: Sprint arg r causes recursive call to String method
      	bad.go:34: Sprintf format %s with arg r causes recursive String method call
      
      Change-Id: I5719f0fb9f2cd84df8ad4c7754ab9b79c691b060
      Reviewed-on: https://go-review.googlesource.com/74352
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRob Pike <r@golang.org>
      fc768da8
    • Russ Cox's avatar
      go/importer: support lookup in importer.For · 9aa6f80e
      Russ Cox authored
      The support in this CL assumes that something at a higher level than
      the toolchain-specific importers is taking care of converting imports
      in source code into canonical import paths before invoking the
      toolchain-specific importers. That kind of "what does an import mean"
      as opposed to "find me the import data for this specific path"
      should be provided by higher-level layers.
      
      That's a different layering than the default behavior but matches the
      current layering in the compiler and linker and works with the metadata
      planned for generation by the go command for package management.
      It should also eventually allow the importer code to stop concerning
      itself with source directories and vendor import translation and maybe
      deprecate ImporterFrom in favor of Importer once again. But that's all
      in the future. For now, just make non-nil lookups work, and test that.
      
      Fixes #13847.
      Adds #22550.
      
      Change-Id: I048c6a384492e634988a7317942667689ae680ff
      Reviewed-on: https://go-review.googlesource.com/74354
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRobert Griesemer <gri@golang.org>
      9aa6f80e
    • Kenny Grant's avatar
      time: document that valid layouts are not valid Parse values · 3e887ff7
      Kenny Grant authored
      For #9346 #22135 explicitly state under layout constants
      that they are not valid time values for Parse. Also add
      examples of parsing valid RFC3339 values and the layout
      to the example for time.Parse.
      
      Fix capitalisation of time.Parse and Time.Format.
      
      For #20869 include RFC3339 in the list of layouts that do
      not accept all the time formats allowed by RFCs (lowercase z).
      This does not fully address #20869.
      
      Fixes #9346
      Fixes #22135
      
      Change-Id: Ia4c13e5745de583db5ef7d5b1688d7768bc42c1b
      Reviewed-on: https://go-review.googlesource.com/74231
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      3e887ff7
    • Ramazan AYYILDIZ's avatar
      strings: add examples for specialCase · bc98cea9
      Ramazan AYYILDIZ authored
      Change-Id: Ifa0384722dd879af7f5edb7b7aaac5ede3cff46d
      Reviewed-on: https://go-review.googlesource.com/74690
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      bc98cea9
    • Austin Clements's avatar
      cmd/compile,runtime: update instrumentation comments · 52cf91a5
      Austin Clements authored
      The compiler's instrumentation pass has some out-of-date comments
      about the write barrier and some confusing comments about
      typedslicecopy. Update these comments and add a comment to
      typedslicecopy explaining why it's manually instrumented while none of
      the other operations are.
      
      Change-Id: I024e5361d53f1c3c122db0c85155368a30cabd6b
      Reviewed-on: https://go-review.googlesource.com/74430Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      Reviewed-by: default avatarKeith Randall <khr@golang.org>
      52cf91a5
    • Joe Kyo's avatar
      net/http: display connect methods table properly in `go doc` · 5536180a
      Joe Kyo authored
      When run `go doc -u http.connectMethod`, the whole table is treated as
      a single long line. This commit inserts `\t` at the begining of each line,
      so the table can be displayed properly in `go doc`.
      
      Change-Id: I6408efd31f84c113e81167d62e1791643000d629
      Reviewed-on: https://go-review.googlesource.com/74651Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      5536180a
    • Russ Cox's avatar
      fmt: hide bad format in test from vet · 8f4f1f63
      Russ Cox authored
      Hide in the source code instead of in the separate whitelist.
      Removes the only printf false positive in the standard library.
      
      Change-Id: I99285e67588c7c93bd56d59ee768a03be7c301e7
      Reviewed-on: https://go-review.googlesource.com/74590
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRob Pike <r@golang.org>
      8f4f1f63
    • Russ Cox's avatar
      cmd/vet: do not import net/http at startup · 9364c0e3
      Russ Cox authored
      The httpresponse.go module wants to be able to tell if a particular type t
      is net/http.Response (and also net/http.Client). It does this by importing
      net/http, looking up Response, and then comparing that saved type against
      each t.
      
      Instead of doing an eager import of net/http, wait until we have a type t
      to ask a question about, and then just look to see if that t is http.Response.
      This kind of lazy check does not require assuming that net/http is available
      or will be important (perhaps the check is disabled in this run, or perhaps
      other conditions that lead to the comparison are not satisfied).
      
      Not loading these kinds of types at startup time will scale better.
      
      Change-Id: Ibb00623901a96e725a4ff6f231e6d15127979dfd
      Reviewed-on: https://go-review.googlesource.com/74353
      Run-TryBot: Russ Cox <rsc@golang.org>
      Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      9364c0e3
    • Russ Cox's avatar
      build: quiet make.bash, make.bat, make.rc · c1e026a5
      Russ Cox authored
      The signal-to-noise ratio is too low.
      Stop printing the name of every package.
      Can still get the old output with make.bash -v.
      
      Change-Id: Ib2c82e037166e6d2ddc31ae2a4d29af5becce574
      Reviewed-on: https://go-review.googlesource.com/74351
      Run-TryBot: Russ Cox <rsc@golang.org>
      Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      Reviewed-by: default avatarDavid Crawshaw <crawshaw@golang.org>
      c1e026a5
    • Russ Cox's avatar
      test/run: use go tool compile + link instead of go run when possible · 84dc501d
      Russ Cox authored
      This cuts 6 seconds off all.bash with the new go command.
      Not a ton, but also an easy 6 seconds to grab.
      
      The -tags=use_go_run in the misc/cgo tests is just some
      go command flag that will make run.go use go run,
      but without making everything look stale.
      (Those tests have relative imports,
      so go tool compile+link is not enough.)
      
      Change-Id: I43bf4bb661d3adde2b2d4aad5e8f64b97bc69ba9
      Reviewed-on: https://go-review.googlesource.com/73994Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      84dc501d
    • Russ Cox's avatar
      runtime/race: install alternate packages to temp dir · 2e2047a0
      Russ Cox authored
      The content-based staleness code means that
      
      	go run -gcflags=-l helloworld.go
      
      recompiles all of helloworld.go's dependencies with -gcflags=-l,
      whereas before it would have assumed installed packages were
      up-to-date. In this test, that means every race iteration rebuilds
      the runtime and maybe a few other packages. Instead, install them
      to a temporary location for reuse.
      
      This speeds the test from 17s to 9s on my MacBook Pro.
      
      Change-Id: Ied136ce72650261083bb19cc7dee38dac0ad05ca
      Reviewed-on: https://go-review.googlesource.com/73992Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      2e2047a0
    • Russ Cox's avatar
      runtime: shorten tests in all.bash · 94471f63
      Russ Cox authored
      This cuts 23 seconds from all.bash on my MacBook Pro.
      
      Change-Id: Ibc4d7c01660b9e9ebd088dd55ba993f0d7ec6aa3
      Reviewed-on: https://go-review.googlesource.com/73991Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      94471f63
    • Russ Cox's avatar
      cmd/dist: log timing to $GOBUILDTIMELOGFILE · 6c8418f5
      Russ Cox authored
      We can't make all.bash faster if we can't measure it.
      Measure it.
      
      Change-Id: Ia5da791d4cfbfa1fd9a8e905b3188f63819ade73
      Reviewed-on: https://go-review.googlesource.com/73990Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      6c8418f5
    • Russ Cox's avatar
      cmd/go: switch to entirely content-based staleness determination · 7dea5097
      Russ Cox authored
      This CL changes the go command to base all its rebuilding decisions
      on the content of the files being processed and not their file system
      modification times. It also eliminates the special handling of release
      toolchains, which were previously considered always up-to-date
      because modification time order could not be trusted when unpacking
      a pre-built release.
      
      The go command previously tracked "build IDs" as a backup to
      modification times, to catch changes not reflected in modification times.
      For example, if you remove one .go file in a package with multiple .go
      files, there is no modification time remaining in the system that indicates
      that the installed package is out of date. The old build ID was the hash
      of a list of file names and a few other factors, expected to change if
      those factors changed.
      
      This CL moves to using this kind of build ID as the only way to
      detect staleness, making sure that the build ID hash includes all
      possible factors that need to influence the rebuild decision.
      
      One such factor is the compiler flags. As of this CL, if you run
      
      	go build -gcflags -N cmd/gofmt
      
      you will get a gofmt where every package is built with -N,
      regardless of what may or may not be installed already.
      
      Another such factor is the linker flags. As of this CL, if you run
      
      	go install myprog
      	go install -ldflags=-s myprog
      
      the second go install will now correctly build a new myprog with
      the updated linker flags. (Previously the installed myprog appeared
      up-to-date, because the ldflags were not included in the build ID.)
      
      Because we have more precise information we can also validate whether
      the target of a "go test -c" operation is already the right binary and
      therefore can avoid a rebuild.
      
      This CL sets us up for having a more general build artifact cache,
      maybe even a step toward not having a pkg directory with .a files,
      but this CL does not take that step. For now the result of go install
      is the same as it ever was; we just do a better job of what needs to
      be installed.
      
      This CL does slow down builds a small amount by reading all the
      dependent source files in full. (The go command already read the
      beginning of every dependent source file to discover build tags
      and imports.) On my MacBook Pro, before this CL all.bash takes
      3m58s, while after this CL and a few optimizations stacked above it
      all.bash takes 4m28s. Given that CL 73850 cut 1m43s off the all.bash
      time earlier today, we can afford adding 30s back for now.
      More optimizations are planned that should make the go command
      more efficient than it was even before this CL.
      
      Fixes #15799.
      Fixes #18369.
      Fixes #19340.
      Fixes #21477.
      
      Change-Id: I10d7ca0e31ca3f58aabb9b1f11e2e3d9d18f0bc9
      Reviewed-on: https://go-review.googlesource.com/73212
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarDavid Crawshaw <crawshaw@golang.org>
      7dea5097
    • Russ Cox's avatar
      cmd/compile: change ssa test to avoid go run -gcflags=-d=ssa/check/on · 4b5018ce
      Russ Cox authored
      In the new content-based staleness world, setting -gcflags like this
      recompiles all the packages involved in running the program, not just
      the "stale" ones. So go run -gcflags=-d=ssa/check/on recompiles
      runtime with those flags too, which is not what the test is trying
      to check.
      
      Change-Id: I4dbd5bf2970c3a622c01de84bd8aa9d5e9ec5239
      Reviewed-on: https://go-review.googlesource.com/74570
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarDavid Crawshaw <crawshaw@golang.org>
      4b5018ce
    • Russ Cox's avatar
      all: respect $GO_GCFLAGS during run.bash · 2beb173e
      Russ Cox authored
      If the go install doesn't use the same flags as the main build
      it can overwrite the installed standard library, leading to
      flakiness and slow future tests.
      
      Force uses of 'go install' etc to propagate $GO_GCFLAGS
      or disable them entirely, to avoid problems.
      
      As I understand it, the main place this happens is the ssacheck builder.
      If there are other uses that need to run some of the now-disabled
      tests we can reenable fixed tests in followup CLs.
      
      Change-Id: Ib860a253539f402f8a96a3c00ec34f0bbf137c9a
      Reviewed-on: https://go-review.googlesource.com/74470Reviewed-by: default avatarDavid Crawshaw <crawshaw@golang.org>
      2beb173e
    • Tim Cooper's avatar
      flag: add (*FlagSet).Name, (*FlagSet).ErrorHandling, export (*FlagSet).Output · 99be9cc0
      Tim Cooper authored
      Allows code that operates on a FlagSet to know the name and error
      handling behavior of the FlagSet without having to call FlagSet.Init.
      
      Fixes #17628
      Fixes #21888
      
      Change-Id: Ib0fe4c8885f9ccdacf5a7fb761d5ecb23f3bb055
      Reviewed-on: https://go-review.googlesource.com/70391
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      99be9cc0
    • Jason Wangsadinata's avatar
      container/ring: fix example_test.go · 26e49e69
      Jason Wangsadinata authored
      The Len method is a linear operation. CL 73090 used Len to iterate over
      a ring, resulting in a quadratic time operation.
      
      Change-Id: Ib69c19190ba648311e6c345d8cb26292b50121ee
      Reviewed-on: https://go-review.googlesource.com/74390
      Run-TryBot: Ian Lance Taylor <iant@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      26e49e69
    • Ian Lance Taylor's avatar
      os: clarify that OpenFile reqires one of O_RDONLY/O_WRONLY/O_RDWR · 94d93717
      Ian Lance Taylor authored
      Fixes #21322.
      
      Change-Id: Ia589c576be0b5cdb7cde5d35cd857ad7c93c372b
      Reviewed-on: https://go-review.googlesource.com/74550Reviewed-by: default avatarRobert Griesemer <gri@golang.org>
      94d93717
  2. 30 Oct, 2017 18 commits
    • Michael Munday's avatar
      cmd/asm, cmd/internal/obj/s390x, math: add "test under mask" instructions · c2801265
      Michael Munday authored
      Adds the following s390x test under mask (immediate) instructions:
      
      TMHH
      TMHL
      TMLH
      TMLL
      
      These are useful for testing bits and are already used in the math package.
      
      Change-Id: Idffb3f83b238dba76ac1e42ac6b0bf7f1d11bea2
      Reviewed-on: https://go-review.googlesource.com/41092
      Run-TryBot: Michael Munday <mike.munday@ibm.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarCherry Zhang <cherryyz@google.com>
      c2801265
    • Michael Munday's avatar
      cmd/asm, cmd/compile: optimize math.Abs and math.Copysign on s390x · 96cdacb9
      Michael Munday authored
      This change adds three new instructions:
      
      - LPDFR: load positive (math.Abs(x))
      - LNDFR: load negative (-math.Abs(x))
      - CPSDR: copy sign (math.Copysign(x, y))
      
      By making use of GPR <-> FPR moves we can now compile math.Abs and
      math.Copysign to these instructions using SSA rules.
      
      This CL also adds new rules to merge address generation into combined
      load operations. This makes GPR <-> FPR move matching more reliable.
      
      name                 old time/op  new time/op  delta
      Copysign             1.85ns ± 0%  1.40ns ± 1%  -24.65%  (p=0.000 n=8+10)
      Abs                  1.58ns ± 1%  0.73ns ± 1%  -53.64%  (p=0.000 n=10+10)
      
      The geo mean improvement for all math package benchmarks was 4.6%.
      
      Change-Id: I0cec35c5c1b3fb45243bf666b56b57faca981bc9
      Reviewed-on: https://go-review.googlesource.com/73950
      Run-TryBot: Michael Munday <mike.munday@ibm.com>
      Reviewed-by: default avatarKeith Randall <khr@golang.org>
      96cdacb9
    • Bill O'Farrell's avatar
      runtime: remove unnecessary sync from publicationBarrier on s390x · 7fff1db0
      Bill O'Farrell authored
      Memory accesses on z are at least as ordered as they are on AMD64.
      
      Change-Id: Ia515430e571ebd07e9314de05c54dc992ab76b95
      Reviewed-on: https://go-review.googlesource.com/74010
      Run-TryBot: Michael Munday <mike.munday@ibm.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarMichael Munday <mike.munday@ibm.com>
      7fff1db0
    • Matthew Dempsky's avatar
      cmd/compile: skip compiling wrappers for imported defined types · 03c8c566
      Matthew Dempsky authored
      When compiling a package that defines a type T with method T.M, we
      already compile and emit the wrapper method (*T).M. There's no need
      for every package that uses T to do the same.
      
      Change-Id: I3ca2659029907570f8b98d66111686435fad7ed0
      Reviewed-on: https://go-review.googlesource.com/74412
      Run-TryBot: Matthew Dempsky <mdempsky@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarDavid Crawshaw <crawshaw@golang.org>
      03c8c566
    • Mark Theunissen's avatar
      net/url: preserve leading slashes when resolving path · 84e91e1d
      Mark Theunissen authored
      When doing resolvePath, if there are multiple leading slashes in the
      target, preserve them. This prevents an issue where the Go http.Client
      cleans up multiple leading slashes in the Location header in a
      redirect, resulting in a redirection to the incorrect target.
      
      Fixes #21158.
      
      Change-Id: I6a21ea61ca3bc7033f3c8a6ccc21ecaa3e996fa8
      Reviewed-on: https://go-review.googlesource.com/51050Reviewed-by: default avatarRuss Cox <rsc@golang.org>
      Run-TryBot: Russ Cox <rsc@golang.org>
      84e91e1d
    • David Chase's avatar
      cmd/compile: adjust expectations of test for issue 18902 · b4c3fe7b
      David Chase authored
      The test for #18902 reads the assembly stream to be sure
      that the line number does not change too often (this is an
      indication that debugging the code will be unpleasant and
      that the compiler is probably getting line numbers "wrong").
      
      It checks that it is getting "enough" input, but the
      compiler has gotten enough better since the test was written
      that it now fails for lack of enough input.  The old
      threshould was 200 instructions, the new one is 150 (the
      minimum observed input is on arm64 with 184 instructions).
      
      Fixes #22494.
      
      Change-Id: Ibba7e9ff4ab6a7be369e5dd5859d150b7db94653
      Reviewed-on: https://go-review.googlesource.com/74357
      Run-TryBot: David Chase <drchase@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarAustin Clements <austin@google.com>
      b4c3fe7b
    • Keith Randall's avatar
      cmd/compile: fix runtime.KeepAlive · 0153a413
      Keith Randall authored
      KeepAlive needs to introduce a use of the spill of the
      value it is keeping alive.  Without that, we don't guarantee
      that the spill dominates the KeepAlive.
      
      This bug was probably introduced with the code to move spills
      down to the dominator of the restores, instead of always spilling
      just after the value itself (CL 34822).
      
      Fixes #22458.
      
      Change-Id: I94955a21960448ffdacc4df775fe1213967b1d4c
      Reviewed-on: https://go-review.googlesource.com/74210Reviewed-by: default avatarCherry Zhang <cherryyz@google.com>
      Reviewed-by: default avatarDavid Chase <drchase@google.com>
      0153a413
    • Russ Cox's avatar
      cmd/dist: force non-devel version for cross-build buildlets · b09e2de7
      Russ Cox authored
      If the compiler has a non-devel version it will report that version
      to the go command for use as the "compiler ID" instead of using
      the content ID of the binary. This in turn allows the go command
      to see the compiled-for-amd64 arm compiler and the compiled-for-arm
      arm compiler as having the same ID, so that packages cross-compiled
      from amd64 look up-to-date when copied to the arm system
      during the linux-arm buildlets and trybots.
      
      Change-Id: I76cbf129303941f8e31bdb100e263478159ddaa5
      Reviewed-on: https://go-review.googlesource.com/74360
      Run-TryBot: Russ Cox <rsc@golang.org>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarCherry Zhang <cherryyz@google.com>
      b09e2de7
    • Michael Munday's avatar
      math: optimize dim and remove s390x assembly implementation · b97688d1
      Michael Munday authored
      By calculating dim directly, rather than calling max, we can simplify
      the generated code significantly. The compiler now reports that dim
      is easily inlineable, but it can't be inlined because there is still
      an assembly stub for Dim.
      
      Since dim is now very simple I no longer think it is worth having
      assembly implementations of it. I have therefore removed the s390x
      assembly. Removing the other assembly for Dim is #21913.
      
      name  old time/op  new time/op  delta
      Dim   4.29ns ± 0%  3.53ns ± 0%  -17.62%  (p=0.000 n=9+8)
      
      Change-Id: Ic38a6b51603cbc661dcdb868ecf2b1947e9f399e
      Reviewed-on: https://go-review.googlesource.com/64194
      Run-TryBot: Michael Munday <mike.munday@ibm.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRobert Griesemer <gri@golang.org>
      b97688d1
    • Sam Whited's avatar
      encoding/xml: don't panic when custom Unmarshaler sees StartElement · be08ddbf
      Sam Whited authored
      Change-Id: I90aa0a983abd0080f3de75d3340fdb15c1f9ca35
      Reviewed-on: https://go-review.googlesource.com/70891Reviewed-by: default avatarIan Lance Taylor <iant@golang.org>
      Run-TryBot: Sam Whited <sam@samwhited.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      be08ddbf
    • Terin Stock's avatar
      net/http/pprof: attach handlers using http.HandleFunc · 01c144c4
      Terin Stock authored
      Simplify how pprof attaches the handlers to the DefaultMux by using
      http.HandleFunc instead of manually wrapping the handlers in
      a http.HandlerFunc.
      
      Change-Id: I65db262ebb2e29e4b6f30df9d2688f5daf782c29
      Reviewed-on: https://go-review.googlesource.com/71251Reviewed-by: default avatarSam Whited <sam@samwhited.com>
      Reviewed-by: default avatarTom Bergan <tombergan@google.com>
      Run-TryBot: Sam Whited <sam@samwhited.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      01c144c4
    • Austin Clements's avatar
      runtime: use buffered write barrier for bulkBarrierPreWrite · 877387e3
      Austin Clements authored
      This modifies bulkBarrierPreWrite to use the buffered write barrier
      instead of the eager write barrier. This reduces the number of system
      stack switches and sanity checks by a factor of the buffer size
      (currently 256). This affects both typedmemmove and typedmemclr.
      
      Since this is purely a runtime change, it applies to all arches
      (unlike the pointer write barrier).
      
      name                 old time/op  new time/op  delta
      BulkWriteBarrier-12  7.33ns ± 6%  4.46ns ± 9%  -39.10%  (p=0.000 n=20+19)
      
      Updates #22460.
      
      Change-Id: I6a686a63bbf08be02b9b97250e37163c5a90cdd8
      Reviewed-on: https://go-review.googlesource.com/73832
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRick Hudson <rlh@golang.org>
      877387e3
    • Austin Clements's avatar
      runtime: simplify and optimize typedslicecopy · 6a5f1e58
      Austin Clements authored
      Currently, typedslicecopy meticulously performs a typedmemmove on
      every element of the slice. This probably used to be necessary because
      we only had an individual element's type, but now we use the heap
      bitmap, so we only need to know whether the type has any pointers and
      how big it is. Hence, this CL rewrites typedslicecopy to simply
      perform one bulk barrier and one memmove.
      
      This also has a side-effect of eliminating two unnecessary write
      barriers per slice element that were coming from updates to dstp and
      srcp, which were stored in the parent stack frame. However, most of
      the win comes from eliminating the loops.
      
      name                 old time/op  new time/op  delta
      BulkWriteBarrier-12  7.83ns ±10%  7.33ns ± 6%  -6.45%  (p=0.000 n=20+20)
      
      Updates #22460.
      
      Change-Id: Id3450e9f36cc8e0892f268319b136f0d8f5464b8
      Reviewed-on: https://go-review.googlesource.com/73831
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRick Hudson <rlh@golang.org>
      6a5f1e58
    • Austin Clements's avatar
      runtime: benchmark for bulk write barriers · f96b95bc
      Austin Clements authored
      This adds a benchmark of typedslicecopy and its bulk write barriers.
      
      For #22460.
      
      Change-Id: I439ca3b130bb22944468095f8f18b464e5bb43ca
      Reviewed-on: https://go-review.googlesource.com/74051
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRick Hudson <rlh@golang.org>
      f96b95bc
    • Austin Clements's avatar
      cmd/compile: compiler support for buffered write barrier · 7e343134
      Austin Clements authored
      This CL implements the compiler support for calling the buffered write
      barrier added by the previous CL.
      
      Since the buffered write barrier is only implemented on amd64 right
      now, this still supports the old, eager write barrier as well. There's
      little overhead to supporting both and this way a few tests in
      test/fixedbugs that expect to have liveness maps at write barrier
      calls can easily opt-in to the old, eager barrier.
      
      This significantly improves the performance of the write barrier:
      
      name             old time/op  new time/op  delta
      WriteBarrier-12  73.5ns ±20%  19.2ns ±27%  -73.90%  (p=0.000 n=19+18)
      
      It also reduces the size of binaries because the write barrier call is
      more compact:
      
      name        old object-bytes  new object-bytes  delta
      Template           398k ± 0%         393k ± 0%  -1.14%  (p=0.008 n=5+5)
      Unicode            208k ± 0%         206k ± 0%  -1.00%  (p=0.008 n=5+5)
      GoTypes           1.18M ± 0%        1.15M ± 0%  -2.00%  (p=0.008 n=5+5)
      Compiler          4.05M ± 0%        3.88M ± 0%  -4.26%  (p=0.008 n=5+5)
      SSA               8.25M ± 0%        8.11M ± 0%  -1.59%  (p=0.008 n=5+5)
      Flate              228k ± 0%         224k ± 0%  -1.83%  (p=0.008 n=5+5)
      GoParser           295k ± 0%         284k ± 0%  -3.62%  (p=0.008 n=5+5)
      Reflect           1.00M ± 0%        0.99M ± 0%  -0.70%  (p=0.008 n=5+5)
      Tar                339k ± 0%         333k ± 0%  -1.67%  (p=0.008 n=5+5)
      XML                404k ± 0%         395k ± 0%  -2.10%  (p=0.008 n=5+5)
      [Geo mean]         704k              690k       -2.00%
      
      name        old exe-bytes     new exe-bytes     delta
      HelloSize         1.05M ± 0%        1.04M ± 0%  -1.55%  (p=0.008 n=5+5)
      
      https://perf.golang.org/search?q=upload:20171027.1
      
      (Amusingly, this also reduces compiler allocations by 0.75%, which,
      combined with the better write barrier, speeds up the compiler overall
      by 2.10%. See the perf link.)
      
      It slightly improves the performance of most of the go1 benchmarks and
      improves the performance of the x/benchmarks:
      
      name                      old time/op    new time/op    delta
      BinaryTree17-12              2.40s ± 1%     2.47s ± 1%  +2.69%  (p=0.000 n=19+19)
      Fannkuch11-12                2.95s ± 0%     2.95s ± 0%  +0.21%  (p=0.000 n=20+19)
      FmtFprintfEmpty-12          41.8ns ± 4%    41.4ns ± 2%  -1.03%  (p=0.014 n=20+20)
      FmtFprintfString-12         68.7ns ± 2%    67.5ns ± 1%  -1.75%  (p=0.000 n=20+17)
      FmtFprintfInt-12            79.0ns ± 3%    77.1ns ± 1%  -2.40%  (p=0.000 n=19+17)
      FmtFprintfIntInt-12          127ns ± 1%     123ns ± 3%  -3.42%  (p=0.000 n=20+20)
      FmtFprintfPrefixedInt-12     152ns ± 1%     150ns ± 1%  -1.02%  (p=0.000 n=18+17)
      FmtFprintfFloat-12           211ns ± 1%     209ns ± 0%  -0.99%  (p=0.000 n=20+16)
      FmtManyArgs-12               500ns ± 0%     496ns ± 0%  -0.73%  (p=0.000 n=17+20)
      GobDecode-12                6.44ms ± 1%    6.53ms ± 0%  +1.28%  (p=0.000 n=20+19)
      GobEncode-12                5.46ms ± 0%    5.46ms ± 1%    ~     (p=0.550 n=19+20)
      Gzip-12                      220ms ± 1%     216ms ± 0%  -1.75%  (p=0.000 n=19+19)
      Gunzip-12                   38.8ms ± 0%    38.6ms ± 0%  -0.30%  (p=0.000 n=18+19)
      HTTPClientServer-12         79.0µs ± 1%    78.2µs ± 1%  -1.01%  (p=0.000 n=20+20)
      JSONEncode-12               11.9ms ± 0%    11.9ms ± 0%  -0.29%  (p=0.000 n=20+19)
      JSONDecode-12               52.6ms ± 0%    52.2ms ± 0%  -0.68%  (p=0.000 n=19+20)
      Mandelbrot200-12            3.69ms ± 0%    3.68ms ± 0%  -0.36%  (p=0.000 n=20+20)
      GoParse-12                  3.13ms ± 1%    3.18ms ± 1%  +1.67%  (p=0.000 n=19+20)
      RegexpMatchEasy0_32-12      73.2ns ± 1%    72.3ns ± 1%  -1.19%  (p=0.000 n=19+18)
      RegexpMatchEasy0_1K-12       241ns ± 0%     239ns ± 0%  -0.83%  (p=0.000 n=17+16)
      RegexpMatchEasy1_32-12      68.6ns ± 1%    69.0ns ± 1%  +0.47%  (p=0.015 n=18+16)
      RegexpMatchEasy1_1K-12       364ns ± 0%     361ns ± 0%  -0.67%  (p=0.000 n=16+17)
      RegexpMatchMedium_32-12      104ns ± 1%     103ns ± 1%  -0.79%  (p=0.001 n=20+15)
      RegexpMatchMedium_1K-12     33.8µs ± 3%    34.0µs ± 2%    ~     (p=0.267 n=20+19)
      RegexpMatchHard_32-12       1.64µs ± 1%    1.62µs ± 2%  -1.25%  (p=0.000 n=19+18)
      RegexpMatchHard_1K-12       49.2µs ± 0%    48.7µs ± 1%  -0.93%  (p=0.000 n=19+18)
      Revcomp-12                   391ms ± 5%     396ms ± 7%    ~     (p=0.154 n=19+19)
      Template-12                 63.1ms ± 0%    59.5ms ± 0%  -5.76%  (p=0.000 n=18+19)
      TimeParse-12                 307ns ± 0%     306ns ± 0%  -0.39%  (p=0.000 n=19+17)
      TimeFormat-12                325ns ± 0%     323ns ± 0%  -0.50%  (p=0.000 n=19+19)
      [Geo mean]                  47.3µs         46.9µs       -0.67%
      
      https://perf.golang.org/search?q=upload:20171026.1
      
      name                       old time/op  new time/op  delta
      Garbage/benchmem-MB=64-12  2.25ms ± 1%  2.20ms ± 1%  -2.31%  (p=0.000 n=18+18)
      HTTP-12                    12.6µs ± 0%  12.6µs ± 0%  -0.72%  (p=0.000 n=18+17)
      JSON-12                    11.0ms ± 0%  11.0ms ± 1%  -0.68%  (p=0.000 n=17+19)
      
      https://perf.golang.org/search?q=upload:20171026.2
      
      Updates #14951.
      Updates #22460.
      
      Change-Id: Id4c0932890a1d41020071bec73b8522b1367d3e7
      Reviewed-on: https://go-review.googlesource.com/73712
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarCherry Zhang <cherryyz@google.com>
      7e343134
    • Austin Clements's avatar
      runtime: buffered write barrier implementation · e9079a69
      Austin Clements authored
      This implements runtime support for buffered write barriers on amd64.
      The buffered write barrier has a fast path that simply enqueues
      pointers in a per-P buffer. Unlike the current write barrier, this
      fast path is *not* a normal Go call and does not require the compiler
      to spill general-purpose registers or put arguments on the stack. When
      the buffer fills up, the write barrier takes the slow path, which
      spills all general purpose registers and flushes the buffer. We don't
      allow safe-points or stack splits while this frame is active, so it
      doesn't matter that we have no type information for the spilled
      registers in this frame.
      
      One minor complication is cgocheck=2 mode, which uses the write
      barrier to detect Go pointers being written to non-Go memory. We
      obviously can't buffer this, so instead we set the buffer to its
      minimum size, forcing the write barrier into the slow path on every
      call. For this specific case, we pass additional information as
      arguments to the flush function. This also requires enabling the cgo
      write barrier slightly later during runtime initialization, after Ps
      (and the per-P write barrier buffers) have been initialized.
      
      The code in this CL is not yet active. The next CL will modify the
      compiler to generate calls to the new write barrier.
      
      This reduces the average cost of the write barrier by roughly a factor
      of 4, which will pay for the cost of having it enabled more of the
      time after we make the GC pacer less aggressive. (Benchmarks will be
      in the next CL.)
      
      Updates #14951.
      Updates #22460.
      
      Change-Id: I396b5b0e2c5e5c4acfd761a3235fd15abadc6cb1
      Reviewed-on: https://go-review.googlesource.com/73711
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRick Hudson <rlh@golang.org>
      e9079a69
    • Austin Clements's avatar
      runtime: add benchmark for write barriers · 1e8ab99b
      Austin Clements authored
      For #22460.
      
      Change-Id: I798f26d45bbe1efd16b632e201413cb26cb3e6c7
      Reviewed-on: https://go-review.googlesource.com/73811
      Run-TryBot: Austin Clements <austin@google.com>
      TryBot-Result: Gobot Gobot <gobot@golang.org>
      Reviewed-by: default avatarRick Hudson <rlh@golang.org>
      1e8ab99b
    • Austin Clements's avatar
      runtime: make systemstack tail call if already switched · 15d6ab69
      Austin Clements authored
      Currently systemstack always calls its argument, even if we're already
      on the system stack. Unfortunately, traceback with _TraceJump stops at
      the first systemstack it sees, which often cuts off runtime stacks
      early in profiles.
      
      Fix this by performing a tail call if we're already on the system
      stack. This eliminates it from the traceback entirely, so it won't
      stop prematurely (or all get mushed into a single node in the profile
      graph).
      
      Change-Id: Ibc69e8765e899f8d3806078517b8c7314da196f4
      Reviewed-on: https://go-review.googlesource.com/74050Reviewed-by: default avatarCherry Zhang <cherryyz@google.com>
      Reviewed-by: default avatarKeith Randall <khr@golang.org>
      15d6ab69