Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
G
gitlab-ce
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
1
Merge Requests
1
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
nexedi
gitlab-ce
Commits
1ac6a03f
Commit
1ac6a03f
authored
May 03, 2018
by
Jacob Schatz
Committed by
Micaël Bergeron
Jun 22, 2018
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Use enumerators to save memory instead of just plain old `each`
parent
c88f66e5
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
52 additions
and
53 deletions
+52
-53
lib/assets/pseudonymity_dump.yml
lib/assets/pseudonymity_dump.yml
+41
-41
lib/pseudonymity/table.rb
lib/pseudonymity/table.rb
+10
-12
lib/tasks/gitlab/db.rake
lib/tasks/gitlab/db.rake
+1
-0
No files found.
lib/assets/pseudonymity_dump.yml
View file @
1ac6a03f
...
...
@@ -6,7 +6,7 @@ tables:
-
user_id
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
merge_request_id
-
user_id
...
...
@@ -17,7 +17,7 @@ tables:
-
group_id
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
group_id
board_assignees
:
...
...
@@ -25,7 +25,7 @@ tables:
-
id
-
board_id
-
assignee_id
anon
:
pseudo
:
-
id
-
board_id
-
assignee_id
...
...
@@ -34,7 +34,7 @@ tables:
-
id
-
board_id
-
label_id
anon
:
pseudo
:
-
id
-
board_id
-
label_id
...
...
@@ -47,7 +47,7 @@ tables:
-
milestone_id
-
group_id
-
weight
anon
:
pseudo
:
-
id
-
project_id
-
milestone_id
...
...
@@ -58,7 +58,7 @@ tables:
-
epic_id
-
issue_id
-
relative_position
anon
:
pseudo
:
-
id
-
epic_id
-
issue_id
...
...
@@ -68,7 +68,7 @@ tables:
-
epic_id
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
epics
:
whitelist
:
...
...
@@ -89,7 +89,7 @@ tables:
-
updated_at
-
title
-
description
anon
:
pseudo
:
-
id
-
milestone_id
-
group_id
...
...
@@ -109,7 +109,7 @@ tables:
whitelist
:
-
user_id
-
issue_id
anon
:
pseudo
:
-
user_id
-
issue_id
issue_links
:
...
...
@@ -119,7 +119,7 @@ tables:
-
target_id
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
source_id
-
target_id
...
...
@@ -132,7 +132,7 @@ tables:
-
first_added_to_board_at
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
issue_id
issues
:
...
...
@@ -157,7 +157,7 @@ tables:
-
discussion_locked
-
closed_at
-
closed_by_id
anon
:
pseudo
:
-
id
-
title
-
author_id
...
...
@@ -177,7 +177,7 @@ tables:
-
target_type
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
label_id
-
target_id
...
...
@@ -189,7 +189,7 @@ tables:
-
priority
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
project_id
-
label_id
...
...
@@ -204,7 +204,7 @@ tables:
-
template
-
type
-
group_id
anon
:
pseudo
:
-
id
-
title
-
color
...
...
@@ -219,7 +219,7 @@ tables:
-
id
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
merge_request_diff_commits
:
whitelist
:
...
...
@@ -231,7 +231,7 @@ tables:
-
author_email
-
committer_name
-
committer_email
anon
:
pseudo
:
-
merge_request_diff_id
-
author_name
-
author_email
...
...
@@ -247,7 +247,7 @@ tables:
-
too_large
-
a_mode
-
b_mode
anon
:
pseudo
:
-
merge_request_diff_id
merge_request_diffs
:
whitelist
:
...
...
@@ -261,7 +261,7 @@ tables:
-
head_commit_sha
-
start_commit_sha
-
commits_count
anon
:
pseudo
:
-
id
-
merge_request_id
-
base_commit_sha
...
...
@@ -281,7 +281,7 @@ tables:
-
merged_by_id
-
latest_closed_by_id
-
latest_closed_at
anon
:
pseudo
:
-
id
-
merge_request_id
-
pipeline_id
...
...
@@ -316,7 +316,7 @@ tables:
-
discussion_locked
-
latest_merge_request_diff_id
-
allow_maintainer_to_push
anon
:
pseudo
:
-
id
-
target_branch
-
source_branch
...
...
@@ -337,7 +337,7 @@ tables:
-
issue_id
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
merge_request_id
-
issue_id
...
...
@@ -351,7 +351,7 @@ tables:
-
state
-
start_date
-
group_id
anon
:
pseudo
:
-
id
-
project_id
-
group_id
...
...
@@ -361,7 +361,7 @@ tables:
-
namespace_id
-
shared_runners_seconds
-
shared_runners_seconds_last_reset
anon
:
pseudo
:
-
id
-
namespace_id
namespaces
:
...
...
@@ -391,7 +391,7 @@ tables:
-
two_factor_grace_period
-
plan_id
-
project_creation_level
anon
:
pseudo
:
-
id
-
name
-
path
...
...
@@ -441,7 +441,7 @@ tables:
-
cached_markdown_version
-
change_position
-
resolved_by_push
anon
:
pseudo
:
-
id
-
note
-
noteable_type
...
...
@@ -489,7 +489,7 @@ tables:
-
success_pipeline
-
push_to_merge_request
-
issue_due
anon
:
pseudo
:
-
id
-
user_id
-
source_id
...
...
@@ -516,7 +516,7 @@ tables:
-
user_id
-
project_id
-
access_level
anon
:
pseudo
:
-
user_id
-
project_id
-
access_level
...
...
@@ -528,7 +528,7 @@ tables:
-
updated_at
-
enabled
-
domain
anon
:
pseudo
:
-
id
-
project_id
-
created_at
...
...
@@ -540,7 +540,7 @@ tables:
-
id
-
project_id
-
group_runners_enabled
anon
:
pseudo
:
-
id
-
project_id
-
group_runners_enabled
...
...
@@ -552,7 +552,7 @@ tables:
-
project_id
-
key
-
value
anon
:
pseudo
:
-
id
-
created_at
-
updated_at
...
...
@@ -565,7 +565,7 @@ tables:
-
project_id
-
deploy_token_id
-
created_at
anon
:
pseudo
:
-
id
-
project_id
-
deploy_token_id
...
...
@@ -582,7 +582,7 @@ tables:
-
created_at
-
updated_at
-
repository_access_level
anon
:
pseudo
:
-
id
-
project_id
-
merge_requests_access_level
...
...
@@ -602,7 +602,7 @@ tables:
-
updated_at
-
group_access
-
expires_at
anon
:
pseudo
:
-
id
-
project_id
-
group_id
...
...
@@ -618,7 +618,7 @@ tables:
-
encrypted_credentials
-
encrypted_credentials_iv
-
encrypted_credentials_salt
anon
:
pseudo
:
-
id
-
project_id
-
data
...
...
@@ -635,7 +635,7 @@ tables:
-
next_execution_timestamp
-
created_at
-
updated_at
anon
:
pseudo
:
-
id
-
project_id
-
retry_count
...
...
@@ -652,7 +652,7 @@ tables:
-
wiki_verification_checksum
-
last_repository_verification_failure
-
last_wiki_verification_failure
anon
:
pseudo
:
-
id
-
project_id
-
repository_verification_checksum
...
...
@@ -671,7 +671,7 @@ tables:
-
build_artifacts_size
-
shared_runners_seconds
-
shared_runners_seconds_last_reset
anon
:
pseudo
:
-
id
-
project_id
-
namespace_id
...
...
@@ -753,7 +753,7 @@ tables:
-
external_authorization_classification_label
-
external_webhook_token
-
pages_https_only
anon
:
pseudo
:
-
id
-
name
-
path
...
...
@@ -833,7 +833,7 @@ tables:
-
created_at
-
updated_at
-
project_id
anon
:
pseudo
:
-
id
-
user_id
-
subscribable_id
...
...
@@ -901,7 +901,7 @@ tables:
-
preferred_language
-
rss_token
-
theme_id
anon
:
pseudo
:
-
id
-
email
-
remember_created_at
...
...
lib/pseudonymity/table.rb
View file @
1ac6a03f
...
...
@@ -12,13 +12,14 @@ module Pseudonymity
columns
=
results
.
columns
# Assume they all have the same table
to_filter
=
@anon_fields
&
columns
results
.
each
do
|
result
|
to_filter
.
each
do
|
field
|
result
[
field
]
=
Digest
::
SHA2
.
new
(
256
).
hexdigest
(
result
[
field
])
unless
result
[
field
].
nil?
Enumerator
.
new
do
|
yielder
|
results
.
each
do
|
result
|
to_filter
.
each
do
|
field
|
result
[
field
]
=
Digest
::
SHA2
.
new
(
256
).
hexdigest
(
result
[
field
])
unless
result
[
field
].
nil?
end
yielder
<<
result
end
end
results
end
end
...
...
@@ -34,16 +35,17 @@ module Pseudonymity
tables
=
@config
[
"tables"
]
tables
.
map
do
|
k
,
v
|
table_to_csv
(
k
,
v
[
"whitelist"
],
v
[
"
anon
"
])
table_to_csv
(
k
,
v
[
"whitelist"
],
v
[
"
pseudo
"
])
end
end
def
table_to_csv
(
table
,
whitelist_columns
,
pseudonymity_columns
)
sql
=
"SELECT
#{
whitelist_columns
.
join
(
","
)
}
from
#{
table
}
"
results
=
ActiveRecord
::
Base
.
connection
.
exec_query
(
sql
)
return
if
results
.
empty?
anon
=
Anon
.
new
(
pseudonymity_columns
)
results
=
anon
.
anonymize
results
write_to_csv_file
table
,
results
write_to_csv_file
(
table
,
anon
.
anonymize
(
results
))
end
def
parse_config
...
...
@@ -52,10 +54,6 @@ module Pseudonymity
def
write_to_csv_file
(
title
,
contents
)
file_path
=
"/tmp/
#{
title
}
.csv"
if
contents
.
empty?
File
.
open
(
file_path
,
"w"
)
{}
return
file_path
end
column_names
=
contents
.
first
.
keys
contents
=
CSV
.
generate
do
|
csv
|
csv
<<
column_names
...
...
lib/tasks/gitlab/db.rake
View file @
1ac6a03f
...
...
@@ -73,6 +73,7 @@ namespace :gitlab do
desc
'Output pseudonymity dump of selected table'
task
:pseudonymity_dump
=>
:environment
do
table
=
Pseudonymity
::
Table
.
new
table
.
tables_to_csv
end
end
end
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment