YAML files don't support inline whitelisting #50

LouisTrezzini · 2018-07-04T10:57:13Z

No description provided.

domanchi · 2018-07-05T14:09:45Z

Does this only fail with YAML files? Or does it fail on other file types too?

LouisTrezzini · 2018-07-05T15:27:57Z

It works fine with Python files.

domanchi · 2018-07-05T15:30:59Z

@LouisTrezzini, please open an issue to report this bug, or complete this pull request to fix it. Pull requests should be used for code that is meant to be merged into the master branch.

domanchi · 2018-07-06T14:50:51Z

detect_secrets/plugins/base.py

@@ -38,17 +38,17 @@ def analyze_string(self, string, line_num, filename):   # pragma: no cover

        NOTE: line_num and filename are used for PotentialSecret creation only.
        """
-        pass
+        raise NotImplementedError


This isn't really needed, because `abc.abstractmethod` will prevent the class from even being instantiated.
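To illustrate the point about `abc.abstractmethod`, here is a minimal sketch. The class names `BasePlugin` and `EchoPlugin` and the helper `can_instantiate` are hypothetical stand-ins, not the project's actual code, and the sketch assumes Python 3's `abc.ABC`. It shows that a class with an unimplemented abstract method cannot be instantiated, so the method body (`pass` or `raise NotImplementedError`) is never reached through a direct call on the base class.

```python
import abc


class BasePlugin(abc.ABC):  # hypothetical stand-in for the plugin base class
    @abc.abstractmethod
    def analyze_string(self, string, line_num, filename):
        """Subclasses must override this method."""
        pass


class EchoPlugin(BasePlugin):  # hypothetical concrete plugin
    def analyze_string(self, string, line_num, filename):
        # Return a trivial mapping so the override is observable.
        return {(filename, line_num): string}


def can_instantiate(cls):
    """Return True if cls() succeeds, False if Python raises TypeError."""
    try:
        cls()
    except TypeError:
        # Raised when the class still has unimplemented abstract methods.
        return False
    return True
```

The body of an abstract method can still be reached if a subclass calls it explicitly through `super()`, which is the only case where the choice between `pass` and `raise NotImplementedError` would matter.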

domanchi · 2018-07-06T14:54:32Z

detect_secrets/plugins/high_entropy_strings.py

-                                filename,
-                            ),
-                        )
+                        if not item['__line__'] in ignored_lines:


Use `and` instead, to avoid hadouken code?

http://i.imgur.com/BtjZedW.jpg

LouisTrezzini · 2018-07-09

If you look more closely, the `continue` statement is shared, but I agree it's not very elegant.

domanchi · 2018-07-09T18:07:55Z

Ah, gotcha. ++
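For readers unfamiliar with the "hadouken" remark, the sketch below contrasts a nested condition with the flattened `and` form. It is a simplified illustration, not the PR's code: the function names and the list-of-dicts input are assumptions, and it leaves out the shared `continue` mentioned in the reply, which is why the real change was not a drop-in rewrite.

```python
def collect_nested(items, ignored_lines):
    """Nested form: each extra condition adds one level of indentation."""
    found = []
    for item in items:
        if '__line__' in item:
            if item['__line__'] not in ignored_lines:
                found.append(item['__value__'])
    return found


def collect_flat(items, ignored_lines):
    """Flattened form: the two conditions are joined with `and`."""
    found = []
    for item in items:
        if '__line__' in item and item['__line__'] not in ignored_lines:
            found.append(item['__value__'])
    return found
```

Both functions return the same result for the same input. The flattened form only applies when nothing else needs to run between the two checks.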

domanchi · 2018-07-06T15:00:46Z

detect_secrets/plugins/high_entropy_strings.py

-        data = YamlLineInjector(file).json()
+        parser = YamlFileParser(file)
+        data = parser.json()
+        ignored_lines = parser.get_ignored_lines()


Any reason why we want to do this all at once, rather than scanning as needed?

e.g.

    if '__line__' in item and not WHITELIST_REGEX.search(item['__value__']):
        pass

LouisTrezzini · 2018-07-09

The current behavior is doing what you suggest:

    if '__line__' in item:
        potential_secrets.update(
            self.analyze_string(
                item['__value__'],
                item['__line__'],
                filename,
            ),
        )

But `__value__` is only the parsed value, not the full line, so the inline comment is dropped. PyYAML drops comments during preprocessing, so we can't use it to find them.

I decided to scan the file once to identify all ignored lines, and later do a simple O(1) `line in ignored_lines` check.

domanchi · 2018-07-09T18:10:37Z

Ahh. Right. Good point.

Can you explain this in the docstring for `get_ignored_lines`? Specifically, that the parser drops the comments, and thus we need to scan the file separately from the YAML parsing.
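The approach discussed in this thread can be sketched roughly as follows. This is not the merged implementation: the regex below is an assumed whitelist pattern chosen for illustration, and this standalone `get_ignored_lines` takes a file-like object rather than being a method on `YamlFileParser`. It scans the raw text once and collects the 1-based line numbers that carry a whitelist comment, so the later membership check is a set lookup.

```python
import io
import re

# Assumption: an illustrative whitelist-comment pattern, not necessarily
# the project's actual WHITELIST_REGEX.
WHITELIST_REGEX = re.compile(r'#\s*pragma:\s*whitelist[ -]secret')


def get_ignored_lines(file):
    """Return the set of 1-based line numbers with a whitelist comment.

    The YAML parser discards comments, so the parsed values cannot tell
    us which lines were whitelisted. The raw file is therefore scanned
    separately from the YAML parsing.
    """
    ignored_lines = set()
    for line_number, line in enumerate(file, start=1):
        if WHITELIST_REGEX.search(line):
            ignored_lines.add(line_number)
    return ignored_lines


# Usage with an in-memory file: only the second line is whitelisted.
sample = io.StringIO(
    "key: value\n"
    "secret: abc123  # pragma: whitelist secret\n"
    "other: x\n"
)
ignored_lines = get_ignored_lines(sample)
```

Using a set keeps the `line in ignored_lines` check constant-time on average, which matches the O(1) lookup described above.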

domanchi

fix'n'ship!

LouisTrezzini · 2018-07-13T10:30:17Z

I pushed the changes you asked for, feel free to merge

High entropy string should be whitelist-able

270a752

add whitelisted secret in python file

42373eb

domanchi changed the title ~~High entropy string should be whitelist-able~~ YAML files don't support inline whitelisting Jul 5, 2018

Support inline whitelisting for YAML files

f31080b

LouisTrezzini force-pushed the master branch from be49fa7 to f31080b on July 6, 2018 11:19

KevinHock requested a review from domanchi July 6, 2018 19:04

domanchi reviewed Jul 9, 2018

domanchi approved these changes Jul 9, 2018

Review fixes

8842526

domanchi merged commit 11b8768 into Yelp:master Jul 13, 2018
