concise phrase transform #811

eschultink · 2024-09-25T22:41:34Z

Features

transform more suited to the calendar event title use-case

Change implications

dependencies added/changed? no
something important to note in future release notes? not yet

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1208336236290115

jlorper · 2024-09-27T20:11:55Z

java/core/src/test/java/co/worklytics/psoxy/impl/RESTApiSanitizerImplTest.java

@@ -186,6 +186,27 @@ void redactRegexMatches(String source) {
    }


+    @SneakyThrows


test by copilot?? typo PHASE instead of PHRASE everywhere :)

eschultink · 2024-10-04T16:04:36Z

java/core/src/test/java/co/worklytics/psoxy/rules/generics/CalendarTest.java

-        "Prep Customer Meeting,Prep ",
-        "Prep: Customer,Prep: ",
+        "Out of the Office: Vacation,Out of the Office",
+        "Focus Time,'Focus Time,Focus'",


so, this is reality of this; bc transform products ALL matches, not just first. In effect, it's more like categorization than redaction.

behavior is needed if you want "Team weekly" to match both "team" and "weekly" cases. that will end up as team,weekly.

jlorper

a minor typo

jlorper · 2024-10-07T20:37:04Z

java/core/src/main/java/co/worklytics/psoxy/impl/RESTApiSanitizerImpl.java

+                return patterns.stream()
+                    .map(p -> p.matcher((String) s))
+                    .filter(Matcher::matches)
+                    .map(m -> m.group(1)) //group 1, bc we created caputuring group in regex above


Suggested change

.map(m -> m.group(1)) //group 1, bc we created caputuring group in regex above

.map(m -> m.group(1)) //group 1, bc we created capturing group in regex above

jlorper · 2024-10-07T20:49:05Z

java/core/src/test/java/co/worklytics/psoxy/rules/generics/CalendarTest.java

+        "Focus Time Block,'Focus Time,Focus'",
+        "Focus: Secret Project,Focus",
+        "No Meeting Wednesday,No Meeting",
+        " No Meetings,'No Meetings,No Meetings'", // q: why????


Because matches with No Meeting and No Meetings

🤔 it shouldn't though ... No Meeting token string shouldn't match No Meetings, bc I'm transforming it into a pattern with \b tokens on either side to only match "word boundaries. So it shouldn't match prefixes.

eschultink · 2024-10-10T19:31:13Z

java/core/src/main/java/co/worklytics/psoxy/impl/RESTApiSanitizerImpl.java

+
+        List<Pattern> patterns = transform.getAllowedPhrases().stream()
+            .map(p -> "\\Q" + p + "\\E") // quote it
+            .map(p -> "\\b" + p + "[\\\\s:]*\\b") //boundary match, with optional whitespace or colon at end


@jlorper this is where should be limiting to word boundaries.

maybe bug is that "[\\\\s:]*" is wrong, so it's matching character 's' specifically, instead of \s for whitespace as I intended ...

yeah, it was that - fixed now and works as expected.

…attern

* wip of concise phrase transform * fix import * cleanup, fix tests * cleaner rules and examples * fix tests * fix tests * drop case-insensitive multi-pattern stuff * fix pattern to allow spaces rather than s char; and tighten capture pattern * fix tests

wip of concise phrase transform

670bc86

eschultink requested review from jlorper and aperez-worklytics September 25, 2024 22:41

eschultink self-assigned this Sep 25, 2024

fix import

9beedee

eschultink changed the title ~~S184 : concise phrase transform~~ concise phrase transform Sep 27, 2024

jlorper reviewed Sep 27, 2024

View reviewed changes

eschultink added 2 commits October 3, 2024 10:56

cleanup, fix tests

69068db

Merge branch 'rc-v0.4.61' into s184-concise-phrase-transform

a4a99ee

Base automatically changed from rc-v0.4.61 to main October 3, 2024 20:41

cleaner rules and examples

e1679e4

eschultink changed the base branch from main to rc-v0.5.0 October 3, 2024 21:24

eschultink added 3 commits October 3, 2024 16:15

fix tests

f412dbc

fix tests

66aafc8

drop case-insensitive multi-pattern stuff

641e301

eschultink marked this pull request as ready for review October 4, 2024 00:29

eschultink commented Oct 4, 2024

View reviewed changes

jlorper approved these changes Oct 7, 2024

View reviewed changes

eschultink commented Oct 10, 2024

View reviewed changes

fix pattern to allow spaces rather than s char; and tighten capture p…

020768c

…attern

aperez-worklytics approved these changes Oct 14, 2024

View reviewed changes

fix tests

c5dc823

eschultink merged commit 8d3c34c into rc-v0.5.0 Oct 15, 2024
67 checks passed

eschultink deleted the s184-concise-phrase-transform branch October 15, 2024 20:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

concise phrase transform #811

concise phrase transform #811

eschultink commented Sep 25, 2024 •

edited

Loading

jlorper Sep 27, 2024

eschultink Oct 4, 2024

jlorper left a comment

jlorper Oct 7, 2024

jlorper Oct 7, 2024

eschultink Oct 10, 2024

eschultink Oct 10, 2024 •

edited

Loading

eschultink Oct 10, 2024

		@@ -186,6 +186,27 @@ void redactRegexMatches(String source) {
		}


		@SneakyThrows

	.map(m -> m.group(1)) //group 1, bc we created caputuring group in regex above
	.map(m -> m.group(1)) //group 1, bc we created capturing group in regex above

concise phrase transform #811

concise phrase transform #811

Conversation

eschultink commented Sep 25, 2024 • edited Loading

Features

Change implications

jlorper Sep 27, 2024

Choose a reason for hiding this comment

eschultink Oct 4, 2024

Choose a reason for hiding this comment

jlorper left a comment

Choose a reason for hiding this comment

jlorper Oct 7, 2024

Choose a reason for hiding this comment

jlorper Oct 7, 2024

Choose a reason for hiding this comment

eschultink Oct 10, 2024

Choose a reason for hiding this comment

eschultink Oct 10, 2024 • edited Loading

Choose a reason for hiding this comment

eschultink Oct 10, 2024

Choose a reason for hiding this comment

eschultink commented Sep 25, 2024 •

edited

Loading

eschultink Oct 10, 2024 •

edited

Loading