Count variable occurrences #39

dcz-self · 2025-02-08T11:04:37Z

This helps find typos, because logru otherwise doesn't warn about underspecified variables.

This is a quick and simple prototype.

Things that could be done differently:

Abstracting VarScope into a trait, although that's probably overkill.
VarScope.name is made public. I'm not sure what other interface should be used to copy a scope. ::new(Vec<_>) or ::from_iter? And then ::empty() for creating an empty one.
Currently the warning is just a println. tracing::warn would be another "worse is better" solution, but tracing is not imported by logru. Otherwise this could turn into a warning interface, but I didn't want to go down that rabbit hole before getting general approval.

dcz-self · 2025-02-08T11:06:04Z

Example of a repl session:

?- :define dupa(B) :- A.
Some variables appear only once in this rule: B, A,
are those typos?
Defined!

This will mark all free variables as suspicious, so the only way to silence this warning is to replace the variable with _.

This introduces the analysis module and a warning in the repl whenever a variable is defined with only one occurrence.

fatho

This sounds like a nice thing to have, thanks for the suggestion! I do however have two concerns with the proposed implementation:

This sounds more like a program analysis (albeit a very simple one), so I don't think it should be the parser's business to figure this out. Instead, this would be well suited as a separate function (fn unique_named_vars(rule: &Rule) -> HashSet<Var>) that takes a rule and returns all unique named variables based on the enclosed scope. Then we also don't need this VarScopeCounted.
Since all of this is library code, it should never write directly to stderr, since we cannot assume that the application actually using it is fine with that. Instead, the warning can then be implemented as part of the repl by calling the unique_named_vars function on rules defined within the REPL and printing an appropriate message directly from the REPL, rather than library code.
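As a rough illustration of the separate analysis function suggested above, here is a minimal sketch. The `Term` and `Rule` types are simplified stand-ins invented for this example, not logru's real types, and the function returns a sorted set of names rather than the `HashSet<Var>` from the proposed signature so that the result has a stable order.

```rust
use std::collections::{BTreeSet, HashMap};

/// Stand-in term type (hypothetical; logru's real `Term` differs).
#[derive(Debug, Clone)]
enum Term {
    /// A named variable such as `A`.
    Var(String),
    /// The wildcard `_`, which is never reported.
    Wildcard,
    /// A functor applied to arguments, e.g. `foo(A, B)`.
    App(String, Vec<Term>),
}

/// Stand-in rule type (hypothetical): `head :- tail[0], tail[1], ...`.
#[derive(Debug, Clone)]
struct Rule {
    head: Term,
    tail: Vec<Term>,
}

/// Record one occurrence for every named variable inside `term`.
fn count_vars<'a>(term: &'a Term, counts: &mut HashMap<&'a str, usize>) {
    match term {
        Term::Var(name) => *counts.entry(name.as_str()).or_insert(0) += 1,
        Term::Wildcard => {}
        Term::App(_, args) => {
            for arg in args {
                count_vars(arg, counts);
            }
        }
    }
}

/// Return the names of all variables that occur exactly once in `rule`.
/// This only inspects the rule; printing a warning is left to the caller.
fn unique_named_vars(rule: &Rule) -> BTreeSet<String> {
    let mut counts = HashMap::new();
    count_vars(&rule.head, &mut counts);
    for goal in &rule.tail {
        count_vars(goal, &mut counts);
    }
    counts
        .into_iter()
        .filter(|&(_, n)| n == 1)
        .map(|(name, _)| name.to_string())
        .collect()
}
```

Because the function has no side effects, library users that don't want warnings pay nothing for it, and the REPL can decide how to word and display the message.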

dcz-self · 2025-02-15T17:41:53Z

That's done now.

fatho

Thanks for updating this, looks good! I still have a small suggestion below:

fatho · 2025-02-15T23:05:29Z

src/textual.rs

    /// Load a set of rules from a string.
-    pub fn load_str(&mut self, rules: &str) -> Result<(), ParseError> {
+    pub fn load_str(&mut self, rules: &str) -> Result<Vec<Rule>, ParseError> {
        let rules = Parser::new(&mut self.symbols).parse_rules_str(rules)?;
-        for rule in rules {
-            self.rules.insert(rule);
+        for rule in &rules {
+            self.rules.insert(rule.clone());
        }
-        Ok(())
+        Ok(rules)
    }


Most use sites of this function don't care about the results, so often we'd be cloning the rules unnecessarily.

One thing I can think of is splitting this like so:

    /// Parse a set of rules using the symbols defined in this universe.
    pub fn parse_rules(&mut self, rules: &str) -> Result<Vec<Rule>, ParseError> {
        Parser::new(&mut self.symbols).parse_rules_str(rules)
    }

    /// Insert rules previously parsed using [`Self::parse_rules`].
    pub fn insert_rules(&mut self, rules: Vec<Rule>) {
        for rule in rules {
            self.rules.insert(rule);
        }
    }

    /// Load a set of rules from a string.
    pub fn load_str(&mut self, rules: &str) -> Result<(), ParseError> {
        let rules = self.parse_rules(rules)?;
        self.insert_rules(rules);
        Ok(())
    }

Places that don't care about the rules can continue to use load_str, while in other places, one can first run parse_rules, do any analysis as desired, and then call insert_rules once done.

I don't like this a lot because this makes it possible to inject Rule objects from a different Universe (so with mismatched symbols), but I can't think of anything else without cloning.
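To make the parse, analyze, insert order concrete, here is a self-contained toy sketch of how a REPL-side handler might use such a split. Everything in it is an assumption made for illustration: the `Universe` and `Rule` types are stand-ins, the "parser" merely treats identifiers starting with an uppercase letter as variables, and `singletons` and `define` are hypothetical helper names, not part of logru.

```rust
use std::collections::BTreeMap;

/// Toy stand-in for a parsed rule: keeps the source text and variable occurrences.
#[derive(Debug, Clone, PartialEq)]
struct Rule {
    source: String,
    vars: Vec<String>,
}

/// Toy stand-in for the universe; the real one also owns the symbol store.
#[derive(Default)]
struct Universe {
    rules: Vec<Rule>,
}

impl Universe {
    /// Toy parser: one rule per non-empty line, which must end in `.`.
    /// Every identifier starting with an uppercase ASCII letter counts as a variable.
    fn parse_rules(&mut self, text: &str) -> Result<Vec<Rule>, String> {
        text.lines()
            .map(str::trim)
            .filter(|line| !line.is_empty())
            .map(|line| {
                if !line.ends_with('.') {
                    return Err(format!("missing final '.': {}", line));
                }
                let vars = line
                    .split(|c: char| !(c.is_ascii_alphanumeric() || c == '_'))
                    .filter(|tok| tok.starts_with(|c: char| c.is_ascii_uppercase()))
                    .map(str::to_string)
                    .collect();
                Ok(Rule { source: line.to_string(), vars })
            })
            .collect()
    }

    /// Insert rules previously produced by `parse_rules`.
    fn insert_rules(&mut self, rules: Vec<Rule>) {
        self.rules.extend(rules);
    }
}

/// Variable names that occur exactly once in `rule`, in sorted order.
fn singletons(rule: &Rule) -> Vec<String> {
    let mut counts: BTreeMap<&str, usize> = BTreeMap::new();
    for var in &rule.vars {
        *counts.entry(var.as_str()).or_insert(0) += 1;
    }
    counts
        .into_iter()
        .filter(|&(_, n)| n == 1)
        .map(|(name, _)| name.to_string())
        .collect()
}

/// REPL-side handler: parse first, analyze the parsed rules, then insert them.
/// Warnings are returned so that only the REPL decides how to print them.
fn define(universe: &mut Universe, text: &str) -> Result<Vec<String>, String> {
    let rules = universe.parse_rules(text)?;
    let warnings = rules
        .iter()
        .filter_map(|rule| {
            let once = singletons(rule);
            (!once.is_empty()).then(|| {
                format!("variables used only once in `{}`: {}", rule.source, once.join(", "))
            })
        })
        .collect();
    universe.insert_rules(rules);
    Ok(warnings)
}
```

Since the rules are moved into `insert_rules` only after the analysis has borrowed them, no clone is needed, and callers that don't want the analysis can keep using a plain `load_str`-style wrapper.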

Done.

fatho

LGTM

dcz-self force-pushed the typos branch from eeb6860 to e0a8f42 on February 8, 2025 11:07

repl: Alert on orphaned variables

5e8af4c

fatho requested changes Feb 13, 2025

dcz-self force-pushed the typos branch 2 times, most recently from 8178cab to 06059f6 on February 15, 2025 17:41

dcz-self changed the title ~~RFC: Count variable occurrences~~ Count variable occurrences Feb 15, 2025

fatho reviewed Feb 15, 2025

dcz-self force-pushed the typos branch 2 times, most recently from ffd0206 to 5e8af4c on February 16, 2025 13:06

fatho approved these changes Mar 1, 2025

fatho merged commit aa08c96 into fatho:main Mar 1, 2025
1 check passed
