only show unique entries in history search when filtering #60066

KristofferC · 2025-11-06T20:37:30Z

I quite often get the whole history search filled with duplicates which feels like low signal ratio. This only keep uniques:

Before vs after

cc @tecosaur @topolarity

Tests written by Claude Code 🤖

Keno · 2025-11-06T22:25:12Z

How do the up and down arrow keys work with this. I sometimes deliberately choose a different identical copy not for the contents, but for the place in history (no pun intended).

tecosaur · 2025-11-07T02:19:09Z

So, I've already got something like this in #59953 (now that I look, it may not be pushed though: I brought in some commits from another PR to try building on, so I'll need to do some cherry-picking). There's a key difference in the approaches we've taken though, and I wonder which behaviour we actually want: here we use an allocated Set{String} to have only the first instance of a history entry be shown, in each processed chunk (chunks do not have a fixed on consistent size).

By contrast, I use the history content + mode + status to deduplicate consecutive entries. At the moment, I feel like this is probably the preferable behaviour, since I also like the history as a log of what I've done that I can look at, and this preserves that aspect better while getting rid of consecutive duplicates. I also like that the implementation is O(n) instead of O(n^2), particularly as someone with 200k history entries 🙂.

KristofferC · 2025-11-07T07:33:34Z

@tecosaur If you have an alternative implementation that you say perform better please put it up so it can actually be benchmarked against. And please make it a separate change instead of in the amalgamation PR.

I sometimes deliberately choose a different identical copy not for the contents, but for the place in history

Can you elaborate on this. You write ENV[" and you see 15 ENV["JULIA_PKG_AUTO_PRECOMPILE"] and you pick the sixth? I can make a keybind to disable the duplication?

tecosaur · 2025-11-07T07:37:10Z

If you have an alternative implementation that you say perform better please put it up so it can actually be benchmarked against. And please make it a separate change instead of in the amalgamation PR.

Yea, I'll pick the bugfixes and tweaks off that so we can discuss/merge them separately to the expanded history format. Let's me know if you have a preference between a "minor changes" PR vs. individual fix/tweak PRs.

Keno · 2025-11-07T07:48:55Z

Can you elaborate on this. You write ENV[" and you see 15 ENV["JULIA_PKG_AUTO_PRECOMPILE"] and you pick the sixth? I can make a keybind to disable the duplication?

Usually not the sixth, but the second from the bottom because I messed up whatever sequence I was testing in the latest iteration and want to start over with the previous one (by hitting down arrow after running the line)

KristofferC · 2025-11-07T07:55:58Z

and want to start over with the previous one (by hitting down arrow after running the line)

I'm not sure that feature exists any more with the new history search. But note that you can select multiple entries (with tab) so if you want a set of changes you can just bring all of those in in one go.

KristofferC · 2025-11-07T08:00:21Z

Filtering out repeats seems to at least be done for whatever default behavior I have with zsh + fzf.

…e duplicates when not filtering

KristofferC · 2025-11-07T14:03:44Z

I changed this to not remove duplicates when you are not filtering in case you want to go and tab-collect a collection of consecutive ones them earlier use.

Also fixed not creating a Set inside the function that is called repeatedly when collecting entries to show.

tecosaur · 2025-11-07T14:13:09Z

Three things:

I don't think this should deduplicate the same content in different modes
Can't we have seen::Set{String}?
I'd rather reuse a Set{String} than create a new one every time (I'd like to also cycle through nine different vectors for the search, but that's trickier)

- use a consistent type for `seen` and pass in if we should deduplicate - reuse a set accross searches

KristofferC · 2025-11-07T14:54:21Z

updated based on that

tecosaur · 2025-11-07T14:54:31Z

One further comment: When !isfiltering we could just skip calling filterchunkrev! at all (the final history is hist) and set filter_idx to 0, allowing us to change seen from a Union{Set{...}, Nothing} to just be a Set, and drop the isnothing logic within filterchunkrev!.

KristofferC · 2025-11-07T14:56:38Z

allowing us to change seen from a Union{Set{...}, Nothing} to just be a Set, and drop the isnothing logic within filterchunkrev!.

I did do that change already (but not in the way you described).

tecosaur · 2025-11-07T14:57:35Z

I have a feeling my comments came (i.e. were written) mid force-pushed changes, I'll have a look at the latest.

tecosaur · 2025-11-07T15:01:31Z

Hmm, I know filterchunkrev! is a little bit of a lost cause in terms of the number of arguments (it's never going to be a nice 2/3 argument function), but I do think it would be nice to minimise the number of arguments it takes, to the extent possible.

To this end, I'd advocate for a signature dropping deduplicate::Bool like

filterchunkrev!(state::SelectorState, candidates::DenseVector{HistEntry}, seen::Set{Tuple{Symbol, String}}, idx::Int = length(candidates);
                maxtime::Float64 = Inf, maxresults::Int = length(candidates))

and skipping filterchunkrev! when no filtering is needed.

It would also allow us to drop the isfiltering state from persisting outside while loop runs if we re-use the implication of filter_idx == 0 that no filtering is needed.

Unless there's something I've missed?

Edit: I'm thinking something like this:

cands_current = hist
if isempty(filter_spec.exacts) && isempty(filter_spec.negatives) &&
    isempty(filter_spec.regexps) && isempty(filter_spec.modes)
    # No filtering needed, show all history
    filter_idx = 0
    append!(state.candidates, cands_current)
else
    # Find the most strict candidate list available
    for (cond, cands) in Iterators.reverse(cands_cache)
        if ismorestrict(cands_cond, cond)
            cands_current = cands
            break
        end
    end
    # Start filtering candidates
    empty!(filter_seen)
    filter_idx = filterchunkrev!(
        state, cands_current, seen;
        maxtime = time() + 0.01,
        maxresults = outsize[1])
end

KristofferC · 2025-11-07T15:16:17Z

👍

tecosaur

I'm pretty happy with how this all looks now, thanks for putting up with my nitpicks!

KristofferC · 2025-11-07T15:18:00Z

Thanks for review!

ghyatzo · 2025-11-07T17:15:54Z

Cross posting from Zulip:
Thinking about this, what about having only consecutive items being filtered? So sequences like: A B C A B D A E C still show the call progression with its variations instead of just A B C D E which loses a lot of information regarding the context of D and E. While it would filter correctly sequences like A B A B A B if I filter only for A or B.

Basically Run length encoding the (currently shown, filtered or not) history. Independent in behavior if I am filtering or not. Same behavior, but auto off normally and auto on when filtering (which can then be toggled and configured (how many entries to show before collapsing)).

It would only need a last_entry and a counter for how many consecutive times you've seen it.

I know it would be a rather big change and Maybe It's too late and this train is gone, but in any case thanks for this work on the REPL 😊

KristofferC · 2025-11-07T17:37:09Z

I will merge this because I think it is an improvement over status quo but it could be modified in the future. Regarding what you wrote,

Think about the following history (each line is an entry):

ENV["JULIA_PRECOMPILE_AUTO"] = 1
a bunch of lines here
bla bla
print("this is a string that happens to have the word `env` in it")
some other lines
bla bla
ENV["JULIA_PRECOMPILE_AUTO"] = 1

and

ENV["JULIA_PRECOMPILE_AUTO"] = 1
a bunch of lines here
bla bla
print("this is a string that happens to not have it in it")
some other lines
bla bla
ENV["JULIA_PRECOMPILE_AUTO"] = 1

Now you filter on env. Why should the print line in the middle decide if ENV["JULIA_PRECOMPILE_AUTO"] = 1 is printed twice in the output? It has no relation to the other ENV entries except they both happen to be shown by a fuzzy search for "env".

Co-authored-by: KristofferC <[email protected]> (cherry picked from commit 6058082)

only show unique entries in history search

8f01412

KristofferC requested a review from tecosaur November 6, 2025 20:37

KristofferC added REPL Julia's REPL (Read Eval Print Loop) backport 1.13 labels Nov 6, 2025

fix not looking at all conten in filterchunkrev! and also not remov…

62a4e31

…e duplicates when not filtering

KristofferC changed the title ~~only show unique entries in history search~~ only show unique entries in history search when filtering Nov 7, 2025

KristofferC force-pushed the kc/unique_history branch from e87a137 to 9a89549 Compare November 7, 2025 14:51

- do not filter out same string from different modes

0d6e7eb

- use a consistent type for `seen` and pass in if we should deduplicate - reuse a set accross searches

KristofferC force-pushed the kc/unique_history branch from 9a89549 to 0d6e7eb Compare November 7, 2025 14:53

make seen obligatory and remove the deduplicate kwarg

aff738e

KristofferC force-pushed the kc/unique_history branch from 17f256c to aff738e Compare November 7, 2025 15:16

tecosaur approved these changes Nov 7, 2025

View reviewed changes

KristofferC merged commit 6058082 into master Nov 7, 2025
5 of 7 checks passed

KristofferC deleted the kc/unique_history branch November 7, 2025 17:37

KristofferC added a commit that referenced this pull request Nov 10, 2025

only show unique entries in history search when filtering (#60066)

ab44b75

Co-authored-by: KristofferC <[email protected]> (cherry picked from commit 6058082)

KristofferC added a commit that referenced this pull request Nov 11, 2025

only show unique entries in history search when filtering (#60066)

02ca408

Co-authored-by: KristofferC <[email protected]> (cherry picked from commit 6058082)

Uh oh!

only show unique entries in history search when filtering #60066

only show unique entries in history search when filtering #60066

Uh oh!

Conversation

KristofferC commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Keno commented Nov 6, 2025

Uh oh!

tecosaur commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

tecosaur commented Nov 7, 2025

Uh oh!

Keno commented Nov 7, 2025

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

tecosaur commented Nov 7, 2025

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

tecosaur commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

tecosaur commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tecosaur commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

tecosaur left a comment

Choose a reason for hiding this comment

Uh oh!

KristofferC commented Nov 7, 2025

Uh oh!

ghyatzo commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KristofferC commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

KristofferC commented Nov 6, 2025 •

edited

Loading

tecosaur commented Nov 7, 2025 •

edited

Loading

tecosaur commented Nov 7, 2025 •

edited

Loading

tecosaur commented Nov 7, 2025 •

edited

Loading

tecosaur commented Nov 7, 2025 •

edited

Loading

ghyatzo commented Nov 7, 2025 •

edited

Loading

KristofferC commented Nov 7, 2025 •

edited

Loading