Communities

Writing

Codidact Meta

The Great Outdoors

Photography & Video

Scientific Speculation

Cooking

Electrical Engineering

Judaism

Languages & Linguistics

$Mathematics$

tag:snake search within a tag

answers:0 unanswered questions

user:xxxx search by author id

score:0.5 posts with 0.5+ score

"snake oil" exact phrase

votes:4 posts with 4+ votes

created:<1w created < 1 week ago

post_type:xxxx type of post

Search help

Notifications

Mark all as read See all your notifications »

Q&A Meta

Q&A

Posts Tags Edits

Ask Question

greedy capture with sed

+1

−0

I am trying to greedily capture text with sed. For example, I have the string abbbc, and I want to capture all of the repeated b characters, so that my result is bbb. Here's an attempt at a solution:

$ sed -n 's/.*\(b\+\).*/\1/p' <<< abbbc
b

As shown in the output of the command, the capture only obtains a single b rather than my desired result bbb.

I know I could prepend and append the "not b" pattern ([^b]) to my capture, which would give me the desired result:

$ sed -n 's/.*[^b]\(b\+\)[^b].*/\1/p' <<< abbbc
bbb

However, this solution is a bit inelegant, and may become much more complicated when the match is not as simple. So I'm hoping there's another way to force the capture to be greedy.

sed regex

posted 2 days ago

CC BY-SA 4.0

Trevor‭

101 reputation 8 1 17 7

Raw

Markdown

History

is a duplicate

This question has been asked before and has already been answered. It should be marked as a duplicate.

Please enter the URL of the proposed duplicate in the details field below.

not constructive

This question cannot be answered in a way that is helpful to anyone. It's not possible to learn something from possible answers, except for the solution for the specific problem of the asker.

0 comment threads

1 answer

Score Active Age

+2

−0

Worked for Trevor‭

The following users marked this post as Works for me:

User	Comment	Date
Trevor‭	(no comment)	Jun 1, 2025 at 02:27

The b\+ part of the regex is already greedy. In sed, all repetitions are greedy. Your problem is that the initial .* is also greedy, and so that's gobbling up both the a and as many bs as it can. For this example, you can change that part to [^b]*:

$ sed -n 's/[^b]*\(b\+\).*/\1/p' <<< abbbc
bbb

For more complicated situations, sed is unlikely to cut it. grep might be a more natural fit for what you're trying to do anyway.

$ grep -o 'b\+' <<< abbbc
bbb

posted 1 day ago

CC BY-SA 4.0

r~~‭

1057 reputation 0 29 111 11

Copy Link

Raw

Markdown

History

0 comment threads

Sign up to answer this question »