On Fri, Nov 20, 2015 at 11:52:16AM +0100, Otto Moerbeek wrote:
> On Thu, Nov 19, 2015 at 05:52:39PM -0500, Michael McConville wrote:
>
> > I'm already cache-thrashing with all of my side projects, so if anyone's
> > interested I'll leave this to them.
> >
> > A few days ago, I wanted to try American Fuzzy Lop (afl), and bc(1)
> > seemed like a good first target: it pretty much just goes from stdin to
> > stdout, so there's no code reorganization needed.
> >
> > For those not familiar, bc compiles its input to dc(1)'s syntax and
> > forks to dc.
> >
> > There are many unique crash paths - 1041 before I killed afl. Most
> > center around emit(), which emits a dc instr. Many pass NULL to fputs()
> > in emit(). I found at least one (crashes/id:001041*) that
> > nondeterministically passes the str pointer 0xdfdfdfdfdfdfdfdf to
> > fputs(), which is probably uninitialized or already-freed memory.
> > Backtrace below.
> >
> > malloc.conf(5) may be useful.
> >
> > Here's the full afl directory:
> >
> > http://www.sccs.swarthmore.edu/users/16/mmcconv1/bc-afl/
> >
> >
> > Core was generated by `bc'.
> > Program terminated with signal SIGBUS, Bus error.
> > #0 strlen () at /usr/src/lib/libc/arch/amd64/string/strlen.S:152
> > 152 movq (%rax),%rdx /* first data in high bytes
> > */
> > (gdb) bt
> > #0 strlen () at /usr/src/lib/libc/arch/amd64/string/strlen.S:152
> > #1 0x000019f79fa7c43d in *_libc_fputs (s=0xdfdfdfdfdfdfdfdf <error: Cannot
> > access memory at address 0xdfdfdfdfdfdfdfdf>, fp=0x1) at
> > /usr/src/lib/libc/stdio/fputs.c:50
> > #2 0x000019f4ecb0f401 in emit (i=28548786530304) at
> > /usr/src/usr.bin/bc/bc.y:810
> > #3 yyparse () at /usr/src/usr.bin/bc/bc.y:178
> > #4 0x000019f4ecb13f3e in main (argc=1, argv=0x7f7fffffa570) at
> > /usr/src/usr.bin/bc/bc.y:1188
>
> This fixes at least one case (id-000141*) and make the printing of
> non-ascci chars better
>
> -Otto
>
New version, which solves all cases found in crashes, hangs and queue
above. The remaining cases were emit going into an infinite recursion
becuse the tree wasn't a tree but a cyclic graph.
Regress still succeeds.
-Otto
Index: bc.y
===================================================================
RCS file: /cvs/src/usr.bin/bc/bc.y,v
retrieving revision 1.48
diff -u -p -r1.48 bc.y
--- bc.y 10 Oct 2015 19:28:54 -0000 1.48
+++ bc.y 20 Nov 2015 13:19:07 -0000
@@ -72,7 +72,7 @@ static void grow(void);
static ssize_t cs(const char *);
static ssize_t as(const char *);
static ssize_t node(ssize_t, ...);
-static void emit(ssize_t);
+static void emit(ssize_t, int);
static void emit_macro(int, ssize_t);
static void free_tree(void);
static ssize_t numnode(int);
@@ -175,7 +175,7 @@ program : /* empty */
input_item : semicolon_list NEWLINE
{
- emit($1);
+ emit($1, 0);
macro_char = reset_macro_char;
putchar('\n');
free_tree();
@@ -803,12 +803,17 @@ node(ssize_t arg, ...)
}
static void
-emit(ssize_t i)
+emit(ssize_t i, int level)
{
- if (instructions[i].index >= 0)
- while (instructions[i].index != END_NODE)
- emit(instructions[i++].index);
- else
+ if (level > 1000)
+ errx(1, "internal error: tree level > 1000");
+ if (instructions[i].index >= 0) {
+ while (instructions[i].index != END_NODE &&
+ instructions[i].index != i) {
+ emit(instructions[i].index, level + 1);
+ i++;
+ }
+ } else if (instructions[i].index != END_NODE)
fputs(instructions[i].u.cstr, stdout);
}
@@ -816,7 +821,7 @@ static void
emit_macro(int node, ssize_t code)
{
putchar('[');
- emit(code);
+ emit(code, 0);
printf("]s%s\n", instructions[node].u.cstr);
nesting--;
}
@@ -951,7 +956,7 @@ yyerror(char *s)
!isprint((unsigned char)yytext[0]))
n = asprintf(&str,
"%s: %s:%d: %s: ascii char 0x%02x unexpected",
- __progname, filename, lineno, s, yytext[0]);
+ __progname, filename, lineno, s, yytext[0] & 0xff);
else
n = asprintf(&str, "%s: %s:%d: %s: %s unexpected",
__progname, filename, lineno, s, yytext);