Add Flex/Bison parser #181

makaimann · 2021-02-27T04:24:21Z

This PR adds an SmtLibReader class with virtual methods for common SMT-LIB commands. This way, a user can inherit the class and perform their own actions on standard commands. By default, it just executes the commands on the provided solver.

Currently supports:

Bool/Int/Real/BV/Array/UF
quantifiers
let bindings
declare-fun/declare-const/define-fun
set-logic/set-option/set-info
check-sat/check-sat-assuming/push/pop/exit

The infrastructure is based on the calc++ example.

I'm new to Flex/Bison so please let me know if there are any inefficiencies/broken conventions that you notice. Of course, trying to balance making it nice vs. getting it done soon so it's not going to be perfect. One particularly ugly part right now is here:
https://github.com/makaimann/smt-switch/blob/9b7fe79103648da4d5e7fc4ea0bf973abfd0e4a7/src/smtlibscanner.l#L72

Basically, there can be multiline string literals in set-info, or in a pipe-quoted symbol, e.g.:

(set-logic QF_LIA)
(declare-const |a
                multi-line
                symbol|
                Int)

Currently, the lexer combines that into one token. However, this can mess up error reporting since the line number is off. So that linked code just increments the line number accordingly. I did briefly try splitting them into one token per line and recombining them in the bison parser but ran into some issues. I could probably get it working if you think that's better. Let me know if you think that's a lot better than the current solution.

makaimann · 2021-03-01T19:10:31Z

Here are the initial parser results. Note: incremental benchmarks have more solved than the total since each sat/unsat counts as a solve. I have no idea why smt-switch solves so many fewer QF_AUFLIA benchmarks. It could be a parser issue, but since all the others seem to be fine, I'm guessing it's actually a difference in the default options. I'll look into that more today.

The most important thing is that the status is "ok", meaning there were no disagreements.

makaimann · 2021-03-01T19:28:59Z

Btw, the tests above were ran with an older version. I'll run tests again soon with the latest. Looking into cvc4's default options first.

makaimann · 2021-03-02T21:34:03Z

All right, I found the slowdown. The crash3.smt2 benchmark in the safari set was very large and made frequent use of define-funs. My define-fun handling uses naive substitution, and the substitution operation was slow for CVC4. Fixed with: #184. Here are the latest results where smt-switch wins in most categories:

barrettcw · 2021-03-03T00:55:59Z

Awesome!

ahmed-irfan

Looks great!

I guess the get commands (get-unsat-core, get-value, get-model etc) are left for future work. RIght?

src/smtlib_reader.cpp

src/smtlibscanner.l

src/smtlibparser.yy

ahmed-irfan · 2021-03-03T01:08:31Z

All right, I found the slowdown. The crash3.smt2 benchmark in the safari set was very large and made frequent use of define-funs. My define-fun handling uses naive substitution, and the substitution operation was slow for CVC4. Fixed with: #184. Here are the latest results where smt-switch wins in most categories:

That is indeed great!

Interestingly, smt-switch-cvc4 solves way more in the QF_AUFLIA

makaimann · 2021-03-03T01:09:19Z

Thanks for the review!

I guess the get commands (get-unsat-core, get-value, get-model etc) are left for future work. Right?

Yeah that's right -- I haven't thought about those much yet. get-value would be easy if we need that. I think get-unsat-core and get-model would take a bit more thought.

makaimann · 2021-03-03T01:12:48Z

Interestingly, smt-switch-cvc4 solves way more in the QF_AUFLIA

Right, so that's actually entirely from the single crash3.smt2 benchmark. It has a huge number of check-sat calls (1,109,912 total) and they're all very quick (at least all the ones that we get to in the time limit). So I think there's a noticeable difference there just because flex/bison is faster than ANTLR and there's very little time in the solver.

amaleewilson

Looks good to me

ahmed-irfan

LGTM.

Suggestion: please open issues for the todos, like supporting getter functions in the parser.

makaimann · 2021-03-05T17:56:44Z

Thanks for the reviews! Made an issue for the getters: #190.

makaimann added 30 commits February 8, 2021 22:03

Working on flex/bison files

a10b107

First pass on driver

69dafc0

First pass on smtlibparser.yy

6e98a88

remove unnecessary file

31a474d

First pass on smtlibscanner from calc++ example

c081602

Update Makefile

28bee74

Minor

6f56076

Add test

c49f1ce

Various fixes

3678d7b

Fix YY_NULL issue

5bcaffa

Pass text through for symbol

ed0ac2f

Create specific case for set-logic

28fb1d3

Fix top-level of grammar and add assert

8c56131

Remove old files

8ec48e3

Rename: smtlibscanner.ll -> smtlibscanner.l because uses C

813867a

Add Bool, Int, and Real sorts, and declare-const

317484c

use getters and try text in sorts

8e254c0

Add a solver to the driver

fbcdcea

Basic sorts working

c48df31

declare-const working

756093f

Put SmtLibDriver in smt namespace

029d98a

Add lookup_primop feature

abd35c1

Some clean up and add indprefix

79d303b

Some progress on creating terms

448cb76

Support assert and check-sat

c4ff8a5

Clean up and add some arithmetic operators

c51aa42

Add boolean operators

068285a

Add declare-fun

857fdb8

Add push/pop

baca763

Add true/false

1e9c12c

Increment location according to newlines in string literals

ec97c95

makaimann added 2 commits March 1, 2021 11:12

Update comment

6ed0727

Remove addressed issue

9b7fe79

makaimann marked this pull request as ready for review March 1, 2021 19:28

makaimann requested review from ahmed-irfan and amaleewilson March 1, 2021 19:29

makaimann assigned amaleewilson, makaimann and ahmed-irfan Mar 1, 2021

makaimann added the enhancement New feature or request label Mar 1, 2021

Merge branch 'master' into parser

8574661

ahmed-irfan reviewed Mar 3, 2021

View reviewed changes

makaimann added 5 commits March 2, 2021 17:18

Return definition without substituting if there are no arguments

e7e7531

apply_define_fun requires arguments

8a8f7d5

Fix hex regex

ee87e56

format

6e2020d

Add newline at end of file

5113195

amaleewilson approved these changes Mar 4, 2021

View reviewed changes

ahmed-irfan approved these changes Mar 4, 2021

View reviewed changes

Remove old TODO comments

82a8d63

makaimann merged commit 54760e1 into master Mar 5, 2021

makaimann deleted the parser branch March 5, 2021 18:01

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Flex/Bison parser #181

Add Flex/Bison parser #181

makaimann commented Feb 27, 2021 •

edited

Loading

makaimann commented Mar 1, 2021 •

edited

Loading

makaimann commented Mar 1, 2021

makaimann commented Mar 2, 2021

barrettcw commented Mar 3, 2021

ahmed-irfan left a comment

ahmed-irfan commented Mar 3, 2021

makaimann commented Mar 3, 2021

makaimann commented Mar 3, 2021 •

edited

Loading

amaleewilson left a comment

ahmed-irfan left a comment

makaimann commented Mar 5, 2021

Add Flex/Bison parser #181

Add Flex/Bison parser #181

Conversation

makaimann commented Feb 27, 2021 • edited Loading

makaimann commented Mar 1, 2021 • edited Loading

makaimann commented Mar 1, 2021

makaimann commented Mar 2, 2021

barrettcw commented Mar 3, 2021

ahmed-irfan left a comment

Choose a reason for hiding this comment

ahmed-irfan commented Mar 3, 2021

makaimann commented Mar 3, 2021

makaimann commented Mar 3, 2021 • edited Loading

amaleewilson left a comment

Choose a reason for hiding this comment

ahmed-irfan left a comment

Choose a reason for hiding this comment

makaimann commented Mar 5, 2021

makaimann commented Feb 27, 2021 •

edited

Loading

makaimann commented Mar 1, 2021 •

edited

Loading

makaimann commented Mar 3, 2021 •

edited

Loading