-
Notifications
You must be signed in to change notification settings - Fork 142
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: Adding SRL #215
base: master
Are you sure you want to change the base?
WIP: Adding SRL #215
Conversation
Simplified the predicate candidate generation for Nom Added dummy sentence creation for Nom during SemanticRoleLabeler init Removed pre-processing during training (inside PreExtractor) Changed system output to logger Cleaned up pom.xml Removed pom from .gitignore
Resolved a clash in lexicon naming by adding the parser view (also removed the "lexicon." prefix)
Replaced console output with logger calls Updated compileModelsToJar to accommodate both Stanford and Charniak
Removed unnecessary dependencies Switched to illinois-nlp-pipeline-0.1.2 Minor fixes
Fixed model deployment
fixed a bug that prevented the TextPreProcessor from being initialized outside SRL.
-Only works with the Lagrange inference solver for now
handle exception from getParsePhrase
…SRL (otherwise exception is thrown if not present, and it usually isn't...) Added a memory test to gain some insight into whether memory is freed properly (symptoms were in pipeline); behavior seems reasonable.
updated to use updated Annotator API, which accommodates lazy initialization had to change constructor signatures, push some constructor params into ResourceManager argument updated tests to conform to new multi-sentence DummyTextAnnotation TODO: verify that nom is behaving correctly (only finds predicates in first sentence) @christod there is no rush on this -- I've deployed a snapshot for use with a snapshot of pipeline for my own purposes. See merge request !13
updated version number finalizing coordinated updates with illinois-nlp-pipeline See merge request !14
# Conflicts: # .gitignore
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would suggest removing the curator-release
components.
<groupId>edu.illinois.cs.cogcomp</groupId> | ||
<artifactId>illinois-srl-models</artifactId> | ||
<classifier>verb-stanford</classifier> | ||
<version>5.1</version> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Make sure to change the models version (after retraining)
<dependency> | ||
<groupId>edu.illinois.cs.cogcomp</groupId> | ||
<artifactId>inference</artifactId> | ||
<version>0.5.0</version> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this the latest inference package?
</dependency> | ||
</dependencies> | ||
|
||
<build> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This section is not needed since it's inherited from the parent
<dependency> | ||
<groupId>edu.illinois.cs.cogcomp</groupId> | ||
<artifactId>illinois-core-utilities</artifactId> | ||
<version>${cogcomp-nlp-version}</version> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These cogcomp-nlp-version
variables should be replaced with the current super-project version.
<dependency> | ||
<groupId>edu.illinois.cs.cogcomp</groupId> | ||
<artifactId>illinois-common-resources</artifactId> | ||
<classifier>illinoisSRL</classifier> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note the use of the classifier in case this needs to be addressed (see #43)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
huh? I am not clean on the classifier
tag usage here ...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was using the classifier
property to split the common-resources per project. Each project would use its own subunit of the common resources module.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
private final static Logger log = LoggerFactory.getLogger(LegalArguments.class); | ||
|
||
private final Map<String, Set<String>> legalArgs = new HashMap<>(); | ||
private final Map<String, Set<String>> legalArgsForSense = new HashMap<>(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Indentation error: are we using the formatter
plugin?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done.
@@ -0,0 +1,93 @@ | |||
/** |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This exists in edison
.
@@ -0,0 +1,134 @@ | |||
/** |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be moved to core-utilities
or inference
or illinois-sl
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
core-utils
doesn't have sl
as dependency (and vice versa). Maybe we keep it here for the time being?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
illinois-srl
has a dependency to sl
and inference
(and core-utilities
of course). So moving this file to one of those package would be the best solution.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think my point is different: if you look at the file, it contains functions from both core-utils and SL. Moving this file to one of those projects require one of them to be dependency of the other.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
oh I see... but the core-utilities
dependencies are limited to just IOUtils
and illinois-sl
already has a solution for these methods (see WeightVector). So maybe just a quick fix and this file will be ready to be moved to sl
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That sounds doable.
@@ -0,0 +1,83 @@ | |||
(define features |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done.
@@ -0,0 +1,75 @@ | |||
(define features |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done.
<dependency> | ||
<groupId>edu.illinois.cs.cogcomp</groupId> | ||
<artifactId>illinois-common-resources</artifactId> | ||
<version>1.5</version> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do we need both the general illinois-common-resources
, as well as the srl
and the ner
ones? The appropriate solution would be to pack everything we need into the srl
common-resources.
Removed `SentenceDBHandler` and replaced it with the `TextAnnotationCache` interface Added a `close()` method to the interface to avoid problems with MapDB (and any future implementers of the interface)
@danyaljj I've updated your branch from upstream and updated the cache mechanism. |
Yasss! Awesome will test it. |
Pending issues to discuss:
curator-release
components.WeightVectorUtils
to SL.