Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: Adding SRL #215

Open
wants to merge 127 commits into
base: master
Choose a base branch
from
Open

WIP: Adding SRL #215

wants to merge 127 commits into from

Conversation

danyaljj
Copy link
Member

@danyaljj danyaljj commented Sep 20, 2016

Pending issues to discuss:

christos-c and others added 30 commits December 11, 2014 18:24
Simplified the predicate candidate generation for Nom
Added dummy sentence creation for Nom during SemanticRoleLabeler init
Removed pre-processing during training (inside PreExtractor)
Changed system output to logger
Cleaned up pom.xml
Removed pom from .gitignore
Resolved a clash in lexicon naming by adding the parser view (also removed the "lexicon." prefix)
Replaced console output with logger calls
Updated compileModelsToJar to accommodate both Stanford and Charniak
Removed unnecessary dependencies
Switched to illinois-nlp-pipeline-0.1.2
Minor fixes
Fixed model deployment
fixed a bug that prevented the TextPreProcessor from being initialized outside SRL.
-Only works with the Lagrange inference solver for now
handle exception from getParsePhrase
Mark Sammons and others added 12 commits August 14, 2016 02:11
…SRL (otherwise exception is thrown if not present, and it usually isn't...)

Added a memory test to gain some insight into whether memory is freed properly (symptoms were in pipeline); behavior seems reasonable.
updated to use updated Annotator API, which accommodates lazy initialization

had to change constructor signatures, push some constructor params into ResourceManager argument
updated tests to conform to new multi-sentence DummyTextAnnotation
TODO: verify that nom is behaving correctly (only finds predicates in first sentence)
@christod  there is no rush on this -- I've deployed a snapshot for use with a snapshot of pipeline for my own purposes. 

See merge request !13
updated version number

finalizing coordinated updates with illinois-nlp-pipeline

See merge request !14
Copy link
Member

@christos-c christos-c left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would suggest removing the curator-release components.

<groupId>edu.illinois.cs.cogcomp</groupId>
<artifactId>illinois-srl-models</artifactId>
<classifier>verb-stanford</classifier>
<version>5.1</version>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Make sure to change the models version (after retraining)

<dependency>
<groupId>edu.illinois.cs.cogcomp</groupId>
<artifactId>inference</artifactId>
<version>0.5.0</version>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this the latest inference package?

</dependency>
</dependencies>

<build>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This section is not needed since it's inherited from the parent

<dependency>
<groupId>edu.illinois.cs.cogcomp</groupId>
<artifactId>illinois-core-utilities</artifactId>
<version>${cogcomp-nlp-version}</version>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These cogcomp-nlp-version variables should be replaced with the current super-project version.

<dependency>
<groupId>edu.illinois.cs.cogcomp</groupId>
<artifactId>illinois-common-resources</artifactId>
<classifier>illinoisSRL</classifier>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note the use of the classifier in case this needs to be addressed (see #43)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

huh? I am not clean on the classifier tag usage here ...

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was using the classifier property to split the common-resources per project. Each project would use its own subunit of the common resources module.

Copy link
Member Author

@danyaljj danyaljj Sep 29, 2016

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

private final static Logger log = LoggerFactory.getLogger(LegalArguments.class);

private final Map<String, Set<String>> legalArgs = new HashMap<>();
private final Map<String, Set<String>> legalArgsForSense = new HashMap<>();
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Indentation error: are we using the formatter plugin?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

@@ -0,0 +1,93 @@
/**
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This exists in edison.

@@ -0,0 +1,134 @@
/**
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be moved to core-utilities or inference or illinois-sl.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

core-utils doesn't have sl as dependency (and vice versa). Maybe we keep it here for the time being?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

illinois-srl has a dependency to sl and inference (and core-utilities of course). So moving this file to one of those package would be the best solution.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think my point is different: if you look at the file, it contains functions from both core-utils and SL. Moving this file to one of those projects require one of them to be dependency of the other.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oh I see... but the core-utilities dependencies are limited to just IOUtils and illinois-sl already has a solution for these methods (see WeightVector). So maybe just a quick fix and this file will be ready to be moved to sl

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That sounds doable.

@@ -0,0 +1,83 @@
(define features
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be removed.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

@@ -0,0 +1,75 @@
(define features
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be removed.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

<dependency>
<groupId>edu.illinois.cs.cogcomp</groupId>
<artifactId>illinois-common-resources</artifactId>
<version>1.5</version>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do we need both the general illinois-common-resources, as well as the srl and the ner ones? The appropriate solution would be to pack everything we need into the srl common-resources.

Removed `SentenceDBHandler` and replaced it with the `TextAnnotationCache` interface
Added a `close()` method to the interface to avoid problems with MapDB (and any future implementers of the interface)
@christos-c
Copy link
Member

@danyaljj I've updated your branch from upstream and updated the cache mechanism.
NB: I have NOT tested this on my end since I don't have access to the Propbank/Treebank files; if it doesn't work just let me know and I can try to debug it on my Illinois desktop.

@danyaljj
Copy link
Member Author

Yasss! Awesome will test it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants