Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Paper: RoughPy #904

Open
wants to merge 32 commits into
base: 2024
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 11 commits
Commits
Show all changes
32 commits
Select commit Hold shift + click to select a range
3c40519
first draft
inakleinbottle May 15, 2024
86330b7
fixed a few problems
inakleinbottle May 19, 2024
4750c64
fix a couple more problems
inakleinbottle May 19, 2024
f7c41d6
change the modeline
inakleinbottle May 19, 2024
fde193b
final few corrections
inakleinbottle May 19, 2024
135ded1
change to note, avoid duplication
inakleinbottle May 19, 2024
af7c7be
Add more keywords and minor fixes
inakleinbottle May 20, 2024
e0b0d0d
Update roles on submission
inakleinbottle May 21, 2024
d55ac64
Added TL as an author
inakleinbottle May 22, 2024
6e5a857
update acknowledgements
inakleinbottle May 22, 2024
f2cbdf4
add dois for those that have them, ignore errors on the rest
inakleinbottle May 22, 2024
d33e15f
Fixed typo and added comment about installing roughpy.
inakleinbottle Jul 25, 2024
2b6beee
fixed notation for shuffle tensor and free tensor
inakleinbottle Jul 25, 2024
48b7673
fixed naming of the lie algebra
inakleinbottle Jul 25, 2024
ad3d3db
fixed typo
inakleinbottle Aug 1, 2024
bfde263
added missing period at end of paragraph
inakleinbottle Aug 1, 2024
df1f072
fixed typo in 'one simple'
inakleinbottle Aug 1, 2024
d6e41e9
fixed linebreak before period
inakleinbottle Aug 1, 2024
e622a79
clarify the use of floats and defaults
inakleinbottle Aug 1, 2024
4dcfc11
change the wording to remove 'in the sequel'
inakleinbottle Aug 1, 2024
4216b57
changed the wording from 'speed' to 'rate'
inakleinbottle Aug 1, 2024
a7fffdf
remove semicolon
inakleinbottle Aug 1, 2024
22c67e0
added paragraph about applications on datasig
inakleinbottle Aug 1, 2024
65ad5e3
Fixed case of "of"
inakleinbottle Aug 3, 2024
47ecd34
fix reference to later section
inakleinbottle Aug 3, 2024
41b5d0a
remove surplus comma
inakleinbottle Aug 3, 2024
97fc6ce
Fix a few issues in mathematical introduction
inakleinbottle Aug 3, 2024
ddfac4a
Fixed issues in section 3
inakleinbottle Aug 3, 2024
20d0d2b
fix wording and reformat pars
inakleinbottle Aug 3, 2024
f44cf59
fix more wording
inakleinbottle Aug 3, 2024
7dd6607
fix remaining issues
inakleinbottle Aug 3, 2024
e9f57ca
Remove export format
inakleinbottle Aug 3, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added papers/sam_morley/banner.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
74 changes: 74 additions & 0 deletions papers/sam_morley/myst.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,74 @@
# See docs at: https://mystmd.org/guide/frontmatter
version: 1
project:
id: scipy-2024-sam_morley
title: RoughPy
subtitle: Streaming data is rarely smooth
short_title: RoughPy

authors:
- name: Sam Morley
email: [email protected]
orcid: 0000-0001-5971-7418
affiliations:
- University Of Oxford
roles:
- Conceptualization
- Writing - original draft
- Software
corresponding: true
- name: Terry Lyons
email: [email protected]
orcid: 0000-0002-9972-2809
affiliations:
- University of Oxford
roles:
- Conceptualization
- Supervision
- Writing - review & editing

# description:
keywords:
- sequential data
- unparametrised paths
- time series
- rough paths
- signatures
- data science
- machine learning
- signature kernels
- Log-ODE method
github: datasig-ac-uk/roughpy
bibliography:
- roughpy.bib
# Add the abbreviations that you use in your paper here
abbreviations:
RFI: Radio frequency interference
NLP: Natural Language Processing
ODE: Ordinary Differential Equation
CDE: Controlled differential equation
PDE: Partial differential equation

# It is possible to explicitly ignore the `doi-exists` check for certain citation keys
error_rules:
- rule: doi-exists
severity: ignore
keys:
- Lyons1998
- esig
- dlpack
- pybind11
- coropa_project
- Granlund12
- NEURIPS2021_18a9042b

# A banner will be generated for you on publication, this is a placeholder
banner: banner.png
# The rest of the information shouldn't be modified
subject: Research Article
open_access: true
license: CC-BY-4.0
venue: Scipy 2024
date: 2024-07-10
site:
template: article-theme
140 changes: 140 additions & 0 deletions papers/sam_morley/roughpy.bib
Original file line number Diff line number Diff line change
@@ -0,0 +1,140 @@
@inproceedings{tseriotou_etal_2024_sig,
title = "Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling",
author = "Tseriotou, Talia and
Chan, Ryan and
Tsakalidis, Adam and
Bilal, Iman Munire and
Kochkina, Elena and
Lyons, Terry and
Liakata, Maria",
editor = "Aletras, Nikolaos and
De Clercq, Orphee",
booktitle = "Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations",
month = mar,
year = "2024",
address = "St. Julians, Malta",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.eacl-demo.24",
pages = "223--237",
abstract = "We present an open-source, pip installable toolkit, Sig-Networks, the first of its kind for longitudinal language modelling. A central focus is the incorporation of Signature-based Neural Network models, which have recently shown success in temporal tasks. We apply and extend published research providing a full suite of signature-based models. Their components can be used as PyTorch building blocks in future architectures. Sig-Networks enables task-agnostic dataset plug-in, seamless preprocessing for sequential data, parameter flexibility, automated tuning across a range of models. We examine signature networks under three different NLP tasks of varying temporal granularity: counselling conversations, rumour stance switch and mood changes in social media threads, showing SOTA performance in all three, and provide guidance for future tasks. We release the Toolkit as a PyTorch package with an introductory video, Git repositories for preprocessing and modelling including sample notebooks on the modeled NLP tasks.",
doi = "10.48550/arXiv.2312.03523",
}

@inproceedings{NEURIPS2020_4a5876b4,
author = {Kidger, Patrick and Morrill, James and Foster, James and Lyons, Terry},
booktitle = {Advances in Neural Information Processing Systems},
editor = {H. Larochelle and M. Ranzato and R. Hadsell and M.F. Balcan and H. Lin},
pages = {6696--6707},
publisher = {Curran Associates, Inc.},
title = {Neural Controlled Differential Equations for Irregular Time Series},
url = {https://proceedings.neurips.cc/paper_files/paper/2020/file/4a5876b450b45371f6cfe5047ac8cd45-Paper.pdf},
volume = {33},
year = {2020},
doi = "10.48550/arXiv.1906.08215",
}

@article{Lyons1998,
abstract = {This paper aims to provide a systematic approach to the treatment of differential equations of the typedyt = Σi fi(yt) dxti where the driving signal xt is a rough path. Such equations are very common and occur particularly frequently in probability where the driving signal might be a vector valued Brownian motion, semi-martingale or similar process.However, our approach is deterministic, is totally independent of probability and permits much rougher paths than the Brownian paths usually discussed. The results here are strong enough to treat the main probabilistic examples and significantly widen the class of stochastic processes which can be used to drive stochastic differential equations. (For a simple example see [10], [1]).We hope our results will have an influence on infinite dimensional analysis on path spaces, loop groups, etc. as well as in more applied situations. Variable step size algorithms for the numerical integration of stochastic differential equations [8] have been constructed as a consequence of these results.},
author = {Lyons, Terry J.},
journal = {Revista Matemática Iberoamericana},
keywords = {Ecuaciones diferenciales estocásticas; Proceso de difusión; Movimiento browniano; stochastic differential equations; rough paths; Brown motion; Gauss and Markov processes; Lie algebras},
language = {eng},
number = {2},
pages = {215-310},
title = {Differential equations driven by rough signals.},
url = {http://eudml.org/doc/39555},
volume = {14},
year = {1998},
}

@inproceedings{liao2021a,
edition = {},
number = {},
journal = {},
pages = {},
publisher = {British Machine Vision Association},
school = {},
title = {Logsig-RNN: a novel network for robust and efficient skeleton-based action recognition},
volume = {},
author = {Liao, S and Lyons, TJ and Yang, W and Schlegel, K and Ni, H},
editor = {},
year = {2021},
organizer = {32nd British Machine Vision Conference (BMVC 2021)},
series = {},
doi = "10.48550/arXiv.2110.13008",
}

@inproceedings{NEURIPS2021_18a9042b,
author = {Fermanian, Adeline and Marion, Pierre and Vert, Jean-Philippe and Biau, G\'{e}rard},
booktitle = {Advances in Neural Information Processing Systems},
editor = {M. Ranzato and A. Beygelzimer and Y. Dauphin and P.S. Liang and J. Wortman Vaughan},
pages = {3121--3134},
publisher = {Curran Associates, Inc.},
title = {Framing RNN as a kernel method: A neural ODE approach},
url = {https://proceedings.neurips.cc/paper_files/paper/2021/file/18a9042b3fc5b02fe3d57fea87d6992f-Paper.pdf},
volume = {34},
year = {2021},
}

@article{JMLR_v20_16_314,
author = {Franz J. Kiraly and Harald Oberhauser},
title = {Kernels for Sequentially Ordered Data},
journal = {Journal of Machine Learning Research},
year = {2019},
volume = {20},
number = {31},
pages = {1--45},
url = {http://jmlr.org/papers/v20/16-314.html},
doi = "10.48550/arXiv.2102.03657",
}

@misc{pybind11,
author = {Wenzel Jakob and Jason Rhinelander and Dean Moldovan},
year = {2017},
note = {https://github.com/pybind/pybind11},
title = {pybind11 -- Seamless operability between C++11 and Python}
}

@misc{esig,
author = {Terry Lyons and David Maxwell},
year = 2017,
note = {https://github.com/datasig-ac-uk/esig},
title = {esig}
}

@misc{coropa_project,
author = {Stephen Buckley and Djalil Chafai and Greg Gyurk\`{o} and Arend
Janssen and Christophe Ladroue and Christian Litterer and Terry Lyons
and ChangLiang Xu},
year = 2006,
note = {https://coropa.sourceforge.io},
title = {CoRoPa -- Computational Rough Paths project}
}

@inproceedings{NEURIPS2019_d2cdf047,
author = {Kidger, Patrick and Bonnier, Patric and Perez Arribas, Imanol and Salvi, Cristopher and Lyons, Terry},
booktitle = {Advances in Neural Information Processing Systems},
editor = {H. Wallach and H. Larochelle and A. Beygelzimer and F. d\textquotesingle Alch\'{e}-Buc and E. Fox and R. Garnett},
pages = {},
publisher = {Curran Associates, Inc.},
title = {Deep Signature Transforms},
url = {https://proceedings.neurips.cc/paper_files/paper/2019/file/d2cdf047a6674cef251d56544a3cf029-Paper.pdf},
volume = {32},
year = {2019},
doi = "10.48550/arXiv.1905.08494"
}

@misc{dlpack,
author = {DLPack},
year = 2023,
title = {Open In Memory Tensor structure},
note = {https://dmlc.github.io/dlpack/latest/}
}

@Manual{Granlund12,
title = "{GNU MP}: {T}he {GNU} {M}ultiple {P}recision {A}rithmetic {L}ibrary",
author = "Torbjörn Granlund and {the GMP development team}",
edition = "5.0.5",
year = 2012,
note = "http://gmplib.org/"
}
Loading