Automatic language detection #14

markusressel · 2019-01-22T19:14:14Z

Is your feature request related to a problem? Please describe.
Currently the dev has to know what syntax highlighter to use for a given text.

Describe the solution you'd like
The KodeEditor (or a layer in between) should be able to detect what language is most likely used and apply syntax highlighting automatically. This behaviour should be optional so that the dev can still force a specific language if desired.

markusressel · 2019-01-22T21:12:06Z

Using something like this would be an option, although the trained models are pretty big (approx. 150 MB):
https://github.com/aliostad/deep-learning-lang-detection

Integrating this seems to be relatively easy:
https://medium.com/capital-one-tech/using-a-pre-trained-tensorflow-model-on-android-part-2-153ebdd4c465

GitHub
aliostad/deep-learning-lang-detection
Deep Learning using Keras to detect programming language of a file or snippet - aliostad/deep-learning-lang-detection

Medium
Using a Pre-Trained TensorFlow Model on Android — Part 2
In Part 1, I introduced you to the TensorFlowInferenceInterface and the org.tensorflow:tensorflow-android dependency. Together they provide an easy way to embed pre-trained TensorFlow models in your…

markusressel · 2020-08-04T22:30:52Z

A more naive approach could be to simply count the number of role matches for all available rule books and use the one with the highest count.

markusressel · 2020-08-04T22:35:29Z

It would also be nice to inlude common file extensions in the rule book, to detect the language simply based on the file name.

Both detection variants should be usable independently.

markusressel · 2021-04-14T02:07:18Z

Also interesting:
https://github.com/dlaststark/machine-learning-projects/tree/master/Programming%20Language%20Detection

https://medium.com/swlh/detecting-programming-languages-from-code-snippets-d758589bddb0

markusressel self-assigned this Jan 22, 2019

markusressel transferred this issue from markusressel/KodeEditor Jun 27, 2020

markusressel added the enhancement New feature or request label Apr 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatic language detection #14

Automatic language detection #14

markusressel commented Jan 22, 2019

markusressel commented Jan 22, 2019 •

edited by unfurl-links bot

Loading

markusressel commented Aug 4, 2020

markusressel commented Aug 4, 2020

markusressel commented Apr 14, 2021 •

edited

Loading

Automatic language detection #14

Automatic language detection #14

Comments

markusressel commented Jan 22, 2019

markusressel commented Jan 22, 2019 • edited by unfurl-links bot Loading

markusressel commented Aug 4, 2020

markusressel commented Aug 4, 2020

markusressel commented Apr 14, 2021 • edited Loading

markusressel commented Jan 22, 2019 •

edited by unfurl-links bot

Loading

markusressel commented Apr 14, 2021 •

edited

Loading