multilspy
multilspy is an LSP client library in Python intended to be used to build applications around language servers.
Science Score: 46.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ✓ Academic publication links: Links to arxiv.org
- ✓ Committers with academic emails: 1 of 19 committers (5.3%) from academic institutions
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (14.8%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: microsoft
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://www.microsoft.com/en-us/research/publication/guiding-language-models-of-code-with-global-context-using-monitors/
- Size: 262 KB
Statistics
- Stars: 420
- Watchers: 5
- Forks: 76
- Open Issues: 58
- Releases: 22
Metadata Files
README.md
Multilspy: LSP client library in Python to build applications around language servers
Introduction
This repository hosts multilspy, a library developed as part of research conducted for the NeurIPS 2023 paper titled "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context" ("Guiding Language Models of Code with Global Context using Monitors" on arXiv). The paper introduces Monitor-Guided Decoding (MGD) for code generation using Language Models, where a monitor uses static analysis to guide the decoding, ensuring that the generated code follows various correctness properties, like absence of hallucinated symbol names, valid order of method calls, etc. For further details about Monitor-Guided Decoding, please refer to the paper and GitHub repository microsoft/monitors4codegen.
multilspy is a cross-platform library designed to simplify the process of creating language server clients to query and obtain results of various static analyses from a wide variety of language servers that communicate over the Language Server Protocol. It is easily extensible to support any language that has a Language Server and we aim to continuously add support for more language servers and languages.
Language servers are tools that perform a variety of static analyses on code repositories and provide useful information such as type-directed code completion suggestions, symbol definition locations, symbol references, etc., over the Language Server Protocol (LSP). Since LSP is language-agnostic, multilspy can provide the results for static analyses of code in different languages over a common interface.
multilspy intends to ease the process of using language servers, by handling various steps in using a language server:
* Automatically handling the download of platform-specific server binaries, and setup/teardown of language servers
* Handling JSON-RPC based communication between the client and the server
* Maintaining and passing hand-tuned server and language specific configuration parameters
* Providing a simple API to the user, while executing all the server-specific protocol steps needed to carry out the query/request.
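To make the JSON-RPC step above concrete, here is a minimal sketch of the message framing that the LSP base protocol defines: each JSON body is preceded by a `Content-Length` header and a blank line. The function names `encode_message` and `decode_message` are illustrative and are not part of multilspy's API; multilspy handles this step internally, and its implementation may differ.

```python
import json


def encode_message(payload: dict) -> bytes:
    """Frame a JSON-RPC payload with the LSP Content-Length header."""
    body = json.dumps(payload, separators=(",", ":")).encode("utf-8")
    header = f"Content-Length: {len(body)}\r\n\r\n".encode("ascii")
    return header + body


def decode_message(data: bytes) -> tuple[dict, bytes]:
    """Parse one framed message and return (payload, remaining bytes)."""
    header, separator, rest = data.partition(b"\r\n\r\n")
    if not separator:
        raise ValueError("incomplete header")
    length = None
    for line in header.split(b"\r\n"):
        name, _, value = line.partition(b":")
        if name.strip().lower() == b"content-length":
            length = int(value.strip())
    if length is None:
        raise ValueError("missing Content-Length header")
    if len(rest) < length:
        raise ValueError("incomplete body")
    return json.loads(rest[:length].decode("utf-8")), rest[length:]
```

The length counts bytes of the UTF-8 encoded body, not characters, which is why the body is encoded before the header is built.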
Some of the analysis results that multilspy can provide are:
- Finding the definition of a function or a class (textDocument/definition)
- Finding the callers of a function or the instantiations of a class (textDocument/references)
- Providing type-based dereference completions (textDocument/completion)
- Getting information displayed when hovering over symbols, like method signature (textDocument/hover)
- Getting list/tree of all symbols defined in a given file, along with symbol type like class, method, etc. (textDocument/documentSymbol)
- Please create an issue/PR to add any other LSP request not listed above
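For orientation, the position-based requests listed above (definition, references, completion, hover) share the same parameter shape in the LSP specification: a document URI plus a zero-based line and character. The sketch below builds such a request as a plain dictionary. The helper names `position_params` and `build_request` are hypothetical and only illustrate the wire format; they are not multilspy functions.

```python
from pathlib import Path


def position_params(file_path: str, line: int, character: int) -> dict:
    """Build TextDocumentPositionParams; LSP positions are zero-based."""
    return {
        "textDocument": {"uri": Path(file_path).resolve().as_uri()},
        "position": {"line": line, "character": character},
    }


def build_request(request_id: int, method: str, file_path: str,
                  line: int, character: int) -> dict:
    """Wrap the params in a JSON-RPC 2.0 request envelope."""
    params = position_params(file_path, line, character)
    if method == "textDocument/references":
        # The references request additionally carries a context object.
        params["context"] = {"includeDeclaration": False}
    return {"jsonrpc": "2.0", "id": request_id, "method": method, "params": params}
```

`textDocument/documentSymbol` differs in that it takes only the `textDocument` identifier and no position.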
Installation
It is ideal to create a new virtual environment with python>=3.10. To create a virtual environment using conda and activate it:
```
conda create -n multilspy_env python=3.10
conda activate multilspy_env
```
Further details and instructions on creation of Python virtual environments can be found in the official documentation. Further, we also refer users to Miniconda, as an alternative to the above steps for creation of the virtual environment.
To install multilspy using pip, execute the following command:
```
pip install multilspy
```
Supported Languages
multilspy currently supports the following languages:
| Code Language | Language Server |
| --- | --- |
| java | Eclipse JDTLS |
| python | jedi-language-server |
| rust | Rust Analyzer |
| csharp | OmniSharp / RazorSharp |
| typescript | TypeScriptLanguageServer |
| javascript | TypeScriptLanguageServer |
| go | gopls |
| dart | Dart |
| ruby | Solargraph |
| kotlin | KotlinLanguageServer |
Usage
Example usage:
```python
from multilspy import SyncLanguageServer
from multilspy.multilspy_config import MultilspyConfig
from multilspy.multilspy_logger import MultilspyLogger
...
config = MultilspyConfig.from_dict({"code_language": "java"}) # Also supports "python", "rust", "csharp", "typescript", "javascript", "go", "dart", "ruby"
logger = MultilspyLogger()
lsp = SyncLanguageServer.create(config, logger, "/abs/path/to/project/root/")
with lsp.start_server():
    result = lsp.request_definition(
        "relative/path/to/code_file.java", # Filename of location where request is being made
        163, # line number of symbol for which request is being made
        4 # column number of symbol for which request is being made
    )
    result2 = lsp.request_completions(
        ...
    )
    result3 = lsp.request_references(
        ...
    )
    result4 = lsp.request_document_symbols(
        ...
    )
    result5 = lsp.request_hover(
        ...
    )
    ...
```
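As a rough illustration of consuming such a result, the sketch below formats locations for display. It assumes that a definition or references result is a list of dictionaries shaped like the LSP `Location` type (a `uri` plus a `range` with zero-based `start` and `end` positions). That shape comes from the LSP specification; the exact fields multilspy returns may include more. The helper name and the sample data are made up for illustration.

```python
def format_location(location: dict) -> str:
    """Render an LSP-style Location as 'uri:line:column' with 1-based numbers."""
    start = location["range"]["start"]
    return f'{location["uri"]}:{start["line"] + 1}:{start["character"] + 1}'


# Illustrative data only, not real output from a language server.
sample_result = [
    {
        "uri": "file:///abs/path/to/project/root/Foo.java",
        "range": {
            "start": {"line": 10, "character": 4},
            "end": {"line": 10, "character": 12},
        },
    }
]

for location in sample_result:
    print(format_location(location))
```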
multilspy also provides an asyncio based API which can be used in async contexts. Example usage (asyncio):
```python
from multilspy import LanguageServer
...
lsp = LanguageServer.create(...)
async with lsp.start_server():
    result = await lsp.request_definition(
        ...
    )
    ...
```
The file src/multilspy/language_server.py provides the multilspy API. Several tests for multilspy present under tests/multilspy/ provide detailed usage examples for multilspy. The tests can be executed by running:
```bash
pytest tests/multilspy
```
Use of multilspy in AI4Code Scenarios like Monitor-Guided Decoding
multilspy provides the features that the Language Server Protocol offers to IDEs like VS Code. It is useful for developing toolsets that interface with AI systems such as Large Language Models (LLMs).
Monitor-Guided Decoding
One such use case is Monitor-Guided Decoding (MGD), where multilspy is used to obtain results of static analyses, like type-directed completions, to guide the token-by-token generation of code using an LLM. This ensures that all generated identifier/method names are valid in the context of the repository, significantly boosting the compilability of generated code. MGD also demonstrates the use of multilspy to create monitors that ensure all function calls in LLM-generated code receive the correct number of arguments, and that functions of an object are called in the right order following a protocol (like not calling "read" before "open" on a file object).
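The core idea of guiding decoding with completions can be sketched with a toy example. This is a simplified illustration and not the paper's implementation: tokens are plain strings, the set of valid names stands in for what a completion request might return, and tokens that would finish an identifier and continue past it (such as `Name(`) are ignored. All names here are hypothetical.

```python
import math


def mask_logits(logits: dict[str, float], typed_prefix: str,
                valid_names: set[str]) -> dict[str, float]:
    """Keep tokens that extend typed_prefix toward some valid name; mask the rest."""
    masked = {}
    for token, score in logits.items():
        candidate = typed_prefix + token
        allowed = any(name.startswith(candidate) for name in valid_names)
        masked[token] = score if allowed else -math.inf
    return masked


def pick_token(logits: dict[str, float]) -> str:
    """Greedy decoding: choose the highest-scoring token."""
    return max(logits, key=logits.get)


# Hypothetical member names suggested by static analysis for the current object.
valid_names = {"getName", "getAge"}
# Hypothetical model scores for the next token after the partial identifier "get".
logits = {"Name": 1.0, "Addr": 2.5, "Ag": 0.5}

print(pick_token(logits))                                   # unguided choice
print(pick_token(mask_logits(logits, "get", valid_names)))  # guided choice
```

Unguided, the model would pick `Addr` and produce the nonexistent `getAddr`. With the mask applied, only `Name` and `Ag` remain candidates, so the greedy choice becomes `Name`.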
Multilspy in other use cases
- "Fix the Tests: Augmenting LLMs to Repair Test Cases with Static Collector and Neural Reranker," in 2024 IEEE 35th International Symposium on Software Reliability Engineering (ISSRE)
- Tutorial on obtaining python completions with multilspy
- Gathering and utilizing repository-wide context for repository-level coding agents
Frequently Asked Questions (FAQ)
asyncio-related RuntimeError when executing the tests for MGD
If you get the following error:
RuntimeError: Task <Task pending name='Task-2' coro=<_AsyncGeneratorContextManager.__aenter__() running at
python3.8/contextlib.py:171> cb=[_chain_future.<locals>._call_set_state() at
python3.8/asyncio/futures.py:367]> got Future <Future pending> attached to a different loop python3.8/asyncio/locks.py:309: RuntimeError
Please ensure that you create a new environment with Python >=3.10. For further details, please have a look at the StackOverflow Discussion.
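One way to catch this early is to check the interpreter version before starting a language server. The guard below is a minimal sketch; the function name and the `(3, 10)` constant simply mirror the requirement stated above and are not part of multilspy.

```python
import sys

MIN_PYTHON = (3, 10)


def python_is_supported(version_info=sys.version_info) -> bool:
    """Return True when the interpreter meets the minimum version above."""
    return tuple(version_info[:2]) >= MIN_PYTHON


if not python_is_supported():
    print(f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]}+ is recommended; "
          f"found {sys.version_info.major}.{sys.version_info.minor}")
```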
Citing Multilspy
If you're using Multilspy in your research or applications, please cite using this BibTeX:
@inproceedings{NEURIPS2023_662b1774,
author = {Agrawal, Lakshya A and Kanade, Aditya and Goyal, Navin and Lahiri, Shuvendu and Rajamani, Sriram},
booktitle = {Advances in Neural Information Processing Systems},
editor = {A. Oh and T. Naumann and A. Globerson and K. Saenko and M. Hardt and S. Levine},
pages = {32270--32298},
publisher = {Curran Associates, Inc.},
title = {Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context},
url = {https://proceedings.neurips.cc/paper_files/paper/2023/file/662b1774ba8845fc1fa3d1fc0177ceeb-Paper-Conference.pdf},
volume = {36},
year = {2023}
}
Contributing
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.
Trademarks
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.
Owner
- Name: Microsoft
- Login: microsoft
- Kind: organization
- Email: opensource@microsoft.com
- Location: Redmond, WA
- Website: https://opensource.microsoft.com
- Twitter: OpenAtMicrosoft
- Repositories: 7,257
- Profile: https://github.com/microsoft
Open source projects and samples from Microsoft
GitHub Events
Total
- Create event: 42
- Release event: 21
- Issues event: 62
- Watch event: 312
- Delete event: 2
- Member event: 1
- Issue comment event: 239
- Push event: 51
- Pull request event: 118
- Pull request review comment event: 77
- Pull request review event: 106
- Fork event: 73
Last Year
- Create event: 42
- Release event: 21
- Issues event: 62
- Watch event: 312
- Delete event: 2
- Member event: 1
- Issue comment event: 239
- Push event: 51
- Pull request event: 118
- Pull request review comment event: 77
- Pull request review event: 106
- Fork event: 73
Committers
Last synced: 9 months ago
Top Committers
| Name | Email | Commits |
|---|---|---|
| Lakshya A Agrawal | l****l@b****u | 38 |
| Steve Brudz | s****z@d****o | 21 |
| nj.jo | j****v@g****m | 19 |
| themichaelusa | m****o@i****m | 19 |
| v4rgas | 6****s | 14 |
| Avi Avni | a****i@g****m | 9 |
| Microsoft Open Source | m****e | 5 |
| Jet Zhou | j****t@j****m | 4 |
| Developer | t****e@t****x | 3 |
| Andrej730 | a****v@g****m | 3 |
| pratham1002 | p****a@d****i | 3 |
| Nasser Mohamed | n****t@g****m | 2 |
| Aditya Kanade | a****e@g****m | 1 |
| Huang YunKun | h****n@g****m | 1 |
| Jason | j****3@g****m | 1 |
| SunBK201 | s****m@g****m | 1 |
| microsoft-github-operations[bot] | 5****] | 1 |
| moz-mdauer | m****r@m****m | 1 |
| mrT23 | t****r@c****i | 1 |
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 54
- Total pull requests: 129
- Average time to close issues: 22 days
- Average time to close pull requests: 9 days
- Total issue authors: 33
- Total pull request authors: 31
- Average comments per issue: 0.61
- Average comments per pull request: 1.5
- Merged pull requests: 86
- Bot issues: 0
- Bot pull requests: 3
Past Year
- Issues: 49
- Pull requests: 125
- Average time to close issues: 9 days
- Average time to close pull requests: 7 days
- Issue authors: 30
- Pull request authors: 31
- Average comments per issue: 0.37
- Average comments per pull request: 1.54
- Merged pull requests: 83
- Bot issues: 0
- Bot pull requests: 2
Top Authors
Issue Authors
- LakshyAAAgrawal (9)
- imanewman (4)
- Andrej730 (4)
- v4rgas (3)
- jason-sachs (2)
- SaitejaUtpala (2)
- masterSunflowers (2)
- jbellis (2)
- MischaPanch (2)
- adlternative (1)
- good-riverdeer (1)
- jinulee-v (1)
- InternetOfTofu (1)
- jbyibm (1)
- crystalcd (1)
Pull Request Authors
- LakshyAAAgrawal (42)
- v4rgas (10)
- nassermohamedit (6)
- Andrej730 (6)
- jetzhou (6)
- IDontHaveBrain (4)
- themichaelusa (4)
- argonaut0 (4)
- zjxszzzcb (4)
- dependabot[bot] (3)
- pratham1002 (2)
- mrT23 (2)
- shplack (2)
- htynkn (2)
- SunBK201 (2)
Packages
- Total packages: 2
- Total downloads: pypi 12,632 last-month
- Total dependent packages: 0 (may contain duplicates)
- Total dependent repositories: 0 (may contain duplicates)
- Total versions: 32
- Total maintainers: 1
proxy.golang.org: github.com/microsoft/multilspy
- Documentation: https://pkg.go.dev/github.com/microsoft/multilspy#section-documentation
- License: mit
- Latest release: v0.0.15 (published 11 months ago)
pypi.org: multilspy
A language-agnostic LSP client in Python, with a library interface. Intended to be used to build applications around language servers. Currently multilspy supports language servers for Python, Rust, Java, Go, JavaScript, Ruby, C# and Dart. Originally appeared as part of Monitor-Guided Decoding (https://github.com/microsoft/monitors4codegen)
- Homepage: https://github.com/microsoft/multilspy
- Documentation: https://multilspy.readthedocs.io/
- License: mit
- Latest release: 0.0.15 (published 11 months ago)
Maintainers (1)
Dependencies
- actions/checkout v3 composite
- github/codeql-action/analyze v2 composite
- github/codeql-action/autobuild v2 composite
- github/codeql-action/init v2 composite
- jedi-language-server ==0.41.1
- pydantic ==1.10.5
- requests ==2.32.3
- jedi-language-server ==0.41.1
- pydantic ==1.10.5
- pytest ==7.3.1
- pytest-asyncio ==0.21.1
- requests ==2.32.3