pyvrp / vrplib Goto Github PK

Python package to read and write vehicle routing problem instances.

Home Page: https://github.com/PyVRP/VRPLIB

License: MIT License

Python 100.00%

capacitated-vehicle-routing-problem cvrp operations-research python travelling-salesman-problem tsp vehicle-routing-problem vrp cvrplib instances

vrplib's Introduction

PyVRP is an open-source, state-of-the-art vehicle routing problem (VRP) solver. It currently supports VRPs with:

Pickups and deliveries between depots and clients (capacitated VRP, VRP with simultaneous pickup and delivery, VRP with backhaul);
Vehicles of different capacities, costs, shift durations, routing profiles, and maximum distance and duration constraints (heterogeneous fleet VRP, site-dependent VRP);
Time windows, client service durations, and release times (VRP with time windows and release times);
Multiple depots (multi-depot VRP);
Optional clients with prizes for visiting (prize collecting, team orienteering problem);
Client groups imposing additional restrictions on multiple clients jointly (generalised VRP, VRP with multiple time windows).

PyVRP is available on the Python package index as pyvrp. It may be installed in the usual way as

pip install pyvrp

This also resolves the few core dependencies PyVRP has. The documentation is available here.

Tip

If you are new to vehicle routing or metaheuristics, you might benefit from first reading the introduction to VRP and introduction to HGS pages.

Examples

We provide some example notebooks that show how PyVRP may be used to solve vehicle routing problems. These include:

A short tutorial and introduction to PyVRP's modelling interface, here. This is a great way to get started with PyVRP.
A notebook solving classical VRP variants, here. In this notebook we solve several benchmark instances of the CVRP and VRPTW problems. We also demonstrate how to use the plotting tools available in PyVRP to visualise the instance and statistics collected during the search procedure.
A notebook implementing a solve method using PyVRP's components, here. This notebook is a great way to dive deeper into how PyVRP works internally.

Contributing

We are very grateful for any contributions you are willing to make. Please have a look here to get started. If you aim to make a large change, it is helpful to discuss the change first in a new GitHub issue. Feel free to open one!

Getting help

Feel free to open an issue or a new discussion thread here on GitHub. Please do not e-mail us with questions, modelling issues, or code examples. Those are much easier to discuss via GitHub than over e-mail. When writing your issue or discussion, please follow the instructions here.

How to cite PyVRP

If you use PyVRP in your research, please consider citing the following paper:

Wouda, N.A., L. Lan, and W. Kool (2024). PyVRP: a high-performance VRP solver package. INFORMS Journal on Computing, forthcoming. https://doi.org/10.1287/ijoc.2023.0055

Or, using the following BibTeX entry:

@article{Wouda_Lan_Kool_PyVRP_2024,
  doi = {10.1287/ijoc.2023.0055},
  url = {https://doi.org/10.1287/ijoc.2023.0055},
  year = {2024},
  publisher = {INFORMS},
  author = {Niels A. Wouda and Leon Lan and Wouter Kool},
  title = {{PyVRP}: a high-performance {VRP} solver package},
  journal = {INFORMS Journal on Computing},
}

A preprint of this paper is available on arXiv. Since PyVRP extends HGS-CVRP, please also consider citing Vidal (2022).

vrplib's People

Contributors

Stargazers

Watchers

Forkers

junhua wouterkool batman0911 sollalf rw-jhyt

vrplib's Issues

Download full instance and solution sets

Would be nice to download the full instance and solution sets in one function call. I'm not sure yet how to implement this. Some ideas:

Provide a new function named download_set, which can be used like

download_set("X", output_dir="instances/")

- Two directions here: 
	- This function downloads the corresponding "X" set zip and unzips it.
	- It downloads all "X" instances and solutions.

Allow for globs in download_instance and download_solution.

Add support for downloading EURO-NeurIPS instances

I have uploaded the instances at https://github.com/VRPLIB/VRPTW to experiment with this idea.

TODOs

Add BKS to VRPLIB/VRPTW
Ask Wouter how these instance should be named / should there be an extended set?

Check if depot section ends with -1

        # TODO: check must be done by VRPLIB
        # ("data/DepotSectionDoesNotEndInMinusOne.txt", RuntimeError),

Timeout when download takes too long

The CVRPLIB website has been offline the entire day. The code currently just hangs waiting for a response. I think we should throw an error if downloading hasn't finished after 20 seconds.

Write VRP instances

It would be nice to have a function that could also write VRP instances. This was provided in the EURO-NeurIPS competition.

Check if LKH-3 instances can be read

Optional distance computation for large instances

CVRPLIB has an XXL instance set (e.g., http://vrp.atd-lab.inf.puc-rio.br/index.php/en/plotted-instances?data=Flanders2). vrplib takes very long to parse these large instances due to the distance matrix (size $O(n^2)$). For Brussels1 (11K nodes), it takes about 1 minute on my laptop and 99.9% time is spent on computing the distance matrix.

I think it's unreasonable to compute the distances matrix for such large instances, so we should probably add an argument to disable the computation of distance matrices.

Read LKH-3 VRPLIB formatted instance

cvrplib.read currently supports instances from CVPRLIB. I want to extend this to be able to read instances for LKH-3 as well. The formatting is not much different I think, so it's mostly about how the codebase is organized.

The output does need to change. I currently use CVRP and VRPTW classes, but I think I want only to output a dictionary.

Add support for Python 3.11

Setup readthedocs

Good to start early with this :-)

Standardize output

To support more instances, it's best to standardize the output to a dictionary. Currently I have added extra attributes etc. which I might want to remove as well later.

TODOs

Reorganizes code base
Standardizes output to dictionary / removes Instance class.
Removes DEPOT constant

Update README about download changes

I forgot to update the README about the new download interface.

Change package name to cvrplib

I thought it would be helpful to name the package to pycvrplib to make it apparant that the library is written for Python, but I think that's being overly pedantic. I should rename the package to cvrplib, just like tsplib95

Bug in distance calculation from flattened matrix

Just found a bug in the distance calculation from flattened distance matrices.
The function 'from_flattened' (line 141 in cvrp.py) assumes that
" The numbers in a flattened list correspond the matrix element indices
(1, 0), (2, 0), (2, 1), (3, 0), (3, 1), (3, 2), (4, 0), ..."
However, for the instance E-n13-k4 the flattened list correponds to indices
(0, 1), (0, 2), (0, 3), (0, 4), ... (11,12).
Changing line 150
indices = sorted([(i, j) for (j, i) in combinations(range(n), r=2)])
to
indices = sorted([(i, j) for (i, j) in combinations(range(n), r=2)])
resolves this issue.
However, I do not know whether the original sorting might be correct for some other instances.

Hard-code instance names

It should not be necessary to make a request to the library to obtain all instance names. The instance names don't change over time, only the best known solutions do.

We can simply add a .txt file to this library, which gets updated at every new instances set.

Change list_instances to list_names

Default argument for path in download & allow dirs

Users must pass a path argument to download_instance.

This shouldn't be necessary. If it's unspecified, then it'll just be downloaded and saved to where the function is called from?
The file name does not need to be specified. If a dir is provided instead, then save it to that dir with the original file name.

Increase coverage to 100%

Try out Ruff

A fast Python linter, looks promising and can be used to replace flake8 and isort.

https://github.com/charliermarsh/ruff

Instances generator

There are plenty of descriptions in the literature to generate new instances. See among others:

Would be interesting to provide functions that could do that at some point.

Update documentation about VRPLIB

Remove lru_cache

https://github.com/leonlan/VRPLIB/blob/542d014dd49c2c108fefd891918bed374fcdab4b/vrplib/download/download_instance.py#L8

https://github.com/leonlan/VRPLIB/blob/542d014dd49c2c108fefd891918bed374fcdab4b/vrplib/download/download_solution.py#L8

Caching no longer makes sense since the files are saved locally.

Change key names for reading Solomon instances

https://github.com/leonlan/CVRPLIB/blob/46b444e61f8e5f72a87aa7f36fc72bb40a3729d1/cvrplib/parse/parse_solomon.py#L28-L39

The key names should not be altered but instead follow the exact names from the instance data.

Redesign test suite

I have been sloppy in writing my tests. Especially with the module rehaul, most of the tests are not in the right place, or not even implemented at all. I think that coverage can be very helpful to detect the missing spots.

TODOs

Add coverage
Implement tests for parse module
Gather all the VRPLIB instances that you can find
- ORTEC Euro-NeurIPS VRPTW instances

Replace requests with urllib

Annotate dict

Type annotate dict https://stackoverflow.com/questions/64938027/type-annotation-for-dict-arguments with ypedict

Cache installations in CI

The CI is pretty slow.

In the ALNS project, caching was added to the CI and reduced the CI to below 1 minute for each Python version. See N-Wouda/ALNS@dcff19a.

Verify that instance is VRPLIB format

We should verify that an instance is in VRPLIB format. I don't know yet what needs to be checked (is there a minimum requirement?), but that's something we can figure out along the way.

Instance/solution fields as numpy arrays instead of lists

The array-like fields in the Instance and Solution classes are current of type List or List[List]. I prefer to change this to numpy arrays, as they provide more useful methods when solving VRP problems. The only hurdle is that I'm not sure how to annotate numpy arrays in a meaningful way. Maybe refer to this?

Separate read/download instance and solution

To make the code more explicit, I want to make separate functions for reading instances and solutions, and also downloading instances and solutions.

Example:

import cvrplib

# Read
instance = cvrplib.read_instance('/path/to/A-n32-k5.vrp')
solution = cvrplib.read_solution('/path/to/A-n32-k5.sol')

# Download
instance = cvrplib.download_instance('/path/to/A-n32-k5.vrp')
solution = cvrplib.download_solution('/path/to/A-n32-k5.sol')

After finishing #14, I also want to have separate functions for writing instances and solutions:

# Write
cvrplib.write_instance('/path/to/A-n32-k5.vrp', specifications, sections)
cvrplib.write_solution('/path/to/A-n32-k5.sol', routes)

Download instances and solutions locally

Downloading an instance currently does not save the instance locally. We should rewrite the code to actually download the files to save them locally.

The user would then have to write this:

import vrplib

# Download the instance and solution locally
vrplib.download_instance("X-n101-k25", "/path/to/X-n101-k25.vrp")
vrplib.download_solution("X-n101-k25", "/path/to/X-n101-k25.sol")

# Read the downloaded instance and solution
instance = vrplib.read_instance("/path/to/X-n101-k25.vrp")
solution = vrplib.read_solution("/path/to/X-n101-k25.sol")

Use numpy testing library

I have been using standard assert statements and numpy.testing assertions. I think it's good to make this more consistent, and to use numpy.testing everywhere.

Host instances and best known solutions

In the long term, we could make a separate repository to maintain the instances and best known solutions.

Using Github Actions we can make this process mostly automated.
Templates can streamline the submission process
We need to write a validator for each problem type. See #27

Check validity of instances

Except for some edge weight format and types, the current code does not check if the instance data are correct. For example, time windows can be checked for early <= late. Or that all data sections should have enough entries.

As of now, the library is just really about reading existing instances, so there should be no need to checking validity of the instances. But at some point it might be useful to include this as well, combined with #34 and #14.

Test solomon instance parsing

Speed-up parsing

It currently takes about 2 seconds to read a 1000-customer instance without explicit matrix:

In [73]: %timeit vrplib.read_instance(f"tmp/X-n1001-k43.txt")
2.14 s ± 44 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

This is slow. I think it's mainly due to computing the Euclidean distances. For example, another 1000 customer instance with explicit matrix reads much faster:

In [76]: %timeit vrplib.read_instance(f"tmp/Loggi-n1001-k31.txt")
246 ms ± 8.88 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Distance matrix computation performance

It seems reading a large instance is rather slow, most likely due to computing the distance matrix using a Python for loop. E.g. for GH1000 instance, running on Colab:

!pip install vrplib
import vrplib
vrplib.download_instance("RC2_10_8", "RC2_10_8.vrp")

%%timeit
instance = vrplib.read_instance("RC2_10_8.vrp", instance_format="solomon")

3.35 s ± 178 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Pass custom distance rounding functions

For the non-integral CVRP instances and VRPTW instances, I am unable to compute the same cost as given by the solution.

Roadmap V1.0

The library currently supports reading and downloading instances from the CVRPLIB library. I'm going to work towards a V1 version of this library to make it support more general VRPLIB instances, particularly those from the LKH-3 library. I think that the LKH-3 and CVRPLIB library are currently the best maintained VRP libraries with a consistent format. Moreover, LKH-3 has way more instances than CVRPLIB, which only has CVRP and VRPTW instances.

New features that I want to introduce:

Read LKH-3 VRPLIB formatted instances
Write VRPLIB formatted instances #14
Extensively test VRPLIB reading

Some edge cases (mostly about how to support CVRPLIB):

Maybe drop support for downloading CVRPLIB instances? I think it's nice feature to have, but such a feature adds complexity to the library.
- I'll keep the download feature for now because I think it's quite useful for running quick instances on Colab.
Maybe drop support for Solomon/GH instances that are not formatted by VRPLIB standards? I think we need this still because CVRPLIB relies on it.
- We will not drop support for Solomon-formatted instances. Instead, read_instance will take the style argument which allows to parse different instance formatting styles (i.e., vrplib and solomon).

Issues

Simplify regex in parse_solution

https://github.com/leonlan/CVRPLIB/blob/8a10384ce8d69ab621c10f7f32e5ff776d92e1d4/cvrplib/parse_solution.py#L31

Not sure if better..

 route = [int(cust) for cust in re.match(r"Route #\d+: (.*)", line).group(1).split()]

Add support for VRPTW instances

Our package currently does not support reading and downloading VRPTW instances, i.e., instances that belong to the Solomon and Homberger and Gehring set. Support for this will be in a later version. A new parser function needs to be written for these instances.

Change Instance representation

It's currently just a wall of data. It would be nice to have all fields displayed.

Verification functions

It would be nice to provide functions that can verify any VRP-type solution. Of course, we need to make a verification function for each type of VRP-type problem.

I have no idea yet how the interface will look like. But here's an idea:

from cvrplib import read_instance, read_solution, verify

instance = read_instance(instance_path)
solution = read_solution(solution_path)

# Pass the solution and all relevant instance attributes
verify.cvrp(solution, instance['capacity']) 
verify.vrptw(solution, instance['capacity'], instance['duration_matrix'], instance['time_windows'])

Plotting functions

We should provide some simple plotting functionalities at some point.

Optional parameters for distance rounding

The current implementation always rounds the distances to the nearest integer. It might be helpful for the non-integral instances to allow for different rounding functions.

Related issue: #7

Ignore -1 in depot section

The -1 in the depot section does not seem to serve any purpose. I think it's OK if we do not raise in that case.

Deprecate pkg_resource.read_text

========================================================================================================== warnings summary ==========================================================================================================
tests/download/test_list_names.py::test_list_names[case0]
  /Users/leonlan/Dropbox/cvrplib/vrplib/download/list_names.py:52: DeprecationWarning: read_text is deprecated. Use files() instead. Refer to https://importlib-resources.readthedocs.io/en/latest/using.html#migrating-from-legacy for migration advice.
    fi = pkg_resource.read_text(__package__, "instance_data.csv")

tests/download/test_list_names.py::test_list_names[case0]
  /opt/homebrew/Cellar/[email protected]/3.11.3/Frameworks/Python.framework/Versions/3.11/lib/python3.11/importlib/resources/_legacy.py:80: DeprecationWarning: open_text is deprecated. Use files() instead. Refer to https://importlib-resources.readthedocs.io/en/latest/using.html#migrating-from-legacy for migration advice.
    with open_text(package, resource, encoding, errors) as fp:

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html

---------- coverage: platform darwin, python 3.11.3-final-0 ----------
Coverage XML written to file coverage.xml

=================================================================================================== 86 passed, 2 warnings in 3.39s ===================================================================================================
leonlan@Leons-Air cvrplib %

Check if TSPLIB95 can be read

Make download faster

The download and list_instances functions take several seconds to complete. The parsing of the file itself goes very fast, but receiving the response from the server takes a while. I believe it has to do with the server that hosts the CVRPLIB library. If you know how to improve this, please let me know.

Improve documentation of write_solution

The write_solution function only accept solution as dictionary types, which is odd. We should accept solutions of type List[List[int]] instead.

https://github.com/leonlan/CVRPLIB/blob/46b444e61f8e5f72a87aa7f36fc72bb40a3729d1/cvrplib/write/write_solution.py#L1-L26

Raise error when reading Solomon instance without specifying instance format

I think it'll be common for users to use the read_instance function without explicitly specifying instance_format='solomon'.
What will happen is that the function returns an empty dictionary. There are no verifications (yet) in parse_vrplib #51, so it might be helpful to issue a warning instead.

pyvrp / vrplib Goto Github PK

vrplib's Introduction

Examples

Contributing

Getting help

How to cite PyVRP

vrplib's People

Contributors

Stargazers

Watchers

Forkers

vrplib's Issues

TODOs

TODOs

Recommend Projects

Recommend Topics

Recommend Org