lendup / fs2-blobstore
Minimal, idiomatic, stream-based Scala interface for key/value store implementations
License: Apache License 2.0
This test will fail:
it should "extend a path with no key correctly" in {
val path = Path("some-bucket") / "key"
path must be(Path("some-bucket", "key", None, false, None))
}
with: "some-bucket//key" was not equal to "some-bucket/key"
The extension code doesn't check for an empty initial key, so you end up with a double '/' in the resulting path.
I'm happy to put together a PR to fix this. Either it can check for a blank key string when extending, or I could change Path.key to Option[String] to make the empty case more explicit.
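A minimal sketch of the blank-key check, using a simplified Path model (assumed shape; the real Path carries more fields than shown here):

```scala
// Simplified Path (assumed shape): skip the '/' separator when the current
// key is empty, so Path("some-bucket") / "key" renders as "some-bucket/key",
// not "some-bucket//key".
final case class Path(root: String, key: String = "") {
  def /(segment: String): Path =
    if (key.isEmpty) copy(key = segment)
    else copy(key = s"$key/$segment")

  override def toString: String =
    if (key.isEmpty) root else s"$root/$key"
}
```

The alternative, `key: Option[String]`, would make the empty case impossible to overlook at the type level instead of relying on a runtime check.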
My company is currently still stuck on Scala 2.11, and is likely to skip 2.12 entirely. I need access to the TransferManager version of the S3 blobstore, but there is no release on Central for 0.3.x. Can you please publish this version?
I'm willing to send this in as a PR. How do you want it? One Travis build that runs the current behavior plus the S3 tests?
An alternative approach would be to move the integration tests to src/e2e so they can be invoked separately more easily, and have two Travis builds, one for test and one for e2e. The downside to this approach is that it makes coverage collection more complex.
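For reference, one way the split might look in sbt is the built-in IntegrationTest configuration (this is a sketch using sbt's default src/it layout rather than src/e2e, and the scalatest coordinates are illustrative):

```scala
// build.sbt sketch (assumed, not this project's actual build): wire
// integration tests into their own configuration so `sbt test` and
// `sbt it:test` can run as separate Travis jobs.
lazy val root = (project in file("."))
  .configs(IntegrationTest)
  .settings(
    Defaults.itSettings, // picks up sources from src/it/scala
    libraryDependencies += "org.scalatest" %% "scalatest" % "3.0.8" % "it,test"
  )
```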
Even if it is good practice to provide a size when calling put - especially for S3Store, since Amazon's S3 client will buffer everything in memory before uploading the data - S3Store.put should support paths with no size.
Sample code to reproduce the issue here.
Codecov has not been updated in the last few PRs; we need to investigate.
S3Store invokes TransferManager#upload, whose documentation says:
* When uploading data from a stream, callers <b>must</b> supply the size of
* data in the stream through the content length field in the
* <code>ObjectMetadata</code> parameter.
* If no content length is specified for the input
* stream, then TransferManager will attempt to buffer all the stream
* contents in memory and upload the data as a traditional, single part
* upload. Because the entire stream contents must be buffered in memory,
* this can be very expensive, and should be avoided whenever possible.
When you upload too much this way, you get a java.lang.OutOfMemoryError: Java heap space.
It is possible to do multipart uploads with fixed memory; for reference, this is what Alpakka does for akka-streams:
"Use the low-level API when you [..] do not know the size of the upload data in advance"
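The fixed-memory approach boils down to splitting the incoming bytes into bounded parts and issuing one UploadPart call per part (wrapped in InitiateMultipartUpload / CompleteMultipartUpload). A minimal sketch of just the chunking half, with no AWS calls:

```scala
// Sketch: group an arbitrarily large byte source into fixed-size parts, so
// each upload step only ever holds `partSize` bytes in memory. Each
// Array[Byte] would back one UploadPartRequest; note that S3 requires every
// part except the last to be at least 5 MiB.
def toParts(bytes: Iterator[Byte], partSize: Int): Iterator[Array[Byte]] =
  bytes.grouped(partSize).map(_.toArray)
```

Because only one part is materialized at a time, heap usage stays proportional to the part size rather than the object size.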
This will help downstream folks avoid sbt eviction errors
The SftpStore is a bit clunky at the moment: the apply method takes an F[ChannelSftp], but closes more resources than it acquires - it acquires a ChannelSftp, but closes the session as well.
I think closing the session is fine, but then we should take an F[Session] and just manage the channels internally. As it stands it's impossible to perform concurrent operations on the store, since one channel can handle only a single write at a time, and the store only works on one channel. And if you try to make multiple stores for multiple channels on the client side, each store closes the shared session.
Maybe make something like a pool of ChannelSftp, where put would acquire a channel from the pool and use it. Since it's blocking IO, making the pool unbounded makes sense.
Any thoughts on this?
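A rough shape of that pool, as a plain sketch (hypothetical names, not the library's API; the type parameter A stands in for ChannelSftp):

```scala
import java.util.concurrent.ConcurrentLinkedQueue

// Sketch of an unbounded object pool: hand out an idle instance if one
// exists, otherwise create a new one; released instances are queued for
// reuse. With blocking IO there is little point capping the pool below the
// number of threads that would otherwise block waiting for a channel.
final class UnboundedPool[A](create: () => A) {
  private val idle = new ConcurrentLinkedQueue[A]()

  def acquire(): A = Option(idle.poll()).getOrElse(create())

  def release(a: A): Unit = { idle.add(a); () }
}
```

put would then acquire() a channel, perform the write, and release() it - one channel per in-flight operation instead of one per store, and the session is only closed when the store itself is.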
A changelog in the project would be really helpful for making sure that updates are safe - either via GitHub release tags or a file in the repo.
Although there is an explicit 5GB limit on put, the S3 docs don't mention anything about a limit for downloads. I haven't tested this, but it's possible that there is a size limit on get as well.
We mitigated the size limit issue on uploads by using TransferManager. We should look into using the TransferManager.download(...) method when calling get, and whether that is even necessary.
https://docs.aws.amazon.com/AWSJavaSDK/latest/javadoc/com/amazonaws/services/s3/transfer/TransferManager.html#download-com.amazonaws.services.s3.model.GetObjectRequest-java.io.File-
https://aws.amazon.com/blogs/developer/aws-sdk-for-java-2-0-developer-preview/
The new AWS SDK supports CompletableFuture APIs and a truly non-blocking backend. It would be great to eventually get support for that here.
I can't find the 0.3.x releases of the project.
Tried with:
"com.lendup.fs2-blobstore" %% "s3" % "0.3.0" // "0.3.+" // "0.3.0-SNAPSHOT"
No luck with any of them.
I browsed the Maven repository and found there is no 0.3.0 version yet. However, I could find a SNAPSHOT on Sonatype. Is this intended? Do I need to add the Sonatype snapshots repository to use the 0.3.0 version?
I really need it, because I am having trouble uploading some big files to S3, and the latest release states this is now possible.
Thanks in advance for the help.
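For what it's worth, depending on a -SNAPSHOT does require the Sonatype snapshots resolver in addition to the dependency line; a sketch (the coordinates below are copied from the attempt above, not confirmed as published):

```scala
// build.sbt sketch: snapshot artifacts live in the Sonatype snapshots
// repository, which sbt does not search by default.
resolvers += "sonatype-snapshots" at "https://oss.sonatype.org/content/repositories/snapshots"
libraryDependencies += "com.lendup.fs2-blobstore" %% "s3" % "0.3.0-SNAPSHOT"
```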
We published the first artifact from the new repo to Maven central yesterday. Can we archive this repository and point to the new one?
https://github.com/fs2-blobstore/fs2-blobstore
https://search.maven.org/search?q=g:com.github.fs2-blobstore
On fs2 1.1, cats-effect 2.0, etc.
A milestone release for now, since the dependencies are milestones, but it would be good to get a newer cross-built release up just to allow cross compiling.