Comments (11)
@YuxuanSnow Could you please share the snippet code that you used for point cloud generation and visualization?
Thanks in advance
from marigold.
thank you very much, i was able to get it working in my 3d editor. The amount of information you can retrieve is impressive. Even on a frontal human face, it can retrieve a profile that is not perfect, but functional for quick retouching. Really amazing.
from marigold.
Hi,
Thank you for your interest in Marigold and the generation of 3D point clouds from it.
Please note that Marigold predicts affine invariant depth, not metric depth. Meaning that even if the camera intrinsics are known, the unprojection function still is undefined by a scale and a shift. For the example of @YuxuanSnow the scale and shift are not chosen appropriately.
We quickly described our workflow in the Appendix Section 4 of the paper. In practice, it means that you have to try interatively try configurations of scale and shifts, and pick the visually best one.
Hope that helps. Feel free to ask if you have questions.
Edit: grammer
from marigold.
Hi,
Thank you for your interest in Marigold and the generation of 3D point clouds from it.
Please note that Marigold predicts affine invariant depth, not metric depth. Meaning that even if the the camera intrinsics are known. The unprojection function still is undefined by a scale and a shift. For the example of @YuxuanSnow the scale and shift are not chosen appropriately.
We quickly described our workflow in the Appendix Section 4. In practice, it means that you have to try interatively try configurations of scale and shifts, and pick the visually best one.
Hope that helps. Feel free to ask if you have questions.Hi
Thank you very much for your reply. Unfortunately, I am more of a 3d modeler than a programmer or researcher, so even going by trial and error is extremely complicated for me. Would it be possible to have a basic equation where it would be easy to intervene in the scaling and shift factors? Simple a/(depth+b) don't seems to work
I understand that it transcends the purpose of the research, but it would then be great in the future to have a built-in tool with which to play with parameters while viewing the point cloud, and then export a linearized depth map.
Thanks in advance
Hello,
Marigold predicts affine-invariant (linear) depth, which can be written as
Best.
from marigold.
However, i think my unprojection function is not valid for some scenario such as indoor room.
As you can see, the point cloud doesn't have a flat floor and ceiling.
I believe the reason is because of the normalization scale of the predicted depth range. Can you also provide the unprojection function that you used to generate point cloud in the paper?
from marigold.
Hi,
nice work! I always want to compare the predicted depth with the colorful unprojected point cloud. I compared Marigold, ZoeDepth, OmniDatav2. I tried the following image.
It's interesting that previous method predicts more likely to a flat geometry, while Marigold can preserve it better :-)
Hello,
Can you share how to generate the point clouds from the depth maps? And also what tool are you using to visualize the 3d point cloud?
from marigold.
I am also facing the same issue, the pointcloud looks distorted on indoor images.
from marigold.
@YuxuanSnow what expression did you use? is not perfect, but not so bad (far better than mine, for sure)
from marigold.
Hi,
Thank you for your interest in Marigold and the generation of 3D point clouds from it.
Please note that Marigold predicts affine invariant depth, not metric depth. Meaning that even if the the camera intrinsics are known. The unprojection function still is undefined by a scale and a shift. For the example of @YuxuanSnow the scale and shift are not chosen appropriately.
We quickly described our workflow in the Appendix Section 4. In practice, it means that you have to try interatively try configurations of scale and shifts, and pick the visually best one.
Hope that helps. Feel free to ask if you have questions.
Hi
Thank you very much for your reply.
Unfortunately, I am more of a 3d modeler than a programmer or researcher, so even going by trial and error is extremely complicated for me.
Would it be possible to have a basic equation where it would be easy to intervene in the scaling and shift factors?
Simple a/(depth+b) don't seems to work
I understand that it transcends the purpose of the research, but it would then be great in the future to have a built-in tool with which to play with parameters while viewing the point cloud, and then export a linearized depth map.
Thanks in advance
from marigold.
@coccofresco Thank you! If the data and time permit, could you please showcase some of the results on Twitter and refer our original announcement post? https://twitter.com/AntonObukhov1/status/1732946419663667464?t=8iIVVDbbrhwGHQgHNOoj2A&s=19
from marigold.
affine invariant depth
Thanks for the great work.
I was wondering if I have some prior knowledges on the scale and location (both object & camera) of the object I want predict (e.g. a rough point cloud, estimated 3D bounding-box, etc.), will it be possible to adjust Marigold prediction?
Also I wonder if the training script will be released in the future for fine-tuning the model.
from marigold.
Related Issues (20)
- the test results seem have much noise HOT 4
- Regarding the training convergence HOT 4
- Training on Custom Dataset HOT 1
- the prediction on in-the-wild example is noisy HOT 1
- Is code from the Bas-relief available
- Why set NaN depth values to zero on preprocessing? HOT 1
- Request for vkitti_val.tar and vkitti_vis.tar files HOT 1
- How to Manage the Large Hypersim Dataset for Reproduction? HOT 3
- ask for LCM distillation code HOT 2
- Low-Rank(LoRA) training of Marigold
- Multi-GPU Training HOT 1
- Clarification Needed: Training and Inference Pipeline HOT 4
- Any reason for not using vae.std to generate RGB latent? HOT 1
- do you plan to release better and more accurate models for this original marigold? HOT 1
- The purpose of using v_prediction as the target? HOT 1
- where can I get the "output/marigold_base/checkpoint/latest" HOT 1
- The demo 3D looks ok but not match with the predicted depth image HOT 1
- How to organize the vkitti data HOT 1
- train the model on my custom dataset HOT 1
- Unusual slow training speed HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from marigold.