raystack / compass

Compass is an enterprise data catalog that makes it easy to find, understand, and govern data.

Home Page: https://compass-raystack.vercel.app/

License: Apache License 2.0

Dockerfile 0.01% Makefile 0.31% Go 98.40% PLpgSQL 1.28%

data dataops discovery lineage metadata

compass's Issues

feat: grpc interface

Is your feature request related to a problem? Please describe.
We need to introduce a new gRPC interface. It could later be used by the CLI when we build it.

Describe the solution you'd like
Approach should be discussed in this issue.

feat(discovery): search by fields (description, data owner, classification tags, and column's table)

Is your feature request related to a problem? Please describe.
I have description and ownership fields in every record. I also expect some tags to be populated in every record. I want users to be able to search by:

description
owner
classification tags
column names (for table type)

Describe the solution you'd like

API Changes
The existing Search API uses HTTP GET and accepts the search text and other filter params as query params, like this:

/search/?text=<text>&filter.environment=integration&filter.landscape=vn&filter.landscape=th

Compared to the existing search, which searches across all available datasets and fields, searching by description, owner, or tags would only consider a specific field of the data. To support that, we could add a new query param called search-by, searchby, or within. Its value would be the field name in the dataset, using Elasticsearch-like dotted field access.
For example, given a dataset schema like this:

{
    "urn": "a-urn",
    "name": "a record",
    "service": "bigquery",
    "description": "a description",
    "data": {
        "properties": {
            "attributes": {
                "dataset": "a_dataset"
            },
            "labels": {
                "created_by": "table creator"
            }
        }
    }
}

Searching by description would have a query param like this

/search/?text=<text>&searchby=description

Searching by dataset would have a query param like this

/search/?text=<text>&searchby=data.properties.attributes.dataset
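A minimal Go sketch of how a handler might translate the proposed param into an Elasticsearch query body. The helper name buildSearchQuery and the choice of `match` versus `multi_match` are assumptions for illustration, not Columbus's actual implementation; `searchby` is just one of the candidate param names listed above.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/url"
)

// buildSearchQuery is a hypothetical helper. With an empty searchBy it
// searches all fields; otherwise it restricts the match to the given
// dotted field path (e.g. "data.properties.attributes.dataset").
func buildSearchQuery(text, searchBy string) map[string]interface{} {
	if searchBy == "" {
		return map[string]interface{}{
			"query": map[string]interface{}{
				"multi_match": map[string]interface{}{
					"query":  text,
					"fields": []string{"*"},
				},
			},
		}
	}
	return map[string]interface{}{
		"query": map[string]interface{}{
			"match": map[string]interface{}{searchBy: text},
		},
	}
}

func main() {
	params, err := url.ParseQuery("text=orders&searchby=data.properties.attributes.dataset")
	if err != nil {
		panic(err)
	}
	query := buildSearchQuery(params.Get("text"), params.Get("searchby"))
	body, err := json.MarshalIndent(query, "", "  ")
	if err != nil {
		panic(err)
	}
	fmt.Println(string(body))
}
```

Keeping the unrestricted branch as the default means existing clients that never send `searchby` would see no change in behaviour.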

add issue template in the repo

Is your feature request related to a problem? Please describe.
Currently we have no defined standard for raising a feature request or bug report in this repo. In other odpf repos (meteor, for example), we have an ISSUE_TEMPLATE to help us standardize the issue format.

Describe the solution you'd like
Add a GitHub ISSUE_TEMPLATE to the columbus repo like the one we defined in meteor.

feat(grpc): add get assets query params filtering to grpc

Is your feature request related to a problem? Please describe.
In #88, we added more filtering features to the Get Assets API. However, this is only for the HTTP API. We need to replicate it in gRPC, since we will deprecate the HTTP API later and use grpc-gateway instead.

Describe the solution you'd like

Add the same query param filtering feature to the GetAllAssets API in gRPC
The filtering in grpc-gateway should behave the same as the existing one
Update the proto here

feat: add filtering in assets api

Is your feature request related to a problem? Please describe.
The pagination feature is already implemented in the v1beta1 assets API, but filtering by certain fields is still not available.

Describe the solution you'd like
Add filtering by certain fields by handling columbus data in a structured way.

filter by types
filter by services
filter by field in asset.data
querying by field (including name and urn)
sort by certain fields (asc, desc)
size to fetch
offset

fix(lint): fix all linter warning

Is your feature request related to a problem? Please describe.
When linting with golangci-lint, I found several linter warnings. We could fix all of them to avoid unwanted bugs or unintended behaviour. After fixing all lint warnings/errors, we could add a lint step to the GitHub workflow.

Describe the solution you'd like
Fix every line of code that throws a lint warning/error when using golangci-lint.

feat: add pagination when fetching records

Is your feature request related to a problem? Please describe.
Columbus always returns all records for a given type when fetching. This can take a long time, since some of my types hold more than 5k records.

Describe the solution you'd like
A new query string to fetch only a certain size and offset, with a default size of e.g. 20
example request

?from=10&size=20

example response

{
  "data": [], // records
  "total": 100 // all available records (this helps with client-side pagination)
}
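The request and response shapes above could be handled roughly as follows. This is a Go sketch over an in-memory slice; parsePaging, paginate, and listResponse are hypothetical names, and a real implementation would push `from` and `size` down to the storage query instead of slicing in memory.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strconv"
)

const defaultSize = 20 // default page size suggested in the issue

type listResponse struct {
	Data  []string `json:"data"`
	Total int      `json:"total"`
}

// parsePaging reads the raw "from" and "size" query values,
// falling back to from=0 and size=defaultSize when they are empty.
func parsePaging(rawFrom, rawSize string) (from, size int, err error) {
	size = defaultSize
	if rawFrom != "" {
		from, err = strconv.Atoi(rawFrom)
		if err != nil || from < 0 {
			return 0, 0, fmt.Errorf("invalid from %q", rawFrom)
		}
	}
	if rawSize != "" {
		size, err = strconv.Atoi(rawSize)
		if err != nil || size < 1 {
			return 0, 0, fmt.Errorf("invalid size %q", rawSize)
		}
	}
	return from, size, nil
}

// paginate returns one window of records plus the total count.
func paginate(records []string, from, size int) listResponse {
	total := len(records)
	if from > total {
		from = total
	}
	end := from + size
	if end > total {
		end = total
	}
	return listResponse{Data: records[from:end], Total: total}
}

func main() {
	records := make([]string, 100)
	for i := range records {
		records[i] = fmt.Sprintf("record-%d", i)
	}
	from, size, err := parsePaging("10", "")
	if err != nil {
		panic(err)
	}
	out, err := json.Marshal(paginate(records, from, size))
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```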

feat: API to patch Asset instead of full update

Is your feature request related to a problem? Please describe.
I have two sources of updating an asset with different information. I don't want each update to overwrite each other.
Both sources will update different fields inside asset.data field.

Describe the solution you'd like
I would like a new API to patch an Asset instead of fully updating it.

Solution:

[PATCH] /v1beta1/assets
{
  "urn": "some-urn", // this is required to identify an asset
  "type": "table", // this is required to identify an asset
  "service": "bigquery", // this is required to identify an asset
  "data": {
    "fieldFromSource1": "some-value"
  }
}
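The difference between patching and a full update comes down to merging the incoming `data` into the stored one instead of replacing it. A Go sketch of such a merge follows; mergePatch and the second field name fieldFromSource2 are hypothetical and only illustrate two sources writing different keys.

```go
package main

import "fmt"

// mergePatch is a hypothetical helper: it copies every key of patch into
// dst, recursing into nested objects so that sibling fields written by
// another source are preserved.
func mergePatch(dst, patch map[string]interface{}) map[string]interface{} {
	if dst == nil {
		dst = map[string]interface{}{}
	}
	for key, value := range patch {
		patchChild, patchIsMap := value.(map[string]interface{})
		dstChild, dstIsMap := dst[key].(map[string]interface{})
		if patchIsMap && dstIsMap {
			dst[key] = mergePatch(dstChild, patchChild)
			continue
		}
		dst[key] = value
	}
	return dst
}

func main() {
	// Asset as stored after source 1 wrote its field.
	asset := map[string]interface{}{
		"urn":     "some-urn",
		"type":    "table",
		"service": "bigquery",
		"data":    map[string]interface{}{"fieldFromSource1": "some-value"},
	}
	// Patch coming from source 2 touches a different field only.
	patch := map[string]interface{}{
		"data": map[string]interface{}{"fieldFromSource2": "other-value"},
	}
	fmt.Println(mergePatch(asset, patch)["data"])
}
```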

feat: add user aware information

Is your feature request related to a problem? Please describe.
We want to introduce user aware information in columbus.

Describe the solution you'd like
As a starting point, we need to store user information. In Columbus, we already have DB storage. We could store user information in the DB and introduce our own internal user id instead of using the user id from an external system. To support integration with external systems, we just store a one-to-one mapping between internal and external user ids by creating a new User table as an adapter. We can use email as the external unique identifier. This will make Columbus less coupled to external systems. To make table lookups faster, we could create an index on the email column.

user	data type	sample value
id (PK)	UUID	11234-4214214
email (UNIQUE)	STRING	[email protected]
provider	STRING	shield
created_at	TIMESTAMP	12345667
updated_at	TIMESTAMP	12345667

Tasks

Create User table & migration
Add a repository function to store the external user id and generate the internal user id
Add a repository function to get the user id given an email

feat: add starring resources user starred API

Is your feature request related to a problem? Please describe.
I want to star/bookmark a resource so I can revisit it later, and I want to be able to fetch all of my starred resources.

Describe the solution you'd like

User Specific API

GET /v1/user/starred (List assets starred by the user)
- Query Params:
  - sort={created/updated}
  - direction={asc/desc}
  - page=1
  - size=10
- Header Param:
  - Email
- Response
  - 200, OK
  - 400, missing header param
GET /v1/user/starred/{asset_id} (Check if an asset is starred by a user)
- Header Param:
  - Email
- Response
  - 200, asset is starred by the user
  - 400, missing header param
  - 404, if asset not starred by user
PUT /v1/user/starred/{asset_id} (Star asset for a user)
- Header Param:
  - Email
- Response
  - 204, Ok
  - 400, missing header param
  - 404, if asset not found
DELETE /v1/user/starred/{asset_id} (Unstar an asset for a user)
- Header Param:
  - Email
- Response Status
  - 204, Ok
  - 400, missing header param
  - 404, if asset not found

feat: add starring resources generic API

Is your feature request related to a problem? Please describe.
I want to star/bookmark a resource so I can revisit it later, and I want to be able to fetch all of my starred resources.

Describe the solution you'd like

Generic API

GET /v1/assets/{asset_id}/stargazers (List stargazers)
- Query Params:
  - page=1
  - size=10 *page & size will follow columbus convention
- Response
  - 200, OK
GET /v1/users/{user-id}/starred (List assets starred by some user)
- Query Params:
  - sort={created/updated}
  - direction={asc/desc}
  - page=1
  - size=10
- Response
  - 200, OK
  - 400, missing header param

Error when run make on windows

Trying to compile columbus with Go, we got the following errors for each proxy configuration we tried.

go version
go version go1.16.5 windows/amd64

go env GOPROXY GONOPROXY
https://proxy.golang.org,direct

cd path\to\columbus

make
go build -ldflags "-X main.Version=" "github.com/odpf/columbus"
go: github.com/PaesslerAG/jsonpath@v0.1.1: Get "https://proxy.golang.org/github.com/%21paessler%21a%21g/jsonpath/@v/v0.1.1.mod": read tcp 10.20.3.143:61982->64.233.177.141:443: wsarecv: An existing connection was forcibly closed by the remote host.
make: *** [Makefile:10: build] Error 1

go env -w GONOPROXY=https://github.com
go env GOPROXY GONOPROXY
https://proxy.golang.org,direct
https://github.com

make
go build -ldflags "-X main.Version=" "github.com/odpf/columbus"
go: github.com/PaesslerAG/jsonpath@v0.1.1: Get "https://proxy.golang.org/github.com/%21paessler%21a%21g/jsonpath/@v/v0.1.1.mod": read tcp 10.20.3.143:62019->64.233.177.141:443: wsarecv: An existing connection was forcibly closed by the remote host.
make: *** [Makefile:10: build] Error 1

go env -w GOPROXY=direct
go env GOPROXY GONOPROXY
direct
https://github.com

make
go build -ldflags "-X main.Version=" "github.com/odpf/columbus"
go: github.com/PaesslerAG/jsonpath@v0.1.1: reading github.com/PaesslerAG/jsonpath/go.mod at revision v0.1.1: unknown revision v0.1.1
make: *** [Makefile:10: build] Error 1

go env -u GONOPROXY
go env -w GOPROXY=http://my.company.proxy:port

make
go build -ldflags "-X main.Version=" "github.com/odpf/columbus"
go: github.com/PaesslerAG/jsonpath@v0.1.1: reading http://my.comany.proxy:port/github.com/%21paessler%21a%21g/jsonpath/@v/v0.1.1.mod: 403 Forbidden
make: *** [Makefile:10: build] Error 1

feat: improve elasticsearch integration test

Is your feature request related to a problem? Please describe.
The existing Elasticsearch integration test requires us to spin up Docker on our local machine and set the ES_TEST_SERVER_URL config to the Elasticsearch host.

If we don't set the ES_TEST_SERVER_URL config, every time we run the Elasticsearch integration test it automatically creates a new Elasticsearch instance, but once the test is done the container is left behind and doesn't get cleaned up.

Describe the solution you'd like

If we don't set the ES_TEST_SERVER_URL config, we need to clean up the Elasticsearch container after the integration test
Perhaps we could try using dockertest

feat(lint): add `golangci-lint` to the workflow

Is your feature request related to a problem? Please describe.
This is related to issue #38. Once we have fixed all lint warnings/errors, we need to add a lint workflow to automate lint checking.

Describe the solution you'd like
Use golangci-lint with default linters

feat: add API to get versioned metadata/assets

Is your feature request related to a problem? Please describe.
Once we have versioned metadata in #45, we need some APIs to get the versioned metadata/assets.

Describe the solution you'd like

Get all versions of an asset
- GET /v1/assets/{asset_id}/versions
- Query Params:
  - offset=1
  - size=10
- Response Status
  - 200, OK
  - 400, missing header param
Get a specific version of an asset
- GET /v1/assets/{asset_id}/versions/{version_num}
- Response Status
  - 200, OK
  - 400, missing header param

feat: track record's activities/logs

Is your feature request related to a problem? Please describe.
I want to see all activities done on a record, such as creation, update, bookmarking, and issue creation.

Describe the solution you'd like
Compass should store all of a record's activities.

Add 5 mins timeout to golangci-lint check

Is your feature request related to a problem? Please describe.
There are intermittent errors when running the GitHub Actions lint job with golangci-lint, caused by timeouts. The default timeout is 1m0s. We need to increase the timeout to give the linter extra time to work.

Describe the solution you'd like
Add a timeout flag to the golangci-lint script

golangci-lint run --timeout 5m

Fix badge.svg not found

Describe the bug
badge.svg is not found in the README file

To Reproduce
In https://github.com/odpf/columbus, the badge.svg image is broken

Expected behavior
In https://github.com/odpf/columbus, badge.svg image is shown

feat: comments of a discussion

Is your feature request related to a problem? Please describe.
Comments are needed for the discussion feature described in #47

Describe the solution you'd like

Create comments API
Add comments repository

Response Schema

{
  "id": 1,
  "body": "body here",
  "owner": {
    "id": "1234-5678",
    "Email": "[email protected]"
  }, // User
  "created_at": timestamp,
  "updated_at": timestamp
}

DB Schema

Column Name	Type	Example
id (PK)	serial	1
discussion_id (FK)	serial	1
body	text	This body could be written in markdown format
owner	uuid	1234-5678-9123
created_at	timestamp
updated_at	timestamp

deprecate PUT /v1beta1/assets

Is your feature request related to a problem? Please describe.
Ingesting a new asset is currently possible with two APIs: PUT /v1beta1/assets and PATCH /v1beta1/assets. The difference is that the PATCH API supports per-field patching while the PUT API overrides the existing data. Currently we don't really need the PUT behaviour, so we could remove it to avoid maintenance overhead.

Describe the solution you'd like
Remove PUT /v1beta1/assets API

Additional context
We also need to remove the API definition in proton too. Right now it is fine to remove it in proton since we are still in the development phase.

refactor: clean up dynamic types

Is your feature request related to a problem? Please describe.
Currently, a Type is a resource in Columbus. This means that types are dynamic: users can create any type they want and put records under it. We want to enforce Columbus's own types (Table, Dashboard, Topic, Job) since most metadata falls into those categories.

Describe the solution you'd like
We need to change type from a resource to an enum/hardcoded type instead.

Detailed Tasks

Add an Elasticsearch migration to the migrate command to create indices for all of our available types
Change Type to enum
Remove types CRUD APIs

refactor(api): rename record/resource term to asset(s) and update API

Is your feature request related to a problem? Please describe.
In Columbus, record, resource, and asset all refer to the same thing, and mixing these terms is sometimes confusing. We need to make this consistent and make sure every piece of metadata in Columbus is called an asset.

Describe the solution you'd like
Update all record/resource terms to asset(s)

Tasks

Refactor API
- from GET /v1/types/{type_name}/records to GET /v1/assets?type={type_name}
- from PUT /v1/types/{type_name}/records to PUT /v1/assets with body {"type": type_name, "metadata": records}

Changing path of /user API to /me

Is your feature request related to a problem? Please describe.
We currently have APIs to get resources belonging to the caller (user): /v1beta1/user/starring and /v1beta1/user/discussions.
But in guardian the API is v1beta1/me. I think we can make this consistent across the org by updating the columbus API from /user to /me.

Describe the solution you'd like
Updating columbus API from /user to /me

feat(api): update to support user aware information

Is your feature request related to a problem? Please describe.
Columbus does not have its own dedicated register/login flow to manage users. Regardless, we need to store information to develop user awareness features.

Describe the solution you'd like
There is one entry point where we store the external user id (email) and generate the internal user id for the first time. For all APIs, we could accept an identity header (configurable via config YAML, e.g. Columbus-User-Email) whose value is the external user identity (e.g. email). If the user does not exist in our DB yet, it is created automatically.

Tasks

Update API
Require identity header for external user id
Return 400 if identity header does not exist
Auto create user if identity header exists and user is not in DB
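The tasks above map naturally onto an HTTP middleware. The Go sketch below uses an in-memory map in place of the users table; the names userStore, withUser, and call, the internal id format, and the error message are assumptions made for illustration, and the header name is the example given in this issue.

```go
package main

import (
	"context"
	"fmt"
	"net/http"
	"net/http/httptest"
	"sync"
)

// Example header name from the issue; configurable in practice.
const identityHeader = "Columbus-User-Email"

type ctxKey struct{}

// userStore is an in-memory stand-in for the users table.
type userStore struct {
	mu  sync.Mutex
	ids map[string]string // external id (email) -> internal id
}

// getOrCreate returns the internal id, creating the user on first sight.
func (s *userStore) getOrCreate(email string) string {
	s.mu.Lock()
	defer s.mu.Unlock()
	if id, ok := s.ids[email]; ok {
		return id
	}
	id := fmt.Sprintf("user-%d", len(s.ids)+1)
	s.ids[email] = id
	return id
}

// withUser returns 400 when the identity header is missing and otherwise
// stores the internal user id in the request context.
func withUser(store *userStore, next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		email := r.Header.Get(identityHeader)
		if email == "" {
			w.Header().Set("Content-Type", "application/json")
			w.WriteHeader(http.StatusBadRequest)
			fmt.Fprint(w, `{"reason":"identity header is required"}`)
			return
		}
		ctx := context.WithValue(r.Context(), ctxKey{}, store.getOrCreate(email))
		next.ServeHTTP(w, r.WithContext(ctx))
	})
}

// call sends one request through the middleware and returns the status code.
func call(store *userStore, email string) int {
	handler := withUser(store, http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprint(w, r.Context().Value(ctxKey{}))
	}))
	req := httptest.NewRequest(http.MethodGet, "/v1beta1/assets", nil)
	if email != "" {
		req.Header.Set(identityHeader, email)
	}
	rec := httptest.NewRecorder()
	handler.ServeHTTP(rec, req)
	return rec.Code
}

func main() {
	store := &userStore{ids: map[string]string{}}
	fmt.Println(call(store, ""))                 // header missing
	fmt.Println(call(store, "user@example.com")) // user is auto-created
	fmt.Println(len(store.ids))
}
```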

Integrate coveralls to track and manage code coverage

Is your feature request related to a problem? Please describe.
Integrate coveralls to track and manage code coverage

Describe the solution you'd like
Integrate coveralls to track coverage and add coverage badge in README.md

Tag template to use Asset ID instead of Record Type and Record URN

Is your feature request related to a problem? Please describe.
The existing implementation of the tag-template feature in columbus still uses the old approach (record), with record type and record urn as the main resource identity. But now we call a resource in columbus an asset, and it has a single identity called asset id. This makes the tag-template feature inconsistent with other features in columbus.

Describe the solution you'd like
Migrate the identity used by tag-template from record type and record urn to asset id

Additional context
Some changes that are required are

API

Description	From	To
Create an asset tag	`POST /v1beta1/tags/`	`POST /v1beta1/tags/assets`
Get,Update,Delete a tag of an asset	`GET/PUT/DELETE /v1beta1/tags/types/{type}/records/{record_urn}/templates/{template_urn}`	`GET/PUT/DELETE /v1beta1/tags/assets/{asset_id}/templates/{template_urn}`
Get all tags of an asset	`GET /v1beta1/tags/types/{type}/records/{record_urn}`	`GET /v1beta1/tags/assets/{asset_id}`

PR in proton is here

DB Columns

Remove record_urn and record_type
Add asset_id with text type
Update tags_idx_record_urn_record_type_field_id to tags_idx_asset_id_field_id

Others

Update Tag Domain
Update Tag Model in Repository
Update Tag Service
Update Tag Handler

avoid conflicting data type in properties attributes field

Describe the bug
Elasticsearch is silently dropping ingested metadata when there is a data type conflict in properties.attributes field. Columbus returns 200 but the metadata is not ingested.

To Reproduce
Steps to reproduce the behavior:

Create a new type/index called dashboard
Ingest new data into the dashboard index with the properties.attributes.id field as an integer, e.g. 1234
Check Elasticsearch and verify that the data was ingested properly
Ingest more new data into the dashboard index with the properties.attributes.id field as a string, e.g. "df431-54abf42-xxxx"
Check Elasticsearch and verify that the second document was not ingested

Expected behavior
Ingesting different kinds of metadata that have the same type should always succeed.

Additional context
This happened when we tried to ingest metabase and tableau dashboard data. We ingested metabase metadata first, then tableau data. The tableau data was not ingested into Elasticsearch because its id is a string while the metabase id is a long.

Accept email in URL param in API `GET /users/{email}/stargazers` [Temporary]

Is your feature request related to a problem? Please describe.
An API keyed by username, GET /users/{username}/stargazers, would be the more proper approach. But username is still WIP, so for temporary usage we could accept email as the identity in the API.

Describe the solution you'd like

Create a new function to Get Assets By User Email
Update API to accept email

feat: discussion feature

Is your feature request related to a problem? Please describe.
I want to create issues on a resource so that other users can see it.

Describe the solution you'd like

CRUD Issue apis for a resource

refactor(api): refactor v1 API to v1beta1

Is your feature request related to a problem? Please describe.
We need to update all Columbus v1 APIs to v1beta1 for better versioning and consistency with the assets version in proton.

Describe the solution you'd like
Update all columbus v1 to v1beta1

refactor: use auto generate mocks

Is your feature request related to a problem? Please describe.
Currently meteor still uses manually generated mocks, and creating a new mock takes effort that could easily be avoided with auto-generated mocks.

Describe the solution you'd like
Use testify/mockery to auto generate mocks.

Renaming `columbus` to `compass`

Is your feature request related to a problem? Please describe.
We have come to the conclusion to rename columbus to compass. There are several changes that we need to make.

Describe the solution you'd like

odpf/columbus

odpf/charts

columbus chart name
columbus image

odpf/homebrew-taps

Add new compass formula

feat: add service to record

add Service field to RecordV2 to store metadata source e.g. bigquery, kafka, etc

feat(discovery): boost score using resource fields

Is your feature request related to a problem? Please describe.
I have an integer field in my resources called total_usage. I want resources with a higher total_usage value to get a higher score so they show up at the top when searching via /v1/search.

Describe the solution you'd like
Allow boosting score using fields on the resource in Search API (/v1/search).

feat: allow tagging resource

Is your feature request related to a problem? Please describe.
I need the ability to tag a resource to give more context to a resource. e.g. tagging a resource with a deprecated/sensitive tag.

Describe the solution you'd like
https://github.com/odpf/dexter has a good tagging feature; it would be nice if Columbus had a similar one.

feat: versioning of metadata

Is your feature request related to a problem? Please describe.
I have a record that contains a schema whose changes I would like to keep track of. This would allow me to see how my schema changes over time.

Describe the solution you'd like
I want Columbus to version records whenever there are any changes to them, and to allow users to fetch and see all the previous versions.

refactor: move discovery context from main record model to its own package

Is your feature request related to a problem? Please describe.
Right now the discovery context is tightly coupled with the Record model. This could get complex fast if we want to add more features to a Record later, and the next one coming is the tagging feature from Dexter.

Describe the solution you'd like
Move discovery to its own package that depends on the Record package. This would keep the record clean and decoupled from the discovery context.

feat(lineage): not using discovery data to build lineage graph

Is your feature request related to a problem? Please describe.
Right now booting takes lots of memory and time because Columbus fetches all assets from Discovery data, builds the lineage graph from them, then stores the graph in memory.

Describe the solution you'd like
Use a proper lineage storage to avoid building and storing lineage in memory. It is possible to use Neo4j or even postgres for this.

Enrich features of comments' discussion

Is your feature request related to a problem? Please describe.
The comments feature in discussions is currently just a simple list of text related to the respective discussion. We can add more features to comments to give a better user interaction and experience.

Describe the solution you'd like
Add more features to the comments

Comment Replies: it is possible for each comment in the discussion to have child comments as replies.
Comment Reaction (only parent comment that could have this feature)
- Upvote
- Emoji
Filter comments by Top (based on votes) or Since (yesterday, last week, last month)
If it is a q&a type,
- users are able to suggest a comment as an answer
- columbus could mark the comment as an answer

remove gorm library

Is your feature request related to a problem? Please describe.
Currently we are using GORM for:

ORM
Database Migration

It works fine at the moment, but right now we depend on GORM auto migrate for DB migration, and if in the future we want to change our postgres client, it would be harder to switch, especially if we have lots of tables to migrate.

Describe the solution you'd like
For Database Migration, we can use https://github.com/golang-migrate/migrate
For ORM, we can use the simple https://github.com/jmoiron/sqlx (a thin extension of Go's database/sql) with https://github.com/jackc/pgx as its postgres driver

feat: add starring repository

Is your feature request related to a problem? Please describe.
I want to star/bookmark a resource so I can revisit it later, and I want to be able to fetch all of my starred resources.

Describe the solution you'd like

We need a new table in DB with this schema

starring	type	Sample Value
id (PK)	SERIAL	1
user_id (FK)	UUID	11234-4214214
asset_id (FK)	UUID	11234-4214214
created_at	TIMESTAMP	12345667
updated_at	TIMESTAMP	12345667

We need a new starring repository layer

feat(discovery): search based on user's past behaviour (table usage)

Is your feature request related to a problem? Please describe.
I want to be able to search based on the user's past behaviour (table usage). The more a table is used, the more relevant it should be in the search results.

Describe the solution you'd like
Factoring table usage into relevancy only makes sense when searching in the table type context. Therefore, this feature only works on the table index/resource and wouldn't apply in the universal/global context (where we search across all indices).

There are 2 possible ways to factor table usage into relevancy: implicit or explicit.

Implicit
Every time users search within the table index, we always consider table usage as part of relevancy (e.g. always count it as a boosting value).

Explicit
We could give users control over whether table usage counts toward relevancy each time they search within the table index, by adding a new query param to the Search API. The new query param could be called ~~sortby~~ rankby, with the value usage.

Proposed Decision
For now, we could go with the Explicit one, giving users control over whether table usage counts toward relevancy.
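A Go sketch of how the explicit option might shape the Elasticsearch query: when `rankby=usage` is passed, the text query is wrapped in a `function_score` with a `field_value_factor`. The helper name buildTableQuery, the usage field name `usage_count`, and the `log1p`/`sum` settings are assumptions for illustration only.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// buildTableQuery is a hypothetical helper. It returns a plain text query
// unless rankBy is "usage", in which case the score is boosted by the
// numeric field named in usageField.
func buildTableQuery(text, rankBy, usageField string) map[string]interface{} {
	match := map[string]interface{}{
		"multi_match": map[string]interface{}{
			"query":  text,
			"fields": []string{"*"},
		},
	}
	if rankBy != "usage" {
		return map[string]interface{}{"query": match}
	}
	return map[string]interface{}{
		"query": map[string]interface{}{
			"function_score": map[string]interface{}{
				"query": match,
				"field_value_factor": map[string]interface{}{
					"field":    usageField,
					"modifier": "log1p", // dampen very large usage counts
					"missing":  0,       // documents without usage data get no boost
				},
				// Add the usage factor to the text score instead of multiplying,
				// so unused tables keep their text relevance.
				"boost_mode": "sum",
			},
		},
	}
}

func main() {
	body, err := json.MarshalIndent(buildTableQuery("orders", "usage", "usage_count"), "", "  ")
	if err != nil {
		panic(err)
	}
	fmt.Println(string(body))
}
```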

feat: add nesting query in assets api filters

Is your feature request related to a problem? Please describe.
The Columbus v1beta1 assets API should support nested query params in data and query filters.

Describe the solution you'd like
Columbus should support nested queries in data and query filtering:
For Data Filter : data[entity.properties.landscape]=internal
For Query Filter : q=internal&q_fields=data.entity.properties.landscape
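The data filter syntax above could be parsed and applied along these lines. This Go sketch is illustrative only: parseDataFilters, lookup, and matches are hypothetical helpers, and matching is done in memory against a decoded asset rather than in the storage query.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// parseDataFilters extracts data[<dotted.path>]=<value> pairs from a raw
// query string and returns them keyed by the dotted path.
func parseDataFilters(rawQuery string) (map[string]string, error) {
	values, err := url.ParseQuery(rawQuery)
	if err != nil {
		return nil, err
	}
	filters := map[string]string{}
	for key, vals := range values {
		if strings.HasPrefix(key, "data[") && strings.HasSuffix(key, "]") && len(vals) > 0 {
			filters[key[len("data["):len(key)-1]] = vals[0]
		}
	}
	return filters, nil
}

// lookup walks a dotted path through nested maps.
func lookup(data map[string]interface{}, path string) (interface{}, bool) {
	var current interface{} = data
	for _, part := range strings.Split(path, ".") {
		obj, ok := current.(map[string]interface{})
		if !ok {
			return nil, false
		}
		if current, ok = obj[part]; !ok {
			return nil, false
		}
	}
	return current, true
}

// matches reports whether every filter equals the value found at its path.
func matches(data map[string]interface{}, filters map[string]string) bool {
	for path, want := range filters {
		got, ok := lookup(data, path)
		if !ok || fmt.Sprint(got) != want {
			return false
		}
	}
	return true
}

func main() {
	filters, err := parseDataFilters("data[entity.properties.landscape]=internal")
	if err != nil {
		panic(err)
	}
	assetData := map[string]interface{}{
		"entity": map[string]interface{}{
			"properties": map[string]interface{}{"landscape": "internal"},
		},
	}
	fmt.Println(filters, matches(assetData, filters))
}
```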

Out of scope
Filtering array data in data field
e.g.

"data": {
   "key1": ["value1", "value2", "value3"]
}

update docs

Is your feature request related to a problem? Please describe.
A lot of changes have happened in this repo and our docs are outdated. We need to update the docs to make them relevant to our current features.

Describe the solution you'd like
Update columbus docs

gitbook/compass does not point to the correct docs

Describe the bug
odpf.gitbook.io/compass/ does not show the correct compass docs but redirects requests to https://odpf.gitbook.io/raccoon/compass instead.

To Reproduce
Steps to reproduce the behavior:

Go to compass
Click on gitbook link below About section

Expected behavior
The link https://odpf.gitbook.io/compass/ should show the correct gitbook compass docs

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Need @ravisuhag help to check this 🙏🏼

feat(discovery): auto suggestion (search-as-you-type) on search

Is your feature request related to a problem? Please describe.
I want columbus to support auto suggestions (search as you type)

Describe the solution you'd like

Here we are separating the problem into two: API Changes and Search Logic

API Changes

Approach 1

Define a new API

/suggestion/?text=<text>

and return a list of suggested data with the fields urn, name, type, and service.

Approach 2

Use existing search API with a new boolean query param called suggestion

/search/?text=<text>&suggestion=true

Columbus will return a list of suggested data with the fields urn, name, type, and service if suggestion=true; otherwise it will return the search results as is.

Search Logic

The search logic for suggestions determines how relevant our suggestions are. We are free to pick any of these approaches for now, and we can optimize and iterate later. Reference.

Approach 1

Term suggester. Provides "similar" terms based on edit distance. Suggestions are based on data in the index, and there are a lot of knobs to tune.

Approach 2

Phrase suggester. Very similar to the term suggester, but it takes the whole phrase into account.

Approach 3

Completion suggester, or search-as-you-type functionality. Where the first two provide something like did-you-mean functionality or spellchecking based on the actual terms in the index, this one should "show" some 5 or 10 relevant docs while the user is typing. For this one you need to manually index a field of suggestion type, on which ES will later do a fast lookup.

Approach 4

Context suggester. A continuation of the completion suggester that takes into account some context about where the user is coming from (e.g. geo), or lets the engine boost some company over another, for example because they paid for it. In this case you also need to manually index additional data.

Proposed Approach

Approach 2 for API changes. Adding a new query param is simpler to deliver for now. We can default it to false.
Approach 2 for Search Logic. The phrase suggester captures the whole text in a field and not just a single term/word. It is simpler than the context/completion suggester, and we can improve on it going forward.
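A Go sketch of the proposed combination: parse the boolean `suggestion` param (defaulting to false) and, when it is true, build an Elasticsearch phrase suggester request body. The helper names, the suggestion label `asset_suggest`, and the choice of `name` as the suggested field are assumptions for illustration.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strconv"
)

// parseSuggestion reads the proposed `suggestion` query param.
// An empty value means the param was not sent, which defaults to false.
func parseSuggestion(raw string) (bool, error) {
	if raw == "" {
		return false, nil
	}
	return strconv.ParseBool(raw)
}

// buildPhraseSuggest builds a phrase suggester request body for one field.
func buildPhraseSuggest(text, field string, size int) map[string]interface{} {
	return map[string]interface{}{
		"suggest": map[string]interface{}{
			"text": text,
			"asset_suggest": map[string]interface{}{ // hypothetical label
				"phrase": map[string]interface{}{
					"field": field,
					"size":  size,
				},
			},
		},
	}
}

func main() {
	enabled, err := parseSuggestion("true")
	if err != nil {
		panic(err)
	}
	if enabled {
		body, err := json.MarshalIndent(buildPhraseSuggest("bigquery ordrs", "name", 5), "", "  ")
		if err != nil {
			panic(err)
		}
		fmt.Println(string(body))
	}
}
```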

feat: improve user information in columbus

Is your feature request related to a problem? Please describe.
Currently we rely on email as the external identifier. The external identifier is used in API URL path params. Since email is PII, it is not appropriate to pass it in path params.

Describe the solution you'd like
Instead of an email, we can rely on a more generic identifier like a uuid. We still collect email in columbus, but it is not the primary external user identifier.

Changes needed are:

User Repository
- Migration file: add a new column called uuid with type text and nullable, and create an index for uuid
User Service
- Validation flow logic
  - Get uuid from user id header (required), Get email from user email header (required)
  - Check uuid in DB
    - if not exist, check if email exist
      - if exist, update uuid of the email
      - if not exist, create a new row
    - if exist, check if fetched email is empty
      - if empty, update email
      - if not empty, continue
When Upserting Asset
- insert asset owner (if not exist) to a new row in users table with null uuid
Users API (API to look up other users)
- accept external uuid
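The validation flow listed above can be sketched in Go against an in-memory store. The type and method names are hypothetical, an empty string stands in for a NULL uuid, and "fetched email is empty" is read here as the email stored on the row found by uuid, which is an interpretation rather than something this issue states.

```go
package main

import (
	"errors"
	"fmt"
)

type user struct {
	UUID  string // empty string stands in for NULL
	Email string
}

// userStore is an in-memory stand-in for the users table.
type userStore struct{ users []*user }

func (s *userStore) findByUUID(uuid string) *user {
	for _, u := range s.users {
		if uuid != "" && u.UUID == uuid {
			return u
		}
	}
	return nil
}

func (s *userStore) findByEmail(email string) *user {
	for _, u := range s.users {
		if email != "" && u.Email == email {
			return u
		}
	}
	return nil
}

// ensureUser follows the validation flow: look up by uuid first, fall back
// to email, and create a new row only when neither exists.
func (s *userStore) ensureUser(uuid, email string) (*user, error) {
	if uuid == "" || email == "" {
		return nil, errors.New("user id and user email headers are required")
	}
	if u := s.findByUUID(uuid); u != nil {
		if u.Email == "" {
			u.Email = email // uuid known, email missing: fill it in
		}
		return u, nil
	}
	if u := s.findByEmail(email); u != nil {
		u.UUID = uuid // e.g. an asset owner inserted earlier with a null uuid
		return u, nil
	}
	u := &user{UUID: uuid, Email: email}
	s.users = append(s.users, u)
	return u, nil
}

func main() {
	// Row created while upserting an asset: owner email only, null uuid.
	store := &userStore{users: []*user{{Email: "owner@example.com"}}}
	u, err := store.ensureUser("uuid-1", "owner@example.com")
	if err != nil {
		panic(err)
	}
	fmt.Println(u.UUID, u.Email, len(store.users))
}
```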

bug: return proper payload when hitting an unregistered route

Describe the bug
Whenever user hits endpoint that is not registered, Columbus returns

status = 404
content-type = "text/plain"
body = "404 page not found"

This does not align with Columbus error body payload which is

{
  "reason": "some-reason"
}

To Reproduce
Steps to reproduce the behavior:

Hit /v1/nonexisting-api
Check response

Expected behavior
Return below payload when hitting non existing api

{
  "reason": "Route not found"
}
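One way to get this behaviour with Go's standard library is to register a catch-all handler that writes the JSON error body. The sketch below uses net/http's ServeMux as an assumption; the router Columbus actually uses may expose a different hook for this, and the /ping route and probe helper are hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
	"net/http/httptest"
	"strings"
)

// notFound writes the error payload in the same shape as other Columbus errors.
func notFound(w http.ResponseWriter, r *http.Request) {
	w.Header().Set("Content-Type", "application/json")
	w.WriteHeader(http.StatusNotFound)
	json.NewEncoder(w).Encode(map[string]string{"reason": "Route not found"})
}

func newMux() *http.ServeMux {
	mux := http.NewServeMux()
	// Hypothetical registered route, only here for contrast.
	mux.HandleFunc("/ping", func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprint(w, "pong")
	})
	// "/" matches every path that no other pattern matches.
	mux.HandleFunc("/", notFound)
	return mux
}

// probe sends a GET request to the mux and returns status, content type, and body.
func probe(path string) (int, string, string) {
	rec := httptest.NewRecorder()
	newMux().ServeHTTP(rec, httptest.NewRequest(http.MethodGet, path, nil))
	return rec.Code, rec.Header().Get("Content-Type"), strings.TrimSpace(rec.Body.String())
}

func main() {
	code, contentType, body := probe("/v1/nonexisting-api")
	fmt.Println(code, contentType, body)
}
```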

feat: allow disregarding size when fetching list of assets

Is your feature request related to a problem? Please describe.
I want to fetch all assets that I have created. Right now [GET] /v1beta1/assets has a default size value of 20. I can pass a big number for now, but I don't think that is scalable.

Describe the solution you'd like
Make [GET] /v1beta1/assets disregard the size limit if size is not given.

feat: user/discussions API

Is your feature request related to a problem? Please describe.
The discussions feature is already implemented in #47. We need to add a users API that lists all discussions related to a specific user (recognized by the User email header).

Describe the solution you'd like

Add user's discussions API /user/discussions
