minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	364d3a0ac9	fix: new staticheck and linter issues reported (#19340 )	2024-03-27 08:10:40 -07:00
Poorna	8bce123bba	fix: precondition check for multipart with existing object replication (#19349 )	2024-03-26 15:10:45 -07:00
Harshavardhana	0a56dbde2f	allow configuring inline shard size value (#19336 )	2024-03-26 15:06:19 -07:00
Klaus Post	7ff4164d65	Fix races in IAM cache lazy loading (#19346 ) Fix races in IAM cache Fixes #19344 On the top level we only grab a read lock, but we write to the cache if we manage to fetch it. `a03dac41eb/cmd/iam-store.go (L446)` is also flipped to what it should be AFAICT. Change the internal cache structure to a concurrency safe implementation. Bonus: Also switch grid implementation.	2024-03-26 11:12:57 -07:00
Harshavardhana	dc45a5010d	bring back minor DNS cache for k8s setups (#19341 ) k8s as it stands is flaky in DNS lookups, bring this change back such that we can cache DNS atleast for 30secs TTL.	2024-03-26 08:00:38 -07:00
jiuker	4b9192034c	fix: should return when error happend (#19342 )	2024-03-26 07:51:56 -07:00
Harshavardhana	deeadd1a37	fix: convert multiple callers to use toStorageErr(err) correctly (#19339 ) we must attempt to convert all errors at storage-rest-client into StorageErr() regardless of what functionality is being called in, this PR fixes this for multiple callers including some internally used functions.	2024-03-25 23:24:59 -07:00
Sveinn	1fc4203c19	Webhook targets refactor and bug fixes (#19275 ) - old version was unable to retain messages during config reload - old version could not go from memory to disk during reload - new version can batch disk queue entries to single for to reduce I/O load - error logging has been improved, previous version would miss certain errors. - logic for spawning/despawning additional workers has been adjusted to trigger when half capacity is reached, instead of when the log queue becomes full. - old version would json marshall x2 and unmarshal 1x for every log item. Now we only do marshal x1 and then we GetRaw from the store and send it without having to re-marshal.	2024-03-25 09:44:20 -07:00
Poorna	7fd76dbbb7	fix batch snowball to close channel after listing finishes (#19316 ) panic seen due to premature closing of slow channel while listing is still sending or list has already closed on the sender's side: ``` panic: close of closed channel goroutine 13666 [running]: github.com/minio/minio/internal/ioutil.SafeClose[...](0x101ff51e4?) /Users/kp/code/src/github.com/minio/minio/internal/ioutil/ioutil.go:425 +0x24 github.com/minio/minio/cmd.(erasureServerPools).Walk.func1() /Users/kp/code/src/github.com/minio/minio/cmd/erasure-server-pool.go:2142 +0x170 created by github.com/minio/minio/cmd.(erasureServerPools).Walk in goroutine 1189 /Users/kp/code/src/github.com/minio/minio/cmd/erasure-server-pool.go:1985 +0x228 ```	2024-03-21 16:13:43 -07:00
Krishnan Parthasarathi	da81c6cc27	Encode dir obj names before expiration (#19305 ) Object names of directory objects qualified for ExpiredObjectAllVersions must be encoded appropriately before calling on deletePrefix on their erasure set. e.g., a directory object and regular objects with overlapping prefixes could lead to the expiration of regular objects, which is not the intention of ILM. ``` bucket/dir/ ---> directory object bucket/dir/obj-1 ``` When `bucket/dir/` qualifies for expiration, the current implementation would remove regular objects under the prefix `bucket/dir/`, in this case, `bucket/dir/obj-1`.	2024-03-21 10:21:35 -07:00
Harshavardhana	a03dac41eb	use retry during policy reload from drives (#19307 )	2024-03-21 10:19:50 -07:00
Shireesh Anjal	55778ae278	fix: peer addr returned as empty string (#19308 ) In handlers related to health diagnostics e.g. CPU, Network, Partitions, etc, globalMinioHost was being passed as the addr, resulting in empty value for the same in the health report. Using globalLocalNodeName instead fixes the issue.	2024-03-21 10:19:14 -07:00
Poorna	d990661d1f	replication: enforce precondition for multipart (#19306 )	2024-03-20 18:12:37 -07:00
Harshavardhana	280526caf7	add IAM policyDB lookup fallbacks to drives (#19302 ) IAM loading is a lazy operation, allow these fallbacks to be in place when we cannot find in-memory state(). this allows us to honor the request even if pay a small price for lookup and populating the data.	2024-03-20 09:24:04 -07:00
Harshavardhana	1173b26fc8	avoid triggering heals on metacache files if any (#19299 )	2024-03-19 20:21:15 -07:00
Krishnan Parthasarathi	383489d5d9	Handle zero versions qualified for expiration (#19301 ) When objects have more versions than their ILM policy expects to retain via NewerNoncurrentVersions, but they don't qualify for expiry due to NoncurrentDays are configured in that rule. In this case, applyNewerNoncurrentVersionsLimit method was enqueuing empty tasks, which lead to a panic (panic: runtime error: index out of range [0] with length 0) in newerNoncurrentTask.OpHash method, which assumes the task to contain at least one version to expire.	2024-03-19 20:10:58 -07:00
Anis Eleuch	9370b11684	decom: Fix failed status after a failed decommission (#19300 ) When returning the status of a decommissioned pool, a pool with zero time StartedTime will be considered an active pool, which is unexpected. This commit will always ensure that a pool's canceled/failed/completed status is returned.	2024-03-19 20:09:59 -07:00
Anis Eleuch	235edd88aa	xl: Purge instead of moving to trash with near filled disks (#19294 ) Immediately remove objects from the trash when the disk is 95% full	2024-03-19 13:26:24 -07:00
Anis Eleuch	b5e074e54c	list: Fix IsTruncated and NextMarker when encountering expired objects (#19290 )	2024-03-19 13:23:12 -07:00
Harshavardhana	7213bd7131	add additional logs for the decom during metadata save (#19288 )	2024-03-18 15:25:45 -07:00
Harshavardhana	741de4cf94	fix: add a default requests deadline when deadline is 0 (#19287 )	2024-03-18 12:30:41 -07:00
Harshavardhana	f168ef9989	implement a flag to specify custom crossdomain.xml (#19262 ) fixes #16909	2024-03-17 23:42:40 -07:00
alingse	a0de56abb6	fix: wrong time.Parse params order for replication timestamp (#19279 )	2024-03-17 21:19:43 -07:00
Harshavardhana	c201d8bda9	write anything beyond 4k to be written in 4k pages (#19269 ) we were prematurely not writing 4k pages while we could have due to the fact that most buffers would be multiples of 4k upto some number and there shall be some remainder. We only need to write the remainder without O_DIRECT.	2024-03-15 12:27:59 -07:00
Harshavardhana	93fb7d62d8	allow dynamically changing max_object_versions per object (#19265 )	2024-03-14 18:07:19 -07:00
Harshavardhana	ce1c640ce0	feat: allow retaining parity SLA to be configurable (#19260 ) at scale customers might start with failed drives, causing skew in the overall usage ratio per EC set. make this configurable such that customers can turn this off as needed depending on how comfortable they are.	2024-03-14 03:38:33 -07:00
Anis Eleuch	24b4f9d748	Fix quorum calculation with zero parity objects (#19250 ) Currently, the code relies on object parity to decide whether it is a delete marker or a regular object. In the case of a delete marker, the return quorum is half of the disks in the erasure set. However, this calculation must be corrected with objects with EC = 0, mainly because EC is not a one-time fixed configuration. Though all data are correct, the manifested symptom is a 503 with an EC=0 object. This bug was manifested after we introduced the fast Get Object feature that does not read all data from all disks in case of inlined objects	2024-03-12 12:59:11 -07:00
Harshavardhana	81d7531f1f	only look for valid buckets (#19244 ) fixes #19239	2024-03-12 04:33:30 -07:00
Poorna	b4a23f720e	update build constants (#19243 )	2024-03-11 17:54:37 -07:00
Dennis Marttinen	6c964fede5	Improve handling of compression inclusion for objects (#19234 )	2024-03-11 04:55:34 -07:00
huajin tong	a25a8312d8	fix: some flyby typos in the code (#19212 ) Signed-off-by: thirdkeyword <fliterdashen@gmail.com>	2024-03-10 14:09:36 -07:00
Aditya Manthramurthy	b2c5b75efa	feat: Add Metrics V3 API (#19068 ) Metrics v3 is mainly a reorganization of metrics into smaller groups of metrics and the removal of internal aggregation of metrics received from peer nodes in a MinIO cluster. This change adds the endpoint `/minio/metrics/v3` as the top-level metrics endpoint and under this, various sub-endpoints are implemented. These are currently documented in `docs/metrics/v3.md` The handler will serve metrics at any path `/minio/metrics/v3/PATH`, as follows: when PATH is a sub-endpoint listed above => serves the group of metrics under that path; or when PATH is a (non-empty) parent directory of the sub-endpoints listed above => serves metrics from each child sub-endpoint of PATH. otherwise, returns a no resource found error All available metrics are listed in the `docs/metrics/v3.md`. More will be added subsequently.	2024-03-10 01:15:15 -08:00
Harshavardhana	88a89213ff	make immediate purge non-blocking up to 100,000 entries per drive (#19231 ) make immediate purge non-blocking upto 100000 entries per drive Bonus: turn-off O_DIRECT verification when FSType is 'XFS'	2024-03-09 18:53:48 -08:00
Poorna	8e2238ea09	some more cleanup for startup message (#19229 )	2024-03-08 22:42:32 -08:00
Poorna	31e8f7c525	Small reformatting of startup message (#19228 ) Also changing User-Agent format	2024-03-08 19:07:08 -08:00
Klaus Post	51f62a8da3	Port ListBuckets to websockets layer & some cleanup (#19199 )	2024-03-08 11:08:18 -08:00
Klaus Post	650efc2e96	Fix listing in objects split across pools (#19227 ) Merging same-object - multiple versions from different pools would not always result in correct ordering. When merging keep inputs separate. ``` λ mc ls --versions local/testbucket ------ before ------ [2024-03-05 20:17:19 CET] 228B STANDARD 1f163718-9bc5-4b01-bff7-5d8cf09caf10 v3 PUT hosts [2024-03-05 20:19:56 CET] 19KiB STANDARD null v2 PUT hosts [2024-03-05 20:17:15 CET] 228B STANDARD 73c9f651-f023-4566-b012-cc537fdb7ce2 v1 PUT hosts ------ after ------ λ mc ls --versions local/testbucket [2024-03-05 20:19:56 CET] 19KiB STANDARD null v3 PUT hosts [2024-03-05 20:17:19 CET] 228B STANDARD 1f163718-9bc5-4b01-bff7-5d8cf09caf10 v2 PUT hosts [2024-03-05 20:17:15 CET] 228B STANDARD 73c9f651-f023-4566-b012-cc537fdb7ce2 v1 PUT hosts ```	2024-03-08 09:50:48 -08:00
Harshavardhana	2cc4997d24	fix: crash on 32bit systems during pre-allocation (#19225 )	2024-03-08 05:55:28 -08:00
Poorna	934f6cabf6	sr: use site replicator creds to verify temp user claims (#19224 ) This PR continues #19209 which did not handle claims verification of temporary users created by root in site replication scenario. Fixes: #19217	2024-03-07 14:30:00 -08:00
Anis Eleuch	68dd74c5ab	batch: Separate batch job request and batch job stats (#19205 ) Currently, the progress of the batch job is saved in inside the job request object, which is normally not supported by MinIO. Though there is no apparent bug, it is better to fix this now. Batch progress is saved in .minio.sys/batch-jobs/reports/ Co-authored-by: Anis Eleuch <anis@min.io>	2024-03-07 10:58:22 -08:00
Harshavardhana	48b590e14b	fix: same server to be part of multiple pools (#19216 ) our PoolNumber calculation was costly, while we already had this information per endpoint, we needed to deduce it appropriately. This PR addresses this by assigning PoolNumbers field that carries all the pool numbers that belong to a server. properties.PoolNumber still carries a valid value only when len(properties.PoolNumbers) == 1, otherwise properties.PoolNumber is set to math.MaxInt (indicating that this value is undefined) and then one must rely on properties.PoolNumbers for server participation in multiple pools. addresses the issue originating from #11327	2024-03-07 10:24:07 -08:00
Poorna	837a2a3d4b	sr: use service account cred for claims check (#19209 ) PR #19111 overlaid service account secret with site replicator secret during token claims check. Fixes : #19206	2024-03-06 16:19:24 -08:00
Harshavardhana	74ccee6619	avoid too much auditing during decom/rebalance make it more robust (#19174 ) there can be a sudden spike in tiny allocations, due to too much auditing being done, also don't hang on the ``` h.logCh <- entry ``` after initializing workers if you do not have a way to dequeue for some reason.	2024-03-06 03:43:16 -08:00
Poorna	89f759566c	bucket import: avoid overwriting bucket creation date (#19207 )	2024-03-05 16:05:28 -08:00
Harshavardhana	cd7551031b	fix: a regression in loading replication creds (#19204 ) fixes #19200 generating STS credentials fail with site-replicated setup, with this error on a fresh environment.	2024-03-05 11:06:17 -08:00
Praveen raj Mani	df57bfcd6c	fix: cluster read health check to return proper values (#19203 ) Fixes #19202	2024-03-05 10:25:49 -08:00
Justin Griffin	dfb1f39b57	Support custom endpoint for Azure remote storage tier (#19188 ) This commits adds support for using the `--endpoint` arg when creating a tier of type `azure`. This is needed to connect to Azure's Gov Cloud instance. For example, ``` mc ilm tier add azure TARGET TIER_NAME \ --account-name ACCOUNT \ --account-key KEY \ --bucket CONTAINER \ --endpoint https://ACCOUNT.blob.core.usgovcloudapi.net --prefix PREFIX \ --storage-class STORAGE_CLASS ``` Prior to this, the endpoint was hardcoded to `https://ACCOUNT.blob.core.windows.net`. The docs were even explicit about this, stating that `--endpoint` is: "Required for `s3` or `minio` tier types. This option has no effect for any other value of `TIER_TYPE`." Now, if the endpoint arg is present it will be used. If not, it will fall back to the same default behavior of `ACCOUNT.blob.core.windows.net`.	2024-03-05 08:44:08 -08:00
Harshavardhana	1b5f28e99b	fix: skip local disks properly in cluster health maintenance check (#19184 )	2024-03-04 20:48:44 -08:00
Krishnan Parthasarathi	b69bcdcdc4	Fix ilm config at startup (#19189 ) Remove api.expiration_workers config setting which was inadvertently left behind. Per review comment https://github.com/minio/minio/pull/18926, expiration_workers can be configured via ilm.expiration_workers.	2024-03-04 18:50:24 -08:00
Harshavardhana	e385f54185	fix: nLink is unreliable on all filesystems (#19187 ) ext4, xfs support this behavior however btrfs, nfs may not support it properly. in-case when we see Nlink < 2 then we know that we need to fallback on readdir() fixes a regression from #19100 fixes #19181	2024-03-04 15:58:35 -08:00
Aditya Manthramurthy	9a4d003ac7	Add common middleware to S3 API handlers (#19171 ) The middleware sets up tracing, throttling, gzipped responses and collecting API stats. Additionally, this change updates the names of handler functions in metric labels to be the same as the name derived from Go lang reflection on the handler name. The metric api labels are now stored in memory the same as the handler name - they will be camelcased, e.g. `GetObject` instead of `getobject`. For compatibility, we lowercase the metric api label values when emitting the metrics.	2024-03-04 10:05:56 -08:00
Praveen raj Mani	d5656eeb65	fix: healthcheck to fail even if one erasure set doesn't have quorum (#19180 ) fix: healthcheck to return false even if one erasure set doesn't have quorum	2024-03-04 08:34:14 -08:00
Harshavardhana	6d08af61a0	for root disks add additional information in the error log (#19177 )	2024-03-02 23:45:39 -08:00
Krishnan Parthasarathi	a7577da768	Improve expiration of tiered objects (#18926 ) - Use a shared worker pool for all ILM expiry tasks - Free version cleanup executes in a separate goroutine - Add a free version only if removing the remote object fails - Add ILM expiry metrics to the node namespace - Move tier journal tasks to expiryState - Remove unused on-disk journal for tiered objects pending deletion - Distribute expiry tasks across workers such that the expiry of versions of the same object serialized - Ability to resize worker pool without server restart - Make scaling down of expiryState workers' concurrency safe; Thanks @klauspost - Add error logs when expiryState and transition state are not initialized (yet) * metrics: Add missed tier journal entry tasks * Initialize the ILM worker pool after the object layer	2024-03-01 21:11:03 -08:00
Harshavardhana	325fd80687	add retry logic upto 3 times for policy map and policy (#19173 )	2024-03-01 16:21:34 -08:00
Andreas Auernhammer	09626d78ff	automatically generate root credentials with KMS (#19025 ) With this commit, MinIO generates root credentials automatically and deterministically if: - No root credentials have been set. - A KMS (KES) is configured. - API access for the root credentials is disabled (lockdown mode). Before, MinIO defaults to `minioadmin` for both the access and secret keys. Now, MinIO generates unique root credentials automatically on startup using the KMS. Therefore, it uses the KMS HMAC function to generate pseudo-random values. These values never change as long as the KMS key remains the same, and the KMS key must continue to exist since all IAM data is encrypted with it. Backward compatibility: This commit should not cause existing deployments to break. It only changes the root credentials of deployments that have a KMS configured (KES, not a static key) but have not set any admin credentials. Such implementations should be rare or not exist at all. Even if the worst case would be updating root credentials in mc or other clients used to administer the cluster. Root credentials are anyway not intended for regular S3 operations. Signed-off-by: Andreas Auernhammer <github@aead.dev>	2024-03-01 13:09:42 -08:00
Anis Eleuch	8f03c6e0db	xl: Avoid called getdents for folders in listing (#19100 )	2024-03-01 08:01:28 -08:00
Harshavardhana	2c2f5d871c	debug: introduce support for configuring client connect WRITE deadline (#19170 ) just like client-conn-read-deadline, added a new flag that does client-conn-write-deadline as well. Both are not configured by default, since we do not yet know what is the right value. Allow this to be configurable if needed.	2024-03-01 08:00:42 -08:00
Harshavardhana	c599c11e70	fix: relax metadata checks for healing (#19165 ) we should do this to ensure that we focus on data healing as primary focus, fixing metadata as part of healing must be done but making data available is the main focus. the main reason is metadata inconsistencies can cause data availability issues, which must be avoided at all cost. will be bringing in an additional healing mechanism that involves "metadata-only" heal, for now we do not expect to have these checks. continuation of #19154 Bonus: add a pro-active healthcheck to perform a connection	2024-02-29 22:49:01 -08:00
Aditya Manthramurthy	6769d4dd54	Update API label names for metrics (#19162 ) This change makes the label names consistent with the handler names. This is in preparation to use reflection based API handler function names for the api labels so they will be the same as tracing, auditing and logging names for these API calls.	2024-02-29 16:14:27 -08:00
Harshavardhana	d7520f0ae6	fix: make sure maintenance=true is honored properly (#19156 ) fixes a regression from #18700	2024-02-29 08:37:57 -08:00
Harshavardhana	44b70eb646	allow creating missing parent folders during moveToTrash() (#19155 )	2024-02-29 08:28:33 -08:00
Harshavardhana	467714f33b	ignore x-amz-storage-class when its set to STANDARD (#19154 ) fixes #19135	2024-02-28 17:44:30 -08:00
Harshavardhana	f8696cc8f6	fallback to globalLocalDrives for non-distributed setups	2024-02-28 14:56:08 -08:00
Anis Eleuch	9a7c7ab2d0	fix: parsing v2 and v1 cgroup memory limit (#19153 ) Trim the newline at the end of the sysfs memory limit.	2024-02-28 14:52:20 -08:00
Harshavardhana	51874a5776	fix: allow DNS disconnection events to happen in k8s (#19145 ) in k8s things really do come online very asynchronously, we need to use implementation that allows this randomness. To facilitate this move WriteAll() as part of the websocket layer instead. Bonus: avoid instances of dnscache usage on k8s	2024-02-28 09:54:52 -08:00
Aditya Manthramurthy	62ce52c8fd	cachevalue: simplify exported interface (#19137 ) - Also add cache options type	2024-02-28 09:09:09 -08:00
Anis Eleuch	2bdb9511bd	heal: Add skipped objects to the heal summary (#19142 ) New disk healing code skips/expires objects that ILM supposed to expire. Add more visibility to the user about this activity by calculating those objects and print it at the end of healing activity.	2024-02-28 09:05:40 -08:00
Harshavardhana	9a012a53ef	initialize the disk healer early on (#19143 ) This PR fixes a bug that perhaps has been long introduced, with no visible workarounds. In any deployment, if an entire erasure set is deleted, there is no way the cluster recovers.	2024-02-27 23:02:14 -08:00
Harshavardhana	1dd8ef09a6	remove unnecessary 'recreate' code (#19136 )	2024-02-27 01:47:58 -08:00
Poorna	b1351e2dee	sr: use site replicator svcacct to sign STS session tokens (#19111 ) This change is to decouple need for root credentials to match between site replication deployments. Also ensuring site replication config initialization is re-tried until it succeeds, this deoendency is critical to STS flow in site replication scenario.	2024-02-26 13:30:28 -08:00
Praveen raj Mani	30c2596512	Read drive IO stats from sysfs instead of procfs (#19131 ) Currently, we read from `/proc/diskstats` which is found to be un-reliable in k8s environments. We can read from `sysfs` instead. Also, cache the latest drive io stats to find the diff and update the metrics.	2024-02-26 11:34:50 -08:00
Klaus Post	2b5e4b853c	Improve caching (#19130 ) * Remove lock for cached operations. * Rename "Relax" to `ReturnLastGood`. * Add `CacheError` to allow caching values even on errors. * Add NoWait that will return current value with async fetching if within 2xTTL. * Make benchmark somewhat representative. ``` Before: BenchmarkCache-12 16408370 63.12 ns/op 0 B/op After: BenchmarkCache-12 428282187 2.789 ns/op 0 B/op ``` * Remove `storageRESTClient.scanning`. Nonsensical - RPC clients will not have any idea about scanning. * Always fetch remote diskinfo metrics and cache them. Seems most calls are requesting metrics. * Do async fetching of usage caches.	2024-02-26 10:49:19 -08:00
Harshavardhana	92788e4cf4	fix: re-arrange console-sys to log properly in k8s/docker (#19129 ) fixes #19125	2024-02-26 01:33:48 -08:00
Harshavardhana	8a698fef71	fix: crash in ResourceMetrics RPC handling concurrent writers (#19123 ) Continuation of #19103 that had fixed the crash in peer metrics for cluster endpoint.	2024-02-25 00:51:38 -08:00
Harshavardhana	c2b54d92f6	allow all disk full errors to be handled (#19117 )	2024-02-24 09:11:14 -08:00
Harshavardhana	f965434022	fix: re-use endpoint strings to avoid allocation during audit (#19116 )	2024-02-23 16:19:13 -08:00
Harshavardhana	a3ac62596c	move timedValue -> cachevalue package (#19114 )	2024-02-23 13:28:14 -08:00
Harshavardhana	2faba02d6b	fix: allow diskInfo at storageRPC to be cached (#19112 ) Bonus: convert timedValue into a typed implementation	2024-02-23 09:21:38 -08:00
Krishnan Parthasarathi	ee158e1610	ilm: Update action count only on success (#19093 ) It also fixes a long-standing bug in expiring transitioned objects. The expiration action was deleting the current version in the case' of tiered objects instead of adding a delete marker.	2024-02-22 15:00:32 -08:00
Anis Eleuch	fa68efb1e7	s3: CopyObject to disallow invalid dest object names (#19110 ) By not doing so, objects can risk being in a wrong erasure set if the destination object name contains e.g. '//'	2024-02-22 10:05:17 -08:00
Anis Eleuch	8c53a4405a	Add audit for folder excess (#19109 ) Also replace ilm:expiry with scanner to avoid user confusion	2024-02-22 08:18:13 -08:00
Harshavardhana	c32f699105	turn-off md5sum for SSE-KMS/SSE-C as optimization for multipart (#19106 ) only enable md5sum if explicitly asked by the client, otherwise its not necessary to compute md5sum when SSE-KMS/SSE-C is enabled. this is continuation of #17958	2024-02-22 04:24:11 -08:00
Harshavardhana	53aa8f5650	use typos instead of codespell (#19088 )	2024-02-21 22:26:06 -08:00
Klaus Post	92180bc793	Add array recycling safety (#19103 ) Nil entries when recycling arrays.	2024-02-21 12:27:35 -08:00
Poorna	526b829a09	site replication: Disallow removal of site-replicator account (#19092 )	2024-02-21 02:09:33 -08:00
Anis Eleuch	9ea5d08ecd	site-repl: Fix endpoint in the error with unexpected deployment-id (#19086 )	2024-02-20 15:02:35 -08:00
Harshavardhana	35deb1a8e2	do not block on send channels under high load (#19090 ) all send channels must compete with `ctx` if not they will perpetually stay alive.	2024-02-20 15:00:35 -08:00
Harshavardhana	c7f7c47388	allow renames() for inlined writes without data-dir (#18801 ) data-dir not being present is okay, however we can still rely on the `rename()` atomic call instead of relying on write xl.meta write which may truncate the io.EOF.	2024-02-20 07:05:57 -08:00
Klaus Post	e06168596f	Convert more peer <--> peer REST calls (#19004 ) * Convert more peer <--> peer REST calls * Clean up in general. * Add JSON wrapper. * Add slice wrapper. * Add option to make handler return nil error if no connection is given, `IgnoreNilConn`. Converts the following: ``` + HandlerGetMetrics + HandlerGetResourceMetrics + HandlerGetMemInfo + HandlerGetProcInfo + HandlerGetOSInfo + HandlerGetPartitions + HandlerGetNetInfo + HandlerGetCPUs + HandlerServerInfo + HandlerGetSysConfig + HandlerGetSysServices + HandlerGetSysErrors + HandlerGetAllBucketStats + HandlerGetBucketStats + HandlerGetSRMetrics + HandlerGetPeerMetrics + HandlerGetMetacacheListing + HandlerUpdateMetacacheListing + HandlerGetPeerBucketMetrics + HandlerStorageInfo + HandlerGetLocks + HandlerBackgroundHealStatus + HandlerGetLastDayTierStats + HandlerSignalService + HandlerGetBandwidth ```	2024-02-19 14:54:46 -08:00
Harshavardhana	4c8197a119	reject expired STS credentials early without decoding sessionToken (#19072 )	2024-02-19 07:34:10 -08:00
Harshavardhana	b6e98aed01	fix: found races in accessing globalLocalDrives (#19069 ) make a copy before accessing globalLocalDrives Bonus: update console v0.46.0 Signed-off-by: Harshavardhana <harsha@minio.io>	2024-02-16 17:15:57 -08:00
Anis Eleuch	00dcba9ddd	Fix typo in jwt skewed date/time error (#19066 )	2024-02-16 10:48:30 -08:00
Harshavardhana	607cafadbc	converge clusterRead health into cluster health (#19063 )	2024-02-15 16:48:36 -08:00
Anis Eleuch	68dde2359f	log: Add logger.Event to send to console and other logger targets (#19060 ) Add a new function logger.Event() to send the log to Console and http/kafka log webhooks. This will include some internal events such as disk healing and rebalance/decommissioning	2024-02-15 15:13:30 -08:00
Poorna	f9dbf41e27	sr: add validation to disallow updating bandwidth limit on self (#19062 )	2024-02-15 13:03:40 -08:00
Krishnan Parthasarathi	7405760f44	Refresh tier config periodically (#19049 ) - Increase the parity for tier-config.bin object - Refresh globalTierConfigMgr cached value once every 15 mins	2024-02-15 11:52:44 -08:00
Harshavardhana	7e4a6b4bcd	remove rename2 entirely, avoids the risk of moving data (#19058 )	2024-02-14 17:09:38 -08:00
Harshavardhana	f961ec4aaf	fix: revert allow offline disks on fresh start (#19052 ) the PR in #16541 was incorrect and hand wrong assumptions about the overall setup, revert this since this expectation to have offline servers is wrong and we can end up with a bigger chicken and egg problem. This reverts commit `5996c8c4d5`. Bonus: - preserve disk in globalLocalDrives properly upon connectDisks() - do not return 'nil' from newXLStorage(), getting it ready for the next set of changes for 'format.json' loading.	2024-02-14 10:37:34 -08:00
Harshavardhana	134db72bb7	fix: reject service account access key same as root credentials (#19055 )	2024-02-14 10:37:12 -08:00
Harshavardhana	effe21f3eb	send correct objectname in audit events for DeleteAll ILM (#19053 )	2024-02-14 08:07:58 -08:00
Praveen raj Mani	1118b285d3	fix: race in deleting objects during batch expiry (#19054 )	2024-02-14 08:07:44 -08:00
Aditya Manthramurthy	a14e192376	fix: remove unnecessary panic in iam-store (#19050 )	2024-02-13 19:29:36 -08:00
Minio Trusted	f8e15e7d09	Update yaml files to latest version RELEASE.2024-02-13T15-35-11Z	2024-02-13 16:01:38 +00:00
Shireesh Anjal	7b9f9e0628	fix incorrect disk io stats in k8s environment (#19016 ) The previous logic of calculating per second values for disk io stats divides the stats by the host uptime. This doesn't work in k8s environment as the uptime is of the pod, but the stats (from /proc/diskstats) are from the host. Fix this by storing the initial values of uptime and the stats at the timme of server startup, and using the difference between current and initial values when calculating the per second values.	2024-02-13 07:35:11 -08:00
Praveen raj Mani	ac8e9ce04f	Send a bucket notification event on DeleteObject() for non-existing object (#19037 ) Send a bucket notification event on DeleteObject for non-existing objects	2024-02-13 07:34:17 -08:00
Praveen raj Mani	cfd8645843	fix: update batch replication stats for snowball uploads (#19045 )	2024-02-13 07:33:27 -08:00
Harshavardhana	0c068b15c7	add missing handler for reloading site replication config on peers (#19042 )	2024-02-13 06:55:54 -08:00
Anis Eleuch	30a466aa71	sts: Add test for DurationSeconds condition (#19044 )	2024-02-13 06:55:37 -08:00
Taran Pelkey	4d94609c44	FIx unexpected behavior when creating service account (#19036 )	2024-02-13 02:31:43 -08:00
Poorna	0cc9fb73e1	metrics: fix typo in namespace for proxy tagging metric (#19039 ) Relevant PR introducing this metric: #18957	2024-02-12 13:02:27 -08:00
Harshavardhana	eac4e4b279	honor replaced disk properly by updating globalLocalDrives (#19038 ) globalLocalDrives seem to be not updated during the HealFormat() leads to a requirement where the server needs to be restarted for the healing to continue.	2024-02-12 13:00:20 -08:00
Harshavardhana	6d381f7c0a	relax pre-emptive GetBucketInfo() for multi-object delete (#19035 )	2024-02-12 08:46:46 -08:00
Anis Eleuch	4fa06aefc6	Convert service account add/update expiration to cond values (#19024 ) In order to force some users allowed to create or update a service account to provide an expiration satifying the user policy conditions.	2024-02-12 08:36:16 -08:00
Harshavardhana	0e177a44e0	preserve conflicting objects when parent object is being deleted (#19034 ) a/prefix a/prefix/1.txt where `a/prefix` is an object which does not have `/` at the end, we do not have to aggressively recursively delete all the sub-folders as well. Instead convert the call into self contained to deleting 'xl.meta' and then subsequently attempting to Remove the parent.	2024-02-12 08:30:40 -08:00
Harshavardhana	afd19de5a9	fix: allow configuring excess versions alerting (#19028 ) Bonus: enable audit alerts for object versions beyond the configured value, default is '100' versions per object beyond which scanner will alert for each such objects.	2024-02-11 23:41:53 -08:00
Harshavardhana	e3fbac9e24	do not have to use the same distributionAlgo as first pool (#19031 ) when we expand via pools, there is no reason to stick with the same distributionAlgo as the rest. Since the algo only makes sense with-in a pool not across pools. This allows for newer pools to use newer codepaths to avoid legacy file lookups when they have a pre-existing deployment from 2019, they can expand their new pool to be of a newer distribution format, allowing the pool to be more performant.	2024-02-11 23:21:56 -08:00
Poorna	a9cf32811c	Fix panic in tagging request proxying (#19032 )	2024-02-11 18:18:43 -08:00
Harshavardhana	53997ecc79	avoid excessive logging for objects that do not exist (#19030 ) in replicated setups, that have proxying enabled for replicated buckets.	2024-02-11 14:21:08 -08:00
Harshavardhana	997ba3a574	introduce reader deadlines for net.Conn (#19023 ) Bonus: set "retry-after" header for AWS SDKs if possible to honor them.	2024-02-09 13:25:16 -08:00
Harshavardhana	62761a23e6	remove unnecessary metrics in 'mc admin info' output (#19020 ) Reduce the amount of data transfer on large deployments	2024-02-08 19:28:46 -08:00
Harshavardhana	404d8b3084	fix: dangling objects honor parityBlocks instead of dataBlocks (#19019 ) Bonus: do not recreate buckets if NoRecreate is asked.	2024-02-08 15:22:16 -08:00
Klaus Post	6005ad3d48	Fix shared top locks client (#19018 ) `client` is shared across goroutines. Seen with `mc support top locks` on minio built with `-race`.	2024-02-08 12:28:05 -08:00
Harshavardhana	035a3ea4ae	optimize startup sequence performance (#19009 ) - bucket metadata does not need to look for legacy things anymore if b.Created is non-zero - stagger bucket metadata loads across lots of nodes to avoid the current thundering herd problem. - Remove deadlines for RenameData, RenameFile - these calls should not ever be timed out and should wait until completion or wait for client timeout. Do not choose timeouts for applications during the WRITE phase. - increase R/W buffer size, increase maxMergeMessages to 30	2024-02-08 11:21:21 -08:00
Aditya Manthramurthy	e104b183d8	fix: skip policy usage validation for cache update (#19008 ) When updating the policy cache, we do not need to validate policy usage as the policy has already been deleted by the node sending the notification.	2024-02-07 20:39:53 -08:00
Klaus Post	7e082f232e	Add GetBucketInfo toStorageErr conversion (#19005 ) Convert error to storageError since it is used for quorum calculations here: `ff80cfd83d/cmd/peer-s3-client.go (L339)`	2024-02-07 14:24:24 -08:00
Harshavardhana	d28bf71f25	listing must return WalkDir() errors first (#19006 )	2024-02-07 13:20:07 -08:00
Harshavardhana	5b1a74b6b2	do not block iam.store registration (#18999 ) current implementation would quite simply block the sys.store registration, making sys.Initialized() call to be blocked.	2024-02-07 12:41:58 -08:00
Klaus Post	ebc6c9b498	Fix tracing send on closed channel (#18982 ) Depending on when the context cancelation is picked up the handler may return and close the channel before `SubscribeJSON` returns, causing: ``` Feb 05 17:12:00 s3-us-node11 minio[3973657]: panic: send on closed channel Feb 05 17:12:00 s3-us-node11 minio[3973657]: goroutine 378007076 [running]: Feb 05 17:12:00 s3-us-node11 minio[3973657]: github.com/minio/minio/internal/pubsub.(PubSub[...]).SubscribeJSON.func1() Feb 05 17:12:00 s3-us-node11 minio[3973657]: github.com/minio/minio/internal/pubsub/pubsub.go:139 +0x12d Feb 05 17:12:00 s3-us-node11 minio[3973657]: created by github.com/minio/minio/internal/pubsub.(PubSub[...]).SubscribeJSON in goroutine 378010884 Feb 05 17:12:00 s3-us-node11 minio[3973657]: github.com/minio/minio/internal/pubsub/pubsub.go:124 +0x352 ``` Wait explicitly for the goroutine to exit. Bonus: Listen for doneCh when sending to not risk getting blocked there is channel isn't being emptied.	2024-02-06 08:57:30 -08:00
Harshavardhana	630963fa6b	protect tracker copy properly to avoid race (#18984 ) ``` WARNING: DATA RACE Write at 0x00c000aac1e0 by goroutine 1133: github.com/minio/minio/cmd.(healingTracker).updateProgress() github.com/minio/minio/cmd/background-newdisks-heal-ops.go:183 +0x117 github.com/minio/minio/cmd.(erasureObjects).healErasureSet.func5() github.com/minio/minio/cmd/global-heal.go:292 +0x1d3 Previous read at 0x00c000aac1e0 by goroutine 1003: github.com/minio/minio/cmd.(allHealState).updateHealStatus() github.com/minio/minio/cmd/admin-heal-ops.go:136 +0xcb github.com/minio/minio/cmd.(healingTracker).save() github.com/minio/minio/cmd/background-newdisks-heal-ops.go:223 +0x424 ```	2024-02-06 08:56:59 -08:00
Harshavardhana	f674168b8b	Add missing gob register for map[string]string{} (#18974 ) ``` minio[1303918]: API: SYSTEM() minio[1303918]: Time: 02:04:28 UTC 02/05/2024 minio[1303918]: DeploymentID: 0972de33-2d17-4499-8967-aff6437dd9da minio[1303918]: Error: gob: type not registered for interface: map[string]string (errors.errorString) minio[1303918]: 4: internal/logger/logonce.go:118:logger.(logOnceType).logOnceIf() minio[1303918]: 3: internal/logger/logonce.go:149:logger.LogOnceIf() minio[1303918]: 2: cmd/peer-rest-server.go:533:cmd.(*peerRESTServer).GetSysConfigHandler() minio[1303918]: 1: net/http/server.go:2136:http.HandlerFunc.ServeHTTP() ```	2024-02-06 08:23:23 -08:00
Poorna	27d02ea6f7	metrics: add replication metrics on proxied requests (#18957 )	2024-02-05 22:00:45 -08:00
Harshavardhana	794a7993cb	calculate correct quorum check for metadata updates on object (#18979 ) this fixes rare bugs we have seen but never really found a reproducer for - PutObjectRetention() returning 503s - PutObjectTags() returning 503s - PutObjectMetadata() updates during replication returning 503s These calls return errors, and this perpetuates with no apparent fix. This PR fixes with correct quorum requirement.	2024-02-05 21:44:40 -08:00
Harshavardhana	6f16d1cb2c	do not count context canceled as timeout errors (#18975 )	2024-02-05 18:16:13 -08:00
Anis Eleuch	7aa00bff89	sts: Add support of AssumeRoleWithWebIdentity and DurationSeconds (#18835 ) To force limit the duration of STS accounts, the user can create a new policy, like the following: { "Version": "2012-10-17", "Statement": [{ "Effect": "Allow", "Action": ["sts:AssumeRoleWithWebIdentity"], "Condition": {"NumericLessThanEquals": {"sts:DurationSeconds": "300"}} }] } And force binding the policy to all OpenID users, whether using a claim name or role ARN.	2024-02-05 11:44:23 -08:00
Klaus Post	e046eb1d17	Disable Rename2 metrics on non-linux (#18970 ) Logging a call that always fails is pointless.	2024-02-05 10:48:14 -08:00
Anis Eleuch	ba975ca320	Add defensive code to ignore checking parts with transitioned objects (#18973 ) Though dataErrs are nil with transitioned objects, add a more defensive code to ignore counting missing parts in that case	2024-02-05 10:48:03 -08:00
Harshavardhana	fec13b0ec1	remove unused DiskMTime (#18965 )	2024-02-05 01:04:26 -08:00
Harshavardhana	100c35c281	avoid excessive logs when peer is down (#18969 )	2024-02-04 23:25:42 -08:00
Harshavardhana	f225ca3312	Add more advanced cases for dangling (#18968 )	2024-02-04 14:36:13 -08:00
Frank Wessels	8b68e0bfdc	Fix typo in api-router.go (#18955 )	2024-02-03 14:03:51 -08:00
Anis Eleuch	6ae97aedc9	xl: Disable rename2 in decommissioning/rebalance (#18964 ) Always disable rename2 optimization in decom/rebalance	2024-02-03 14:03:30 -08:00
Harshavardhana	960d604013	disconnected returns, an unexpected error to List() returning 500s (#18959 ) provide the error string appropriately so that the matching of error types works. Also add a string based fallback for the said error.	2024-02-03 01:04:33 -08:00
Harshavardhana	ff80cfd83d	move Make,Delete,Head,Heal bucket calls to websockets (#18951 )	2024-02-02 14:54:54 -08:00
Harshavardhana	99fde2ba85	deprecate disk tokens, instead rely on deadlines and active monitoring (#18947 ) disk tokens usage is not necessary anymore with the implementation of deadlines for storage calls and active monitoring of the drive for I/O timeouts. Functionality kicking off a bad drive is still supported, it's just that we do not have to serialize I/O in the manner tokens would do.	2024-02-02 10:10:54 -08:00
Frank Wessels	31743789dc	Fix some leftover issues from PR 18936 (#18946 )	2024-02-01 19:42:56 -08:00
Anis Eleuch	6fd63e920a	log: Use error log type instead of Application/MinIO type (#18930 ) * log: Use error log type instead of Application/MinIO type Also bump github.com/shirou/gopsutil version to address cross compilation issues. * Apply suggestions from code review Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com> --------- Co-authored-by: Anis Eleuch <anis@min.io> Co-authored-by: Harshavardhana <harsha@minio.io> Co-authored-by: Aditya Manthramurthy <donatello@users.noreply.github.com>	2024-02-01 16:13:57 -08:00
Aditya Manthramurthy	59cc3e93d6	fix: `null` inline policy handling for access keys (#18945 ) Interpret `null` inline policy for access keys as inheriting parent policy. Since MinIO Console currently sends this value, we need to honor it for now. A larger fix in Console and in the server are required. Fixes #18939.	2024-02-01 14:45:03 -08:00
Anis Eleuch	61a4bb38cd	batch: Fix a typo while validating smallerThan field (#18942 )	2024-02-01 13:53:26 -08:00
Klaus Post	b192bc348c	Improve object reuse for grid messages (#18940 ) Allow internal types to support a `Recycler` interface, which will allow for sharing of common types across handlers. This means that all `grid.MSS` (and similar) objects are shared across in a common pool instead of a per-handler pool. Add internal request reuse of internal types. Add for safe (pointerless) types explicitly. Only log params for internal types. Doing Sprint(obj) is just a bit too messy.	2024-02-01 12:41:20 -08:00
Harshavardhana	6440d0fbf3	move a collection of peer APIs to websockets (#18936 )	2024-02-01 10:47:20 -08:00
Anis Eleuch	24ecc44bac	Keep ServiceV1 admin stop/restart API and mark as deprecated (#18932 )	2024-01-31 12:20:33 -08:00
Aditya Manthramurthy	0ae4915a93	fix: permission checks for editing access keys (#18928 ) With this change, only a user with `UpdateServiceAccountAdminAction` permission is able to edit access keys. We would like to let a user edit their own access keys, however the feature needs to be re-designed for better security and integration with external systems like AD/LDAP and OpenID. This change prevents privilege escalation via service accounts.	2024-01-31 10:56:45 -08:00
Harshavardhana	caac9d216e	remove all the frivolous logs, that may or may not be actionable (#18922 ) for actionable, inspections we have `mc support inspect` we do not need double logging, healing will report relevant errors if any, in terms of quorum lost etc.	2024-01-30 18:11:45 -08:00
Harshavardhana	057192913c	add total usable capacity, free and used to DataUsageInfo() (#18921 )	2024-01-30 17:49:37 -08:00
Harshavardhana	f25cbdf43c	use all the available nr_requests for NVMe (#18920 )	2024-01-30 14:10:06 -08:00
Klaus Post	6da4a9c7bb	Improve tracing & notification scalability (#18903 ) * Perform JSON encoding on remote machines and only forward byte slices. * Migrate tracing & notification to WebSockets.	2024-01-30 12:49:02 -08:00
Harshavardhana	80ca120088	remove checkBucketExist check entirely to avoid fan-out calls (#18917 ) Each Put, List, Multipart operations heavily rely on making GetBucketInfo() call to verify if bucket exists or not on a regular basis. This has a large performance cost when there are tons of servers involved. We did optimize this part by vectorizing the bucket calls, however its not enough, beyond 100 nodes and this becomes fairly visible in terms of performance.	2024-01-30 12:43:25 -08:00
Anis Eleuch	a669946357	Add cgroup v2 support for memory limit (#18905 )	2024-01-30 11:13:27 -08:00
Poorna	7ffc162ea8	exclude veeam virtual objects from replication (#18918 ) Fixes: #18916	2024-01-30 10:43:58 -08:00
Poorna	bcfd7fbbcf	reuse transports for callhome and remote tgt validation (#18912 )	2024-01-29 23:05:39 -08:00
Harshavardhana	486e2e48ea	enable xattr capture by default (#18911 ) - healing must not set the write xattr because that is the job of active healing to update. what we need to preserve is permanent deletes. - remove older env for drive monitoring and enable it accordingly, as a global value.	2024-01-29 23:03:58 -08:00
Harshavardhana	2ddf2ca934	allow configuring maximum idle connections per host (#18908 )	2024-01-29 16:50:37 -08:00
Poorna	29b1a29044	fix metrics panic in node metrics endpoint (#18894 )	2024-01-29 12:32:44 -08:00
jiuker	b4ab8e095a	fix: preserve bucket metric of data usage for replication info (#18895 )	2024-01-29 08:54:20 -08:00
Harshavardhana	cff8235068	remove getReplicationNodeMetrics() from peer metrics groups	2024-01-28 18:45:20 -08:00
Harshavardhana	944f3c1477	remove local disk metrics from cluster metrics (#18886 ) local disk metrics were polluting cluster metrics Please remove them instead of adding relevant ones. - batch job metrics were incorrectly kept at bucket metrics endpoint, move it to cluster metrics. - add tier metrics to cluster peer metrics from the node. - fix missing set level cluster health metrics	2024-01-28 12:53:59 -08:00
Harshavardhana	1d3bd02089	avoid close 'nil' panics if any (#18890 ) brings a generic implementation that prints a stack trace for 'nil' channel closes(), if not safely closes it.	2024-01-28 10:04:17 -08:00
Harshavardhana	6347fb6636	add missing proper error return in WalkDir() (#18884 ) without this the caller might end up returning incorrect errors and not ignoring the drive properly.	2024-01-27 16:13:41 -08:00
Harshavardhana	32e668eb94	update() stale rebalance stats() object during pool expansion (#18882 ) it is entirely possible that a rebalance process which was running when it was asked to "stop" it failed to write its last statistics to the disk. After this a pool expansion can cause disruption and all S3 API calls would fail at IsPoolRebalancing() function. This PRs makes sure that we update rebalance.bin under such conditions to avoid any runtime crashes.	2024-01-27 10:14:03 -08:00
Harshavardhana	c88308cf0e	avoid 'panic' on mc admin update for single drive setup (#18876 )	2024-01-26 12:07:03 -08:00
Harshavardhana	88837fb753	add new update v2 that updates per node, allows idempotent behavior (#18859 ) add new update v2 that updates per node, allows idempotent behavior new API ensures that - binary is correct and can be downloaded checksummed verified - committed to actual path - restart returns back the relevant waiting drives	2024-01-26 08:40:13 -08:00
Harshavardhana	d0283ff354	remove unnecessary logs in HealBucket() (#18875 )	2024-01-26 08:39:57 -08:00
Harshavardhana	f449a7ae2c	allow bucket import to be idempotent (#18873 ) do not need to be defensive in our approach, we should simply override anything everything in import process, do not care about what currently exists on the disk - backup is the source of truth.	2024-01-25 17:20:54 -08:00
Klaus Post	a113b2c394	Fix inspect format.json exclusion (#18871 ) Right now the format.json is excluded if anything within `.minio.sys` is requested. I assume the check was meant to exclude only if it was actually requesting it.	2024-01-25 15:59:00 -08:00
Harshavardhana	74851834c0	further bootstrap/startup optimization for reading 'format.json' (#18868 ) - Move RenameFile to websockets - Move ReadAll that is primarily is used for reading 'format.json' to to websockets - Optimize DiskInfo calls, and provide a way to make a NoOp DiskInfo call.	2024-01-25 12:45:46 -08:00
Harshavardhana	e377bb949a	migrate bootstrap logic directly to websockets (#18855 ) improve performance for startup sequences by 2x for 300+ nodes.	2024-01-24 13:36:44 -08:00
Poorna	b6e9d235fe	fix replication error logs to include target endpoint (#18863 )	2024-01-24 13:05:43 -08:00
Klaus Post	4a6c97463f	Fix all racy use of NewDeadlineWorker (#18861 ) AlmosAll uses of NewDeadlineWorker, which relied on secondary values, were used in a racy fashion, which could lead to inconsistent errors/data being returned. It also propagates the deadline downstream. Rewrite all these to use a generic WithDeadline caller that can return an error alongside a value. Remove the stateful aspect of DeadlineWorker - it was racy if used - but it wasn't AFAICT. Fixes races like: ``` WARNING: DATA RACE Read at 0x00c130b29d10 by goroutine 470237: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).ReadVersion() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:702 +0x611 github.com/minio/minio/cmd.readFileInfo() github.com/minio/minio/cmd/erasure-metadata-utils.go:160 +0x122 github.com/minio/minio/cmd.erasureObjects.getObjectFileInfo.func1.1() github.com/minio/minio/cmd/erasure-object.go:809 +0x27a github.com/minio/minio/cmd.erasureObjects.getObjectFileInfo.func1.2() github.com/minio/minio/cmd/erasure-object.go:828 +0x61 Previous write at 0x00c130b29d10 by goroutine 470298: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).ReadVersion.func1() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:698 +0x244 github.com/minio/minio/internal/ioutil.(DeadlineWorker).Run.func1() github.com/minio/minio/internal/ioutil/ioutil.go:141 +0x33 WARNING: DATA RACE Write at 0x00c0ba6e6c00 by goroutine 94507: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).StatVol.func1() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:419 +0x104 github.com/minio/minio/internal/ioutil.(DeadlineWorker).Run.func1() github.com/minio/minio/internal/ioutil/ioutil.go:141 +0x33 Previous read at 0x00c0ba6e6c00 by goroutine 94463: github.com/minio/minio/cmd.(xlStorageDiskIDCheck).StatVol() github.com/minio/minio/cmd/xl-storage-disk-id-check.go:422 +0x47e github.com/minio/minio/cmd.getBucketInfoLocal.func1() github.com/minio/minio/cmd/peer-s3-server.go:275 +0x122 github.com/minio/pkg/v2/sync/errgroup.(*Group).Go.func1() ``` Probably back from #17701	2024-01-24 10:08:31 -08:00
Frank Wessels	6c912ac960	Fix startup message when using single path (#18856 )	2024-01-24 10:02:56 -08:00
Harshavardhana	708cebe7f0	add necessary protection err, fileInfo slice reads and writes (#18854 ) protection was in place. However, it covered only some areas, so we re-arranged the code to ensure we could hold locks properly. Along with this, remove the DataShardFix code altogether, in deployments with many drive replacements, this can affect and lead to quorum loss.	2024-01-24 01:08:23 -08:00
Harshavardhana	f78d677ab6	pre-allocate EC memory by default at startup (#18846 )	2024-01-23 20:41:11 -08:00
Poorna	e39e2306d6	site replication: remove extraneous log for missing group (#18785 )	2024-01-23 18:28:11 -08:00
Harshavardhana	52229a21cb	avoid reload of 'format.json' over the network under normal conditions (#18842 )	2024-01-23 14:11:46 -08:00
Harshavardhana	961f7dea82	compress binary while sending it to all the nodes (#18837 ) Also limit the amount of concurrency when sending binary updates to peers, avoid high network over TX that can cause disconnection events for the node sending updates.	2024-01-22 12:16:36 -08:00
Shubhendu	65c4d550cb	Distribution bucket metrics with site replication (#18841 ) If site replication is enabled, we should still show the size and version distribution histogram metrics at bucket level. Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-01-22 08:45:36 -08:00
Harshavardhana	f9b4a8d6e8	improve server update behavior by re-using memory properly (#18831 )	2024-01-19 18:27:58 -08:00
Harshavardhana	e11d851aee	add new drive I/O waiting/tokens metric (#18836 ) Bonus: add virtual memory used as well part of the system resource metrics.	2024-01-19 14:51:36 -08:00
Harshavardhana	ac81f0248c	introduce new ServiceV2 API to handle guided restarts (#18826 ) New API now verifies any hung disks before restart/stop, provides a 'per node' break down of the restart/stop results. Provides also how many blocked syscalls are present on the drives and what users must do about them. Adds options to do pre-flight checks to provide information to the user regarding any hung disks. Provides 'force' option to forcibly attempt a restart() even with waiting syscalls on the drives.	2024-01-19 14:22:36 -08:00
Aditya Manthramurthy	cc960adbee	fix: remove policy mapping file when empty (#18828 ) On a policy detach operation, if there are no policies remaining attached to the user/group, remove the policy mapping file, instead of leaving a file containing an empty list of policies.	2024-01-19 10:31:40 -08:00
Shubhendu	19387cafab	Use +Inf label additionally for Histogram metrics (#18807 )	2024-01-18 14:51:28 -08:00
Harshavardhana	7c0673279b	capture I/O in waiting and total tokens in diskMetrics (#18819 ) This is needed for the subsequent changes in ServerUpdate(), ServerRestart() etc.	2024-01-18 11:17:43 -08:00
Anis Eleuch	7ce0d71a96	Do not log volume not empty when healing dangling buckets (#18822 ) Healing dangling buckets is conservative, and it is a typical use case to fail to remove a dangling bucket because it contains some data because healing danging bucket code is not allowed to remove data: only healing the dangling object is allowed to do so.	2024-01-18 10:39:27 -08:00
Harshavardhana	dd2542e96c	add codespell action (#18818 ) Original work here, #18474, refixed and updated.	2024-01-17 23:03:17 -08:00
Harshavardhana	21d60eab7c	remove all older unused APIs (#18769 )	2024-01-17 20:41:23 -08:00
Harshavardhana	a4a74e9844	re-init the worker group to ensure errs[] slice is fresh	2024-01-17 20:33:25 -08:00
Harshavardhana	9588978028	fix: HealBucket regression for empty buckets, simplify it (#18815 )	2024-01-17 15:19:09 -08:00
chienguo	8cd967803c	fix: a typo in storeDataUsageInBackend() comment (#18778 )	2024-01-16 15:48:54 -08:00
Harshavardhana	a0e1163fb6	reject reference format from a different deployment (#18800 ) reference format is constant for any lifetime of a minio cluster, we do not have to ever replace it during HealFormat() as it will never change. additionally we should simply reject reference formats that we do not understand early on.	2024-01-16 15:13:14 -08:00
Sveinn	30bd5e2669	adding a missing return case to fix GetObjectTagging (#18793 )	2024-01-15 16:11:06 -08:00
Harshavardhana	38637897ba	fix: listing SSE encrypted multipart objects (#18786 ) GetActualSize() was heavily relying on o.Parts() to be non-empty to figure out if the object is multipart or not, However, we have many indicators of whether an object is multipart or not. Blindly assuming that o.Parts == nil is not a multipart, is an incorrect expectation instead, multipart must be obtained via - Stored metadata value indicating this is a multipart encrypted object. - Rely on <meta>-actual-size metadata to get the object's actual size. This value is preserved for additional reasons such as these. - ETag != 32 length	2024-01-15 00:57:49 -08:00
Harshavardhana	993d96feef	treat all localhost endpoints as local setup with same port (#18784 ) fixes #18783 and avoids user mistakes	2024-01-12 23:53:03 -08:00
Poorna	b2b26d9c95	support proxying of tagging requests in replication (#18649 ) support proxying of tagging requests in active-active replication Note: even if proxying is successful, PutObjectTagging/DeleteObjectTagging will continue to report a 404 since the object is not present locally.	2024-01-12 23:51:33 -08:00
Krishnan Parthasarathi	cba3dd276b	Add more size intervals to obj size histogram (#18772 ) New intervals: [1024B, 64KiB) [64KiB, 256KiB) [256KiB, 512KiB) [512KiB, 1MiB) The new intervals helps us see object size distribution with higher resolution for the interval [1024B, 1MiB).	2024-01-12 23:51:08 -08:00
Anis Eleuch	a47fc75c26	xl: Remove wrong wording for errCorruptedFormat (#18775 ) Also add errCorruptedBackend to make it easier to differentiate between corrupted content or something else wrong in the backend drive	2024-01-12 14:48:44 -08:00
Harshavardhana	e5c8794b8b	avoid disk monitoring leaks under various conditions (#18777 ) - HealFormat() was leaking healthcheck goroutines for disks, we are only interested in enabling healthcheck for the newly formatted disk, not for existing disks. - When disk is a root-disk a random disk monitor was leaking while we ignored the drive. - When loading the disk for each erasure set, we were leaking goroutines for the prepare-storage.go disks which were replaced via the globalLocalDrives slice - avoid disk monitoring utilizing health tokens that would cause exhaustion in the tokens, prematurely which were meant for incoming I/O. This is ensured by avoiding writing O_DIRECT aligned buffer instead write 2048 worth of content only as O_DSYNC, which is sufficient.	2024-01-12 01:48:36 -08:00
Taran Pelkey	ac90a873eb	Verify that remote target bucket is on MinIO server for bucket replication (#18656 )	2024-01-11 14:56:16 -08:00
jiuker	c1a78224cf	fix: prevent queries from starting before initialization (#18766 )	2024-01-10 15:21:52 -08:00
Harshavardhana	39f9350697	optimize readdir() open calls to be dealt with directly via 'fd' (#18762 )	2024-01-10 08:48:50 -08:00
Shubhendu	e31081d79d	Heal buckets at node level (#18612 ) Signed-off-by: Shubhendu Ram Tripathi <shubhendu@minio.io>	2024-01-09 20:34:04 -08:00
Harshavardhana	f02d282754	avoid frivolous logs for expired credentials (#18767 )	2024-01-09 12:25:18 -08:00
Krishnan Parthasarathi	3a90af0bcd	Add line, col to types used in batch-expire (#18747 )	2024-01-08 15:22:28 -08:00
jiuker	53ceb0791f	fix: prevent queries from starting before initialization (#18756 ) Prevent queries from starting before initialization	2024-01-08 12:40:27 -08:00
jiuker	2cd98a0d21	remove outdated notes (#18755 )	2024-01-08 08:04:19 -08:00
Anis Eleuch	04135fa6cd	audit: Add the drives where the dangling object is removed (#18737 )	2024-01-05 14:17:24 -08:00
Harshavardhana	42dc6329e6	simplify success response for GetObjectAttributes() (#18746 )	2024-01-05 12:50:07 -08:00
Sveinn	9b8ba97f9f	feat: add support for GetObjectAttributes API (#18732 )	2024-01-05 10:43:06 -08:00
Anis Eleuch	7705605b5a	scanner: Add a config to disable short sleep between objects scan (#18734 ) Add a hidden configuration under the scanner sub section to configure if the scanner should sleep between two objects scan. The configuration has only effect when there is no drive activity related to s3 requests or healing. By default, the code will keep the current behavior which is doing sleep between objects. To forcefully enable the full scan speed in idle mode, you can do this: `mc admin config set myminio scanner idle_speed=full`	2024-01-04 15:07:17 -08:00
Anis Eleuch	414bcb0c73	prom: Add read quorum per erasure set metric (#18736 )	2024-01-04 15:05:13 -08:00
Harshavardhana	f4710948c4	fix: an odd crash when deleting `null` DEL markers (#18727 ) fixes #18724 A regression was introduced in #18547, that attempted to file adding a missing `null` marker however we should not skip returning based on versionID instead it must be based on if we are being asked to create a DEL marker or not. The PR also has a side-affect for replicating `null` marker permanent delete, as it may end up adding a `null` marker while removing one. This PR should address both scenarios.	2024-01-02 15:08:18 -08:00
Anis Eleuch	3f4488c589	scanner: Allow full throttle if there is no parallel disk ops (#18109 )	2024-01-02 13:51:24 -08:00
Pedro Juarez	8f13c8c3bf	Support to store browser config settings (#18631 ) * csp_policy * hsts_seconds * hsts_include_subdomains * hsts_preload * referrer_policy	2024-01-01 08:36:33 -08:00
Zhou Ting	31d16f6cc2	allow sha256 payload to be configurable for object perf test (#18712 ) Signed-off-by: Zhou Ting <ting.z.zhou@intel.com>	2023-12-29 23:56:50 -08:00
Harshavardhana	a50ea92c64	feat: introduce list_quorum="auto" to prefer quorum drives (#18084 ) NOTE: This feature is not retro-active; it will not cater to previous transactions on existing setups. To enable this feature, please set ` _MINIO_DRIVE_QUORUM=on` environment variable as part of systemd service or k8s configmap. Once this has been enabled, you need to also set `list_quorum`. ``` ~ mc admin config set alias/ api list_quorum=auto` ``` A new debugging tool is available to check for any missing counters.	2023-12-29 15:52:41 -08:00
Harshavardhana	5b2ced0119	re-use globalLocalDrives properly (#18721 )	2023-12-29 09:30:10 -08:00
Anis Eleuch	8a0ba093dd	audit: Fix merrs and derrs object dangling message (#18714 ) merrs and derrs are empty when a dangling object is deleted. Fix the bug and adds invalid-meta data for data blocks	2023-12-27 22:27:04 -08:00
Daniel Valdivia	5fc7da345d	Upgrade Console to v0.44.0 (#18717 ) Signed-off-by: Daniel Valdivia <18384552+dvaldivia@users.noreply.github.com>	2023-12-27 11:19:13 -08:00
Anis Eleuch	8bd4f6568b	server-info: Avoid initializing audit/log http/kafka targets (#18703 ) This can cause unnecessary ServerInfo() call delay.	2023-12-22 10:25:08 -08:00
Harshavardhana	da55499db0	fix: reject clients that do not send proper payload (#18701 )	2023-12-22 01:26:17 -08:00
Anis Eleuch	22f8e39b58	tier: Allow edit of the new Azure and AWS auth params (#18690 ) Allow editing for the service principal credentials from Azure and the web identity token for AWS; Also, more validation of input parameters.	2023-12-21 16:58:10 -08:00
Harshavardhana	eba23bbac4	rename object_size -> block_size for cache subsystem (#18694 )	2023-12-21 16:57:13 -08:00
Harshavardhana	4550535cbb	send proper IPv6 names avoid bracketing notation (#18699 ) Following policies if present ``` "Condition": { "IpAddress": { "aws:SourceIp": [ "54.240.143.0/24", "2001:DB8:1234:5678::/64" ] } } ``` And client is making a request to MinIO via IPv6 can potentially crash the server. Workarounds are turn-off IPv6 and use only IPv4	2023-12-21 16:56:55 -08:00
Anis Eleuch	8432fd5ac2	prom: Add online and healing drives metrics per erasure set (#18700 )	2023-12-21 16:56:43 -08:00
Harshavardhana	7c948adf88	allow pre-allocating buffers to reduce frequent GCs during growth (#18686 ) This PR also increases per node bpool memory from 1024 entries to 2048 entries; along with that, it also moves the byte pool centrally instead of being per pool.	2023-12-21 08:59:38 -08:00
Krishnan Parthasarathi	56b7045c20	Export tier metrics (#18678 ) minio_node_tier_ttlb_seconds - Distribution of time to last byte for streaming objects from warm tier minio_node_tier_requests_success - Number of requests to download object from warm tier that were successful minio_node_tier_requests_failure - Number of requests to download object from warm tier that failed	2023-12-20 20:13:40 -08:00
Poorna	d55b6b9909	Fix quota config replication for SR (#18684 ) Fixing regression introduced by PR #17988	2023-12-19 13:22:47 -08:00
Shireesh Anjal	7680e5f81d	Read new key license_v2 from SUBNET response (#18669 ) SUBNET now has a v2 of license that is returned in the new key `license_v2`. mc will start reading and storing the same. (The old key `license` is deprecated but is still available in SUBNET response to ensure that the current released version of minio doesn't break)	2023-12-18 08:21:44 -08:00
Taran Pelkey	ad8a34858f	Add APIs to create and list access keys for LDAP (#18402 )	2023-12-15 13:00:43 -08:00
Krishnan Parthasarathi	162eced7d2	Fix incorrect metric desc for bucketRequestsDuration (#18657 )	2023-12-14 19:02:11 -08:00
Krishnan Parthasarathi	bec1f7c26a	metrics: Refactor handling of histogram vectors (#18632 )	2023-12-14 14:02:52 -08:00
Anis Eleuch	8771617199	tier: Add support of AWS S3 tiering with web identity token file (#18648 )	2023-12-14 14:01:49 -08:00
Klaus Post	6c89a81af4	Fix CreateFile shared buffer corruption. (#18652 ) `(xlStorageDiskIDCheck).CreateFile` wraps the incoming reader in `xioutil.NewDeadlineReader`. The wrapped reader is handed to `(xlStorage).CreateFile`. This performs a Read call via `writeAllDirect`, which reads into an `ODirectPool` buffer. `(*DeadlineReader).Read` spawns an async read into the buffer. If a timeout is hit while reading, the read operation returns to `writeAllDirect`. The operation returns an error and the buffer is reused. However, if the async `Read` call unblocks, it will write to the now recycled buffer. Fix: Remove the `DeadlineReader` - it is inherently unsafe. Instead, rely on the network timeouts. This is not a disk timeout, anyway. Regression in https://github.com/minio/minio/pull/17745	2023-12-14 10:51:57 -08:00
Praveen raj Mani	10ca0a6936	Label the notification target metrics by their target IDs (#18633 ) This patch adds the targetID to the existing notification target metrics and deprecates the current target metrics which points to the overall event notification subsystem	2023-12-14 09:09:26 -08:00
Harshavardhana	b3314e97a6	re-use the same local drive used by remote-peer (#18645 ) historically, we have always kept storage-rest-server and a local storage API separate without much trouble, since they both can independently operate due to no special state() between them. however, over some time, we have added state() such as - drive monitoring threads now there will be "2" of them per drive instead of just 1. - concurrent tokens available per drive are now twice instead of just single shared, allowing unexpectedly high amount of I/O to go through. - applying serialization by using walkMutexes can now be adequately honored for both remote callers and local callers.	2023-12-13 19:27:55 -08:00
Poorna	3781a0f9ad	replication: Pass metadata timestamps in CopyObject call (#18647 ) Regression from #18285. CopyObject options were inheriting source MTime for metadata timestamps if unspecified, removing this prevented metadata updates from being applied on target.	2023-12-13 15:28:55 -08:00
Poorna	e79b289325	fix datadir missing check on HeadObject (#18646 ) versions pending purge in replication were seeing a errFileCorrupt that prevents permanent deletion after replication. Regression from PR#18477	2023-12-13 14:54:01 -08:00
Harshavardhana	3f72c7fcc7	healthcheck requests with user-agent mozilla do not need redirects (#18642 ) apparently, windows powershell curl has this abhorrent behavior	2023-12-12 16:16:26 -08:00
Harshavardhana	d521c84d55	reduce logging during permission denied errors (#18641 ) log them if any only once	2023-12-12 16:11:17 -08:00
Anis Eleuch	4a21dce2b5	tier: Add support of SP credentials with Azure (#18630 ) Co-authored-by: Anis Elleuch <anis@min.io>	2023-12-11 21:51:53 -08:00
Harshavardhana	65f34cd823	fix: remove ODirectReader entirely since we do not need it anymore (#18619 )	2023-12-09 10:17:51 -08:00

... 3 4 5 6 7 ...

6124 Commits