Google Cloud quotas in us-east4 are very limiting - google-cloud-platform

Any new project that we start has very restrictive quotas for us-east4, e.g. 24 CPUs.
While most quota increase requests seem to be approved immediately, the requests are manual. Because our workflow involves automatically spinning up projects, we need a solution that doesn't involve manual intervention.
In comparison, the same project has more reasonable quotas for other regions, e.g. 2400 CPUs for us-west1, us-east1, and us-central1.
Is there something about us-east4 that makes it inappropriate for these sorts of workloads? We're using it because of its lower latency to us, but we will be flexible if absolutely required.

As per the GCP quota documentation, users have to proactively request a quota increase. When you do, you can size the request to take into account your expected upcoming increase in usage. A Feature Request has been created so that this functionality can be made available in the future. Right now there is no ETA for this FR, but any progress can be tracked on the previous link.
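
Until the increase can be automated, a project bootstrap script can at least detect the low limit before it starts workloads. The following is a minimal sketch, assuming the JSON printed by `gcloud compute regions describe us-east4 --format=json`, which carries a `quotas` list of `metric`/`limit`/`usage` entries. The function name `cpu_headroom` and the sample values are illustrative only.

```python
import json


def cpu_headroom(region_json: str, metric: str = "CPUS") -> float:
    """Return limit minus usage for one quota metric of a region."""
    region = json.loads(region_json)
    for quota in region.get("quotas", []):
        if quota["metric"] == metric:
            return quota["limit"] - quota["usage"]
    raise KeyError(f"metric {metric!r} not found in region quotas")


# Sample shaped like the gcloud output; the numbers are made up.
sample = json.dumps({
    "name": "us-east4",
    "quotas": [
        {"metric": "CPUS", "limit": 24.0, "usage": 8.0},
        {"metric": "IN_USE_ADDRESSES", "limit": 8.0, "usage": 1.0},
    ],
})

needed = 32  # CPUs the workload is expected to use (assumption)
headroom = cpu_headroom(sample)
if headroom < needed:
    print(f"us-east4 has {headroom:.0f} free CPUs, need {needed}: "
          "request an increase or fall back to another region")
```

A check like this does not remove the manual step, but it lets the automation fail early or pick a region with a larger default limit.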

Related

Creating new GCP serverless vpc access connection, Error: Insufficient CPU quota in region

I'm trying to create a GCP Serverless VPC Access connector for my Cloud Functions.
The error message is below.
So I checked the quota of my project; it is shown below.
At first, I didn't have any VM instances, so there was no CPU usage.
After I created a new VM instance, a CPU quota of 8 showed up. Still, I get the same error.
Do I need to use another type of CPU for the VPC connector?
Please share your knowledge. Thank you.
The error is quite specific: the root cause is the CPU quota. There are two possible causes and two possible solutions.
The first possible cause is that the connector instances created by the gcloud command exceed the CPU quota of your project. The second is that there may be existing, hidden CPU resources in your project that need to be removed.
The first solution is to add a lower --max-instances value to the gcloud command you are using, which reduces the number of instances being created.
The second possible solution is a QIR (Quota Increase Request). Requesting a quota increase is free of charge; it only costs more if you actually use more resources as a result. For detailed instructions on how to increase quota from the Google Cloud Console, see Requesting a higher quota limit.
You can learn more about CPU quotas here.
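
The --max-instances suggestion above could look like the following sketch, which only builds the gcloud argument list and does not run it. The connector name, network, and IP range are hypothetical placeholders, and the bounds check (minimum 2, maximum 10 instances) reflects my understanding of the connector limits, so verify it against the current documentation.

```python
import shlex


def connector_create_command(name, region, network, ip_range,
                             min_instances=2, max_instances=3):
    """Build a gcloud command that creates a small VPC Access connector."""
    # Assumed limits: between 2 and 10 instances, with min strictly below max.
    if not 2 <= min_instances < max_instances <= 10:
        raise ValueError("need 2 <= min_instances < max_instances <= 10")
    return [
        "gcloud", "compute", "networks", "vpc-access", "connectors",
        "create", name,
        f"--region={region}",
        f"--network={network}",
        f"--range={ip_range}",
        f"--min-instances={min_instances}",
        f"--max-instances={max_instances}",
    ]


# Hypothetical values for illustration.
cmd = connector_create_command("my-connector", "us-central1",
                               "default", "10.8.0.0/28")
print(shlex.join(cmd))
```

Keeping the maximum as low as the workload allows reduces how much CPU quota the connector can claim when it scales out.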

ECS starting tasks sequentially though resources are available

In our ECS cluster setup with an ASG capacity provider, we have 5 EC2 instances, and each instance can take around 20 tasks, so overall there are resources available to run 100 tasks. If we submit a service with 100 tasks, not all tasks are started in parallel, even though there are enough resources. Tasks come up in batches of 20 with a gap of 10 seconds between batches; I observed this in the ECS service event logs. Is there any configuration we can tweak to achieve complete parallelism?
This behavior is due to an artificially controlled throughput (expressed in tasks per second, TPS) that the ECS service control plane imposes. There is a bursting concept in there, which is why you see a batch of tasks being launched followed by a gap of some seconds. These limits exist to avoid being throttled in other parts of the service's surface. They can be lifted if there is a strong need, but the engineering team will need to validate the use case and expectations (see the point about potentially hitting other limits). The best way to pursue this is to open a ticket with AWS Support and explore your alternatives based on your requirements.
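
To estimate how long a rollout takes under this kind of batching, the observed pattern can be modeled directly. This is a rough sketch that uses the numbers reported in the question (batches of 20, 10 seconds apart); they are observations from one cluster, not documented ECS limits, and the function name is illustrative.

```python
import math


def launch_schedule(total_tasks, batch_size, gap_seconds):
    """Return (start_offset_seconds, tasks_in_batch) for each batch."""
    batches = math.ceil(total_tasks / batch_size)
    schedule = []
    remaining = total_tasks
    for i in range(batches):
        size = min(batch_size, remaining)
        schedule.append((i * gap_seconds, size))
        remaining -= size
    return schedule


schedule = launch_schedule(total_tasks=100, batch_size=20, gap_seconds=10)
for offset, size in schedule:
    print(f"t+{offset:>3}s: start {size} tasks")
# The last batch starts at schedule[-1][0] seconds after the first.
```

With the question's numbers the last of the five batches starts 40 seconds after the first, which is the delay a support ticket would be asking to shorten.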

Not able to increase Quota for Compute Engine API in GCP

I have been trying to increase a quota for the Google Cloud Platform (GCP) Compute Engine API for a location, and it does not allow me to edit or even select the location.
I tried the same thing a few months ago and it worked properly then. I just created a new project and tried the same thing.
I do have the Owner permission assigned to me.
Since it turns out that you are on the Free Tier, this is one of its constraints.
Your free trial credit applies to all Google Cloud resources, with the following exceptions:
You can't have more than 8 cores (or virtual CPUs) running at the same time.
You can't add GPUs to your VM instances.
You can't request a quota increase. For an overview of Compute Engine quotas, see Resource quotas.
You can't create VM instances that are based on Windows Server images.
You must upgrade your account to perform any of the actions in the preceding list.
Upgrading to a paid account:
https://cloud.google.com/free/docs/gcp-free-tier#how-to-upgrade
Free Tier conditions:
https://cloud.google.com/free/docs/gcp-free-tier
Update: To be able to edit quotas or submit a quota increase request:
For a new project, you need to wait 48 hours.
You need to have billing enabled (click the gift icon at the top left and follow the steps to enable billing in GCP).
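
The conditions listed in this answer can be gathered into a simple pre-check before submitting a request. This is only a sketch of the rules as stated here (free trial, billing, 48-hour project age); the function and its parameters are hypothetical, and the real values would have to come from your own project metadata.

```python
from datetime import datetime, timedelta, timezone


def quota_request_blockers(created_at, billing_enabled, free_trial, now=None):
    """List the reasons, per the answer above, a quota request may be blocked."""
    now = now or datetime.now(timezone.utc)
    blockers = []
    if free_trial:
        blockers.append("account is on the free trial: upgrade to a paid account")
    if not billing_enabled:
        blockers.append("billing is not enabled on the project")
    if now - created_at < timedelta(hours=48):
        blockers.append("project is less than 48 hours old")
    return blockers


# Illustrative values: a one-day-old project on the free trial.
now = datetime(2024, 1, 3, tzinfo=timezone.utc)
created = datetime(2024, 1, 2, tzinfo=timezone.utc)
for reason in quota_request_blockers(created, billing_enabled=True,
                                     free_trial=True, now=now):
    print("blocked:", reason)
```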

Change GPU quota on Google Cloud Platform

I am trying to change the quota for the number of GPUs I can use on a project on the Google Cloud Platform. Thing is I've made requests before on a different account and they all went through.
This is a brand new account about 5 days old and even though I've upgraded my billing the requests are still denied. I reached out for feedback and the response was
Unfortunately, we are unable to grant your quota increase due to insufficient service usage history
I've reached out to their sales team, but they haven't gotten back to me yet. I've also tried stating in the justification for the request that I just created the account and upgraded billing, as mentioned here.
Does anyone know how to get requests to edit quotas on brand new accounts approved?
After some chatting with the Google Cloud Platform Billing Support, the basic answer is no, there's no way to increase quotas on a new account.
The last bit of correspondence and essentially the official response:
I was able to check with the downstream team the information |Support officer name here|
provided you and before your projects can get their GPU quota
increased it needs to accumulate more billing history. In my
experience I would recommend you to try again and request the quota
increase.
The issue is that new accounts don't have enough billing history or tenure to request quotas and one billing cycle has to pass in order for that information to be made available. I also asked if this is a policy that would change in the future to which the response was:
Google has a very sensitive policy specifically regarding the quotas
for GPU's, meaning this limitation on the product will continue to
work this way since the only way to generate tenure on an account is
by generating billing history.

GCP: Instance creation failed

I recently tried to create an instance group on the Google Cloud Platform (GCP) with 50 n1-standard-1 instances in zone us-east1-b, each with P100 GPUs. I requested and got approval for 200 P100 GPUs in this zone. My CPU, IP addresses, and Routes for this zone and globally all meet the quotas listed on this page.
However, right now, I'm only up to 21 of these 50 instances created, with the rest with a yellow hazard sign and the accompanying warning message: Instance 'instance-group-1-<name>' creation failed: The zone 'projects/<project>/zones/us-east1-b' does not have enough resources available to fulfill the request. '(resource type:compute)'.
Is there any place on the quotas page where I can get information on exactly which compute quota I forgot to ask more of? The error message is unfortunately not very descriptive.
Note: I suspect that this could be a reference to exceeding the Compute Engine API query limit of at most 2000 queries per 100 seconds. The 7 day peak usage column does show that I have exceeded it at peak times. However, my Current Usage is at less than 70 queries per 1000 seconds. When I look at my Compute Engine query usage graphed over time, it doesn't look like I have tripped the 2000 rate limit for several hours. However, the instance group still fails to populate fully to all 50 instances.
This is a typical error which means that at a certain point in time, the resources in 'us-east1-b' are not sufficient to scale your Instance Group even though you have sufficient quota. You have two alternatives:
1- try again later
2- request GPUs in another region/zone and deploy your IG there.
Google also recommends distributing your workloads across more than one region and zone.
For more information see this and this.
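
Both alternatives (retrying later and moving to another zone) can be combined in a small fallback loop. The following is a minimal sketch: `create_instance` is a hypothetical callable standing in for whatever client or gcloud wrapper you use, `ZoneExhausted` is a made-up exception for the "does not have enough resources" error, and the candidate zones must be ones where you already hold GPU quota.

```python
import time


class ZoneExhausted(Exception):
    """Raised by the hypothetical create_instance when a zone has no capacity."""


def create_with_fallback(zones, create_instance,
                         attempts_per_zone=2, wait_seconds=0.0):
    """Try each zone in order, retrying a few times before moving on."""
    for zone in zones:
        for _ in range(attempts_per_zone):
            try:
                return zone, create_instance(zone)
            except ZoneExhausted:
                time.sleep(wait_seconds)  # alternative 1: try again later
    raise ZoneExhausted("no capacity in any candidate zone")


# Fake creator for illustration: only us-east1-c has capacity.
def fake_create(zone):
    if zone != "us-east1-c":
        raise ZoneExhausted(zone)
    return f"instance-in-{zone}"


zone, instance = create_with_fallback(["us-east1-b", "us-east1-c"], fake_create)
print(zone, instance)
```

In real use a nonzero `wait_seconds` with some backoff is advisable, since capacity in a zone changes over time.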