このページは、まだ日本語ではご利用いただけません。翻訳中です。
構成
このプラグインはDB-lessモードと部分的に互換性があります。
The cluster strategy is not supported in DB-less and hybrid modes. For Kong
Gateway in DB-less or hybrid mode, use the redis strategy.
互換性のあるプロトコル
AI Rate Limiting Advancedプラグインは以下のプロトコルに対応しています:
 grpc, grpcs, http, https
パラメータ
このプラグインの設定で使用できるすべてのパラメータのリストは次のとおりです。
- 
    name or pluginstring requiredプラグイン名。この場合は ai-rate-limiting-advanced。- Kong Admin API、Kong Konnect API、宣言型構成、または decK ファイルを使用する場合、フィールドはnameです。
- Kubernetes で KongPlugin オブジェクトを使用する場合、フィールドはpluginです。
 
- Kong Admin API、Kong Konnect API、宣言型構成、または decK ファイルを使用する場合、フィールドは
- 
      instance_namestringプラグインのインスタンスを識別するための任意のカスタム名 (例: ai-rate-limiting-advanced_my-service。インスタンス名はKong ManagerとKonnectに表示されるので、 例えば複数のサービスで同じプラグインを複数のコンテキストで実行する場合に便利です。また、Kong Admin API経由で特定のプラグインインスタンスに アクセスするためにも使用できます。 インスタンス名は、次のコンテキスト内で一意である必要があります。 - Kong Gateway Enterpriseのワークスペース内
- Konnectのコントロールプレーン(CP)またはコントロールプレーン(CP)グループ内
- Kong Gateway (OSS)の全世界
 
- 
      service.name or service.idstringプラグインが対象とするサービス名または ID。最上位の /pluginsエンドポイント. からプラグインをサービスに追加する場合は、これらのパラメータのいずれかを設定してください/services/{serviceName|Id}/pluginsを使用する場合は必要ありません。
- 
      route.name or route.idstringプラグインがターゲットとするルート名または ID。最上位の /pluginsエンドポイント. を通るルートにプラグインを追加する場合は、これらのパラメータのいずれかを設定してください/routes/{routeName|Id}/pluginsを使用する場合は必要ありません。
- 
      consumer.name or consumer.idstringプラグインがターゲットとするコンシューマーの名前または ID。 最上位の /pluginsエンドポイント. からコンシューマーにプラグインを追加する場合は、これらのパラメーターのいずれかを設定してください/consumers/{consumerName|Id}/pluginsを使用する場合は必要ありません。
- 
      consumer_group.name or consumer_group.idstringプラグインが対象とするコンシューマグループの名前または ID。 設定されている場合、プラグインは指定されたグループが認証されているリクエストに対してのみアクティブになります /pluginsエンドポイント./consumer_groups/{consumerGroupName|Id}/pluginsを使用する場合は必要ありません。
- 
    enabledboolean default:trueこのプラグインが適用されるかどうか。 
- 
  configrecord required- 
  identifierstring required default:consumerMust be one of:ip,credential,consumer,service,header,path,consumer-groupThe type of identifier used to generate the rate limit key. Defines the scope used to increment the rate limiting counters. Can be ip,credential,consumer,service,header,pathorconsumer-group. Note ifidentifierisconsumer-group, the plugin must be applied on a consumer group entity. Because a consumer may belong to multiple consumer groups, the plugin needs to know explicitly which consumer group to limit the rate.
 - 
  window_typestring default:slidingMust be one of:fixed,slidingSets the time window type to either sliding(default) orfixed. Sliding windows apply the rate limiting logic while taking into account previous hit rates (from the window that immediately precedes the current) using a dynamic weight. Fixed windows consist of buckets that are statically assigned to a definitive time range, each request is mapped to only one fixed window based on its timestamp and will affect only that window’s counters.
 - 
  sync_ratenumberHow often to sync counter data to the central data store. A value of 0 results in synchronous behavior; a value of -1 ignores sync behavior entirely and only stores counters in node memory. A value greater than 0 will sync the counters in the specified number of seconds. The minimum allowed interval is 0.02 seconds (20ms). 
 - 
  llm_providersarray of typerecordrequiredThe provider config. Takes an array of name,limitandwindow sizevalues.- 
  window_sizearray of typenumberrequiredOne or more window sizes to apply a limit to (defined in seconds). There must be a matching number of window limits and sizes specified. 
 - 
  namestring required Must be one of:openai,azure,anthropic,cohere,mistral,llama2,bedrock,gemini,huggingface,requestPromptThe LLM provider to which the rate limit applies. 
 - 
  limitarray of typenumberrequiredOne or more requests-per-window limits to apply. There must be a matching number of window limits and sizes specified. 
 
- 
  
 - 
  strategystring required default:localMust be one of:cluster,redis,localThe rate-limiting strategy to use for retrieving and incrementing the limits. Available values are: localandcluster.
 - 
  dictionary_namestring required default:kong_rate_limiting_countersThe shared dictionary where counters are stored. When the plugin is configured to synchronize counter data externally (that is config.strategyisclusterorredisandconfig.sync_rateisn’t-1), this dictionary serves as a buffer to populate counters in the data store on each synchronization cycle.
 - 
  hide_client_headersboolean default:falseOptionally hide informative response headers that would otherwise provide information about the current status of limits and counters. 
 - 
  retry_after_jitter_maxnumber default:0The upper bound of a jitter (random delay) in seconds to be added to the Retry-Afterheader of denied requests (status =429) in order to prevent all the clients from coming back at the same time. The lower bound of the jitter is0; in this case, theRetry-Afterheader is equal to theRateLimit-Resetheader.
 - 
  header_namestringA string representing an HTTP header name. 
 - 
  pathstring starts_with:/A string representing a URL path, such as /path/to/resource. Must start with a forward slash (/) and must not contain empty segments (i.e., two consecutive forward slashes). 
 - 
  redisrecord required- 
  hoststring default:127.0.0.1A string representing a host name, such as example.com. 
 - 
  portinteger default:6379between:065535An integer representing a port number between 0 and 65535, inclusive. 
 - 
  connect_timeoutinteger default:2000between:02147483646An integer representing a timeout in milliseconds. Must be between 0 and 2^31-2. 
 - 
  send_timeoutinteger default:2000between:02147483646An integer representing a timeout in milliseconds. Must be between 0 and 2^31-2. 
 - 
  read_timeoutinteger default:2000between:02147483646An integer representing a timeout in milliseconds. Must be between 0 and 2^31-2. 
 - 
  usernamestring referenceableUsername to use for Redis connections. If undefined, ACL authentication won’t be performed. This requires Redis v6.0.0+. To be compatible with Redis v5.x.y, you can set it to default.
 - 
  passwordstring referenceable encryptedPassword to use for Redis connections. If undefined, no AUTH commands are sent to Redis. 
 - 
  sentinel_usernamestring referenceableSentinel username to authenticate with a Redis Sentinel instance. If undefined, ACL authentication won’t be performed. This requires Redis v6.2.0+. 
 - 
  sentinel_passwordstring referenceable encryptedSentinel password to authenticate with a Redis Sentinel instance. If undefined, no AUTH commands are sent to Redis Sentinels. 
 - 
  databaseinteger default:0Database to use for the Redis connection when using the redisstrategy
 - 
  keepalive_pool_sizeinteger default:256between:12147483646The size limit for every cosocket connection pool associated with every remote server, per worker process. If neither keepalive_pool_sizenorkeepalive_backlogis specified, no pool is created. Ifkeepalive_pool_sizeisn’t specified butkeepalive_backlogis specified, then the pool uses the default value. Try to increase (e.g. 512) this value if latency is high or throughput is low.
 - 
  keepalive_backloginteger between:02147483646Limits the total number of opened connections for a pool. If the connection pool is full, connection queues above the limit go into the backlog queue. If the backlog queue is full, subsequent connect operations fail and return nil. Queued operations (subject to set timeouts) resume once the number of connections in the pool is less thankeepalive_pool_size. If latency is high or throughput is low, try increasing this value. Empirically, this value is larger thankeepalive_pool_size.
 - 
  sentinel_masterstringSentinel master to use for Redis connections. Defining this value implies using Redis Sentinel. 
 - 
  sentinel_rolestring Must be one of:master,slave,anySentinel role to use for Redis connections when the redisstrategy is defined. Defining this value implies using Redis Sentinel.
 - 
  sentinel_nodesarray of typerecordlen_min:1Sentinel node addresses to use for Redis connections when the redisstrategy is defined. Defining this field implies using a Redis Sentinel. The minimum length of the array is 1 element.- 
  hoststring required default:127.0.0.1A string representing a host name, such as example.com. 
 - 
  portinteger default:6379between:065535An integer representing a port number between 0 and 65535, inclusive. 
 
- 
  
 - 
  cluster_nodesarray of typerecordlen_min:1Cluster addresses to use for Redis connections when the redisstrategy is defined. Defining this field implies using a Redis Cluster. The minimum length of the array is 1 element.- 
  ipstring required default:127.0.0.1A string representing a host name, such as example.com. 
 - 
  portinteger default:6379between:065535An integer representing a port number between 0 and 65535, inclusive. 
 
- 
  
 - 
  sslboolean default:falseIf set to true, uses SSL to connect to Redis. 
 - 
  ssl_verifyboolean default:falseIf set to true, verifies the validity of the server SSL certificate. If setting this parameter, also configure lua_ssl_trusted_certificateinkong.confto specify the CA (or server) certificate used by your Redis server. You may also need to configurelua_ssl_verify_depthaccordingly.
 - 
  server_namestringA string representing an SNI (server name indication) value for TLS. 
 - 
  cluster_max_redirectionsinteger default:5Maximum retry attempts for redirection. 
 - 
  connection_is_proxiedboolean default:falseIf the connection to Redis is proxied (e.g. Envoy), set it true. Set thehostandportto point to the proxy address.
 
- 
  
 - 
  disable_penaltyboolean default:falseIf set to true, this doesn’t count denied requests (status =429). If set tofalse, all requests, including denied ones, are counted. This parameter only affects theslidingwindow_type and the request prompt provider.
 - 
  request_prompt_count_functionstringIf defined, it use custom function to count requests for the request prompt provider 
 - 
  error_codenumber default:429Set a custom error code to return when the rate limit is exceeded. 
 - 
  error_messagestring default:AI token rate limit exceeded for provider(s):Set a custom error message to return when the rate limit is exceeded. 
 - 
  error_hide_providersboolean default:falseOptionally hide informative response that would otherwise provide information about the provider in the error message. 
 - 
  tokens_count_strategystring required default:total_tokensMust be one of:total_tokens,prompt_tokens,completion_tokens,costWhat tokens to use for cost calculation. Available values are: total_tokensprompt_tokens,completion_tokensorcost.
 - 
  llm_formatstring default:openaiMust be one of:openai,bedrock,geminiLLM input and output format and schema to use 
 
- 
  
 
                      