
Preparing for the AI-102 Certification Exam at Year End

Background

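In the past, the company fully reimbursed cloud-related certificates like this one. Now, with the economy in a downturn and benefits being cut, it reimburses only half.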

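At the end of last year, the company announced training on AI cloud services. I applied through my manager, promised that it would not interfere with my work, and signed up for the training despite the pressure.

Now, at year end, while I still have some energy to spare, I have started preparing for the exam.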

2025-11-08 12:30:54

  1. Tests the API calls for adding a phrase list (AddPhrase)
  • AddPhraseListAsync
  • PhraselistCreateObject
  1. Tests the ability to deploy by migrating a model to a container
  • The model must be trained or published, thus 1.1
  • export model using export container (gzip)
  • run a container and mount model file
  1. Service selection:
  • QnA Maker
  • Language Understanding
  • Dispatch
  1. Service selection:
  • Form Recognizer
  • Computer Vision
  • Personalizer
  • Custom Vision
  1. Service selection and APIs
  • Use PUT to create a new resource and PATCH to update one
  • Cognitive Services covers OCR, Speech, Web Search, Decision, and Language
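To make the PUT vs. PATCH distinction concrete, here is a minimal sketch that only builds (and never sends) a management request for a Cognitive Services account. The URL layout follows the usual Azure Resource Manager pattern; the helper name, the sample IDs, the request body, and the `api_version` value are assumptions for illustration and should be checked against the current REST reference.

```python
import json
from urllib.request import Request

ARM_BASE = "https://management.azure.com"


def build_account_request(subscription_id, resource_group, account_name,
                          api_version, body, create):
    """Return an unsent request: PUT when creating, PATCH when updating."""
    url = (
        f"{ARM_BASE}/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        f"/providers/Microsoft.CognitiveServices/accounts/{account_name}"
        f"?api-version={api_version}"
    )
    method = "PUT" if create else "PATCH"
    return Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        method=method,
        headers={"Content-Type": "application/json"},
    )


# Hypothetical values; the api-version string is an assumption.
create_req = build_account_request(
    "sub-id", "rg1", "svc1", "2023-05-01",
    {"kind": "CognitiveServices", "sku": {"name": "S0"}, "location": "westeurope"},
    create=True,
)
update_req = build_account_request(
    "sub-id", "rg1", "svc1", "2023-05-01",
    {"tags": {"env": "test"}},
    create=False,
)
print(create_req.get_method(), update_req.get_method())
```

A real call would also need an Authorization header with a management token, which is left out here.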
  1. Principles for AI services
  • transparency
  • fairness
  • inclusiveness
  • reliability and safety
  • privacy and security
  1. Understanding of the Docker distribution workflow
  • Create
  • Build
  • Push
  • Distribute and run
  1. Understanding of the service endpoint and the billing endpoint

  2. computeVision vs. computeVision.prediction

  3. Distinguish the subscription key from the query key (the query key is used only by the query service)

  4. Form Recognizer training

  • jpg,png,pdf,tiff
  • <= 50 MB
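The two training constraints above (file type and size) can be checked before upload. This is a minimal sketch: the helper name is hypothetical, only the extensions listed in these notes are accepted, and treating 50 MB as 50 * 1024 * 1024 bytes is an assumption.

```python
from pathlib import Path

# Only the formats listed in the notes above.
ALLOWED_EXTENSIONS = {".jpg", ".png", ".pdf", ".tiff"}
# Assumption: "50 MB" is read as 50 * 1024 * 1024 bytes.
MAX_BYTES = 50 * 1024 * 1024


def is_valid_training_file(filename, size_bytes):
    """Return True when the file type and size fit the limits noted above."""
    extension = Path(filename).suffix.lower()
    return extension in ALLOWED_EXTENSIONS and 0 < size_bytes <= MAX_BYTES


print(is_valid_training_file("invoice-01.PDF", 2_000_000))
print(is_valid_training_file("scan.bmp", 2_000_000))
```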
  1. Azure Cognitive Search with server side encryption / CMK
  • Index Size Increase
  • Query times Increase
  • azure key vault is required.
  1. AI principle: transparency

  2. Network configuration: private endpoint with Private Link

  3. Network configuration: virtual network + IP firewall rule + Private Link

  4. Azure Private Link avoids the virtual network, whether private or public

  5. Service selection: Anomaly Detector

  6. Tests STT (speech to text): AudioStreamFormat.getCompressedFormat / SpeechRecognizer

  7. Technology selection: Face/Speech/Text Analytics

  8. QnA brings in two services: Azure Storage + Cognitive Search

  9. IAM - authoring resources

  10. The usual fix for throttling is to add replicas

  11. Speech to Text with AutoDetectSourceLanguageConfig/Text to Speech

  12. StartRecognizeReceiptsFromUri parameters

  13. a new Cognitive Services resource that uses the S0 pricing tier - save cost

  14. Adding indexes doesn't resolve throttling caused by increasing query volume

  15. Adding CMK doesn't resolve throttling caused by increasing query volume

  16. deploy service1 and a private endpoint to vnet1.

  17. For inclusiveness: Direct Line Speech is a robust, end-to-end solution for creating a flexible, extensible voice assistant. It is powered by the Bot Framework and its Direct Line

  18. How an API endpoint is composed
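As a small sketch of endpoint composition, the helpers below assemble the two base-URL shapes I am aware of (custom subdomain and regional) and append an operation path. The function names, the resource name `svc1`, and the sample path are placeholders rather than anything taken from the exam.

```python
def custom_subdomain_endpoint(resource_name):
    """Base URL for a resource that has a custom subdomain."""
    return f"https://{resource_name}.cognitiveservices.azure.com/"


def regional_endpoint(region):
    """Base URL for the shared regional endpoint."""
    return f"https://{region}.api.cognitive.microsoft.com/"


def join_endpoint(base, path):
    """Join a base URL and an operation path with exactly one slash."""
    return base.rstrip("/") + "/" + path.lstrip("/")


# Placeholder resource name and operation path.
url = join_endpoint(custom_subdomain_endpoint("svc1"), "/vision/v3.2/analyze")
print(url)
print(regional_endpoint("westeurope"))
```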

  19. Purchase Orders (POs) -> Form Recognizer -> OCR -> JSON (extracted info from POs) -> Shaper skill -> JSON -> Table Projection -> JSON -> Power BI

  20. Replicas are used to resolve throttling

  21. Face can detect facial features and attributes, such as face masks, glasses, and so on.

  22. Defining index fields for vector search

  • Retrievable: Include the category field in the search results.
  • Searchable: Ensure that users can search for words in the category field.
  • Facetable: Ensure that users can perform drill down filtering based on category.
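The three attributes above map onto boolean flags in an index field definition. The sketch below shows one such definition as a Python dict in the shape used by the search index JSON schema, plus a hypothetical helper that checks whether a field meets the three requirements; the `Edm.String` type is an assumption about the category field.

```python
import json

# Field definition for the category field; the type is assumed to be a string.
category_field = {
    "name": "category",
    "type": "Edm.String",
    "retrievable": True,   # returned in search results
    "searchable": True,    # included in full-text search
    "facetable": True,     # usable for drill-down filtering
}


def meets_requirements(field):
    """Check the three attributes the scenario above asks for."""
    return all(field.get(flag) is True
               for flag in ("retrievable", "searchable", "facetable"))


print(json.dumps(category_field, indent=2))
print(meets_requirements(category_field))
```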
  1. Azure Metrics Advisor
  • data monitoring and anomaly detection in time series data
  1. API: describeImageInStreamAsync

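  2. Change the domain first, then retrain and export

  3. Decisions that involve a person's financial situation call for human intervention (principles of fairness and accountability).

  4. Virtual network settings for ta1, rather than an NSG

  5. Metrics Advisor -> Multivariate Anomaly Detection (anomaly detection across multiple dimensions)

  6. Docker saves money, especially with limited connectivity and a need to minimise costs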

  7. create private endpoint + key authentication

  8. Change points vs. batch detection: batch detection works on the dataset vs. anomalies

  9. Steps to set up an on-premises service: k8s (on premise) connected to the internet; image from MCR; appID, secret + URL

  10. The Azure management API creates services through a subscription.

  11. When creating a Docker container instance, three settings are required: Eula, Billing, ApiKey

docker run --rm -it -p 5000:5000 --memory 4g --cpus 1 \
mcr.microsoft.com/azure-cognitive-services/decision/anomaly-detector:latest \
Eula=accept \
Billing={ENDPOINT_URI} \
ApiKey={API_KEY}
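A minimal sketch of assembling the same command programmatically, so that the three required settings cannot be forgotten. The helper name and the sample endpoint and key are hypothetical; the image name, port, memory, and CPU values are copied from the command above.

```python
import shlex

IMAGE = "mcr.microsoft.com/azure-cognitive-services/decision/anomaly-detector:latest"


def build_container_command(image, endpoint_uri, api_key,
                            host_port=5000, memory="4g", cpus=1):
    """Return the docker run argument list with Eula, Billing, and ApiKey set."""
    if not endpoint_uri or not api_key:
        raise ValueError("Billing endpoint and ApiKey are both required")
    return [
        "docker", "run", "--rm", "-it",
        "-p", f"{host_port}:5000",
        "--memory", memory,
        "--cpus", str(cpus),
        image,
        "Eula=accept",
        f"Billing={endpoint_uri}",
        f"ApiKey={api_key}",
    ]


# Placeholder endpoint and key, for illustration only.
command = build_container_command(
    IMAGE, "https://svc1.cognitiveservices.azure.com/", "00000000"
)
print(shlex.join(command))
```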
  1. To authenticate with a Microsoft Entra token, the Speech resource must have a custom subdomain and use a private endpoint. The Speech service uses custom subdomains with private endpoints only.

  2. 6 RPM -> 1,000 TPM (1 capacity unit)
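The ratio in the note above can be turned into a quick calculation. This sketch assumes the ratio scales linearly with capacity (1 unit = 1,000 tokens per minute = 6 requests per minute); the function names are hypothetical.

```python
TPM_PER_CAPACITY_UNIT = 1000
RPM_PER_CAPACITY_UNIT = 6


def tpm_for_capacity(units):
    """Tokens per minute for a given number of capacity units."""
    return units * TPM_PER_CAPACITY_UNIT


def rpm_for_capacity(units):
    """Requests per minute for a given number of capacity units."""
    return units * RPM_PER_CAPACITY_UNIT


def capacity_for_rpm(required_rpm):
    """Smallest whole number of units that covers the required request rate."""
    return -(-required_rpm // RPM_PER_CAPACITY_UNIT)  # ceiling division


print(tpm_for_capacity(1), rpm_for_capacity(1))
print(capacity_for_rpm(20), tpm_for_capacity(capacity_for_rpm(20)))
```

For example, a target of 20 requests per minute rounds up to 4 units, which corresponds to 4,000 tokens per minute under this assumption.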

  3. Signing: sensitive on-premises resources are provisioned and then run directly on premises.

  • Provision the Language service resource in Azure.
  • Deploy a Docker container to an on-premises server.
  • Identify the Language service endpoint URL and query the prediction endpoint.
  1. API key: when load balancing across AI resources, don't use a bearer token (that one is used for authentication)

  2. Emulated environment: Bot Framework SDK -> Bot Framework Emulator

  3. deployment name, endpoint, key

  4. IPsec rules (a channel already established between two IPs); a gateway also works but takes time; virtual network rules (directly restricting access works)

  5. Anomaly detector

  6. Building the command (docker)

  7. metric alerts in Azure Monitor, for data audioStreamFormat

  8. For large-scale video analysis, rather than choosing between a OneDrive download link and a share link, it is better to download the file and re-upload it.

  9. enable service endpoint/modify virtual network

  10. Microphone, portrait: plain speech to text is enough