Azure AI Fundamentals AI-900 Exam Preparation

Azure AI Fundamentals AI-900 Exam Prep PRO



Azure AI Fundamentals AI-900 Exam Preparation: The AI-900 exam is an opportunity to demonstrate knowledge of common ML and AI workloads and how to implement them on Azure. It is intended for candidates with both technical and non-technical backgrounds. Data science and software engineering experience is not required; however, some general programming knowledge or experience is beneficial.

Azure AI Fundamentals can be used to prepare for other Azure role-based certifications like Azure Data Scientist Associate or Azure AI Engineer Associate, but it’s not a prerequisite for any of them.

This Azure AI Fundamentals AI-900 Exam Preparation App provides basic and advanced machine learning quizzes and practice exams on Azure, Azure machine learning job interview questions and answers, and machine learning cheat sheets.

Download Azure AI 900 on iOS

Download Azure AI 900 on Windows 10/11

Azure AI Fundamentals Exam Prep

Azure AI Fundamentals AI-900 Exam Preparation App Features:

– Azure AI-900 Questions and Detailed Answers and References

– Machine Learning Basics Questions and Answers

– Machine Learning Advanced Questions and Answers

– NLP and Computer Vision Questions and Answers

– Scorecard

– Countdown timer

– Machine Learning Cheat Sheets

– Machine Learning Interview Questions and Answers

– Machine Learning Latest News

Azure AI Fundamentals AI-900 Exam Preparation — Azure AI 900 – Machine Learning

This Azure AI Fundamentals AI-900 Exam Prep App covers:

ML implementation and Operations,
Describe Artificial Intelligence workloads and considerations,
Describe fundamental principles of machine learning on Azure,
Describe features of computer vision workloads on Azure,
Describe features of Natural Language Processing (NLP) workloads on Azure,
Describe features of conversational AI workloads on Azure,
QnA Maker service, Language Understanding service (LUIS), Speech service, Translator Text service, Form Recognizer service, Face service, Custom Vision service, Computer Vision service, facial detection, facial recognition, and facial analysis solutions, optical character recognition solutions, object detection solutions, image classification solutions, Azure Machine Learning designer, automated ML UI, conversational AI workloads, anomaly detection workloads, forecasting workloads, features of anomaly detection workloads, Kafka, SQL, NoSQL, Python, linear regression, logistic regression, Sampling, dataset, statistical interaction, selection bias, non-Gaussian distribution, bias-variance trade-off, Normal Distribution, correlation and covariance, Point Estimates and Confidence Interval, A/B Testing, p-value, statistical power of sensitivity, over-fitting and under-fitting, regularization, Law of Large Numbers, Confounding Variables, Survivorship Bias, univariate, bivariate and multivariate, Resampling, ROC curve, TF/IDF vectorization, Cluster Sampling, etc.
This App can help you:
– Identify features of common AI workloads
– Identify prediction/forecasting workloads
– Identify features of anomaly detection workloads
– Identify computer vision workloads
– Identify natural language processing or knowledge mining workloads
– Identify conversational AI workloads
– Identify guiding principles for responsible AI
– Describe considerations for fairness in an AI solution
– Describe considerations for reliability and safety in an AI solution
– Describe considerations for privacy and security in an AI solution
– Describe considerations for inclusiveness in an AI solution
– Describe considerations for transparency in an AI solution
– Describe considerations for accountability in an AI solution
– Identify common types of computer vision solutions
– Identify Azure tools and services for computer vision tasks
– Identify features and uses for key phrase extraction
– Identify features and uses for entity recognition
– Identify features and uses for sentiment analysis
– Identify features and uses for language modeling
– Identify features and uses for speech recognition and synthesis
– Identify features and uses for translation
– Identify capabilities of the Text Analytics service
– Identify capabilities of the Language Understanding service (LUIS)
– etc.


Azure AI Fundamentals Breaking News – Azure AI Fundamentals Certifications Testimonials

AZ-900 Exam
by /u/mikkelmr135 (Microsoft Azure) on April 25, 2024 at 9:46 am
Hi there! I have a quick question about the AZ-900 exam. I'm finding so many conflicting answers about how much you need to study and how hard the exam actually is. I have gone through all the Microsoft Learn material and consistently score above 90% on the practice assessments. Would that be enough, or do you recommend studying more via YouTube and similar resources? 🙂
Authenticator Registration prompt when excluded
by /u/Suited043 (Microsoft Azure) on April 25, 2024 at 9:36 am
Hi all, I recently started a new job and have been asked to roll out company-wide MFA. The request was to exclude trusted locations from the policy. I know this isn't the Zero Trust way, but I've got to do it. We want to roll out MFA with a grace period during which people can opt in. About half of our users work in the office, which will be registered as a trusted location and therefore excluded. The one thing I'm not completely satisfied with is that users who are excluded from MFA because of a trusted location do not get the prompt to register their authenticator. Is there a way to force people to register for MFA regardless of whether they are excluded?
Time Sync for > Using AzureAD Group in Builtin Administrators group
by /u/Freddo_aka_5ilv4 (Microsoft Azure) on April 25, 2024 at 8:11 am
Hi all, I recently started managing local administrators with an Intune policy that replaces the local administrators. We add the built-in administrator account and the SID of a group from Azure AD. Honestly, it works well, but waiting for all the Azure/Intune syncs is a pain, right? I would like to know whether anyone already uses this method and, if so: How long does the sync take? How can I force it? Thanks, mates!
Admin consent - PnP Management Shell - a LOT of permissions requested
by /u/dkarlq (Microsoft Azure) on April 25, 2024 at 8:01 am
Hi, I got an admin consent request for an application that I would like a second opinion on. It requires a LOT of permissions, and the user who requested it wants it to deploy some SharePoint templates. From googling, it looks like an open-source project that Microsoft refers to (but does not manage). Any takes on this? AppID: 31359c7f-bd7e-475c-86db-fdb8c937548e Permissions: https://preview.redd.it/je0sik021lwc1.png?width=328&format=png&auto=webp&s=68db8a8a42587e88ede9a3202fcb2f9da39dc5fd
P2S VPN Gateway for Microsoft Entra ID Authentication
by /u/rijoskill (Microsoft Azure) on April 25, 2024 at 5:27 am
File Sync for DR
by /u/superuseradmin1 (Microsoft Azure) on April 25, 2024 at 2:35 am
I'm looking to create a DR solution to protect some on-prem file shares we have at 3 different locations. Each location has one file server on site that is specific to that site. My goal is to get users up and running with their files in the share if the server goes down. I have a storage account set up with 3 file shares inside it, and I plan to name each file share after its site. For the one I plan to test with, I have File Sync set up with a sync group. I'm at the point of installing the File Sync agent on the on-prem server and testing a sync to the Azure storage. My initial thought is to set up a Windows server in Azure with a disk big enough to hold the on-prem file share, and then sync between the Azure Windows server and the on-prem one. Then, if on-prem goes down, I would change Group Policy and send users to the server in Azure. My question is whether this is the best way to do it. I think it should work fine, but is creating a new server in Azure for each site going overboard? Is there a better way to get users to their synced file share, such as connecting directly to the storage? Fewer servers to manage is always a good thing if possible. My end goal is to have the shares live in Azure and have users go straight to the Azure file share, but that could be a bit down the road. Thanks much. Edit: I should add that I am very new to Azure, and our on-prem file share uses NTFS permissions and security groups to control access to certain folders and files on the share.
Can't access isolated VM via Bastion
by /u/_benwa (Microsoft Azure) on April 25, 2024 at 2:13 am
I have an isolated vnet that has a Bastion host in it to access a recovery test Domain Controller. Unfortunately, I can't seem to actually connect to it. The Bastion subnet doesn't have an NSG and the VM subnet has a rule allowing RDP for the Bastion subnet. Any ideas on what is wrong?
Using Azure Functions to handle connection to Cosmos DB and website
by /u/dot_equals (Microsoft Azure) on April 25, 2024 at 1:08 am
-SOLVED- Hi, I am having some issues getting my Azure Function to safely and securely retrieve a key or secret from my vault. At the moment I am just trying to figure out what type of auth_level I need and how to manage secure connections. Do I need to set up some sort of certificate to approve the handshake? I incorrectly assumed that all the authentication would be done within Azure once I set up the IAM giving the function read/write access to the vault. I have been reading a lot of documentation and asking ChatGPT, but alas, here I am. Please let me know what information I need to provide to get a decent answer.
673 on my AZ-305
by /u/icebreaker374 (Microsoft Azure Certifications) on April 24, 2024 at 11:23 pm
I kinda took the AZ-305 on a whim after having passed the 9/7/500 and 104 in the last week with minimal studying. SO much more SQL and DB related stuff than I was expecting. I feel mildly discouraged by the fact that I got that close. What'd you all use for prep?
Linked Backend with Static Web App
by /u/jayc12345678901 (Microsoft Azure) on April 24, 2024 at 11:13 pm
I’ve been running into kind of a strange issue with Static Web Apps and App Service. I have my front end deployed with Static Web Apps, and the static web app is behind a private endpoint. I have a separate App Service that I deploy my API on. Using the SWA “API” blade, I am able to link my App Service to the static web app. In theory this should let me use domain.com/api as the URL of my API, without using the autogenerated domain name given to my App Service directly. However, I keep getting a 403 when navigating to domain.com/api, whereas I expect to see a JSON response for that endpoint. I have no auth required for my App Service, and both the static web app and the App Service work on their own. I’m just confused about this 403.
Help with AzureML: Deleted a compute instance that was running with disk full (and costs)
by /u/augustcs (Microsoft Azure) on April 24, 2024 at 10:35 pm
I was using a compute instance to train a model, or at least trying to. I changed something (another image) in my Docker environment, and when I then ran the job I got an error saying that the compute's disk was full and it was in an unusable state: "Operating system disk has run out of disk space. Clear at least 5 GB disk space on OS disk (/dev/sda1/ filesystem mounted on /) through the terminal by removing files/folders, and then do sudo reboot. You can check available disk space by running df -h. For more details refer to https://aka.ms/cidiskfull" The compute was still running. However, I couldn't access the terminal (in the notebook section in Studio); it just said it 'wasn't available', or 'Current terminal is encountering some issues, please switch compute or restart your current compute and retry.' I then panicked, because I do this for a company that is very strict about costs, and the compute was still running, I could not stop it, and it was presumably still accruing costs. So I deleted the instance. But I don't know whether I have really deleted it, or whether it is still running somewhere and we will still be charged. How do I proceed from here? I'm relatively new to this and not that technically versed, as you might tell. Thanks in advance!
A single consumer to read from multiple event hubs
by /u/PlusBasket9721 (Microsoft Azure) on April 24, 2024 at 10:02 pm
Hey! I have an enterprise-level application with multiple microservices; some of the background tasks and communication are handled by Event Hubs. The current implementation uses Event Hubs through the Kafka layer, but I am exploring the option of removing the Kafka layer and using the Azure SDK directly. I need help with one thing: can a single consumer running in one of my microservices read/consume events from multiple event hubs using the Azure SDK? I was able to get this working with the Kafka layer, but I am unable to find proper examples with the Azure SDK. Thanks in advance.
Migrate/Extend Entra ID to Entra Domain Services
by /u/theamadelorean (Microsoft Azure) on April 24, 2024 at 9:37 pm
We have a client that currently uses cloud-only accounts set up in Entra ID. Their devices are all Entra joined and authenticate that way. It's about a 350-person organization. We are going to try to implement an Azure SMB file share (the files are currently on SPO), but we will need a domain environment set up for authentication and security groups. Looking through the MS documentation, it looks like we'll have to have everyone reset their password when moving from Entra ID to Entra DS before we can use it. Has anyone gone through this process? I'm not necessarily opposed to setting up on-prem AD if that route is less of a headache, but I doubt it is.
Azure Certification Path
by /u/DaveC2020 (Microsoft Azure Certifications) on April 24, 2024 at 9:25 pm
I achieved the MS-900 certification last week and already have AZ-900. I’m thinking of going for AZ-104 next, but is it also worth looking into the next certification after MS-900?
DevOps Migration (Tenant to Tenant)
by /u/mattmak22 (Microsoft Azure) on April 24, 2024 at 9:17 pm
Hello, does anyone have any experience migrating a Microsoft DevOps environment from one Azure tenant to another? I saw there is a way to "bind" DevOps to a different Entra AD environment; is this all there is? I've tried digging around for info and haven't come up with much. Thank you in advance!
Management Groups: Working with business units and environments.
by /u/DevManTim (Microsoft Azure) on April 24, 2024 at 9:10 pm
Following the Azure CAF model, a management group tree starts with root, then an intermediary, then platform, landing zones, decom, sandbox, etc. Within the landing zones we have business units, or a logical association of subs to an organizational unit. While not illustrated in CAF, you would usually have multiple subscriptions per org unit based on environment. So, taking the diagram below, inside the Corp MG you'd have a dev / tst / prd subscription. The challenge is: how do you apply policy effectively in this structure when your environments are nested within an org unit? Consider this use case: at an enterprise level we want to enforce a certain set of Azure policies in all production subscriptions. However, because there could be a prod subscription within SAP, Corp and Online, we'd have to apply Azure Policy to those subscriptions individually, instead of setting it at an MG and allowing it to trickle down. Any thoughts? https://preview.redd.it/joab7hmtlhwc1.png?width=725&format=png&auto=webp&s=c75857c04391020b6b878c9725c89d5eff99c675
AZ 104
by /u/Dutchy2023 (Microsoft Azure Certifications) on April 24, 2024 at 8:09 pm
Stupid question: is it possible to get the AZ-104 certificate in 3 weeks at around 4-5 hours a day? I just lost my job and I'm done with being a Support Engineer. I'm looking for something like a Junior Cloud Engineer role.
MC761220 - Adding required endpoints for provisioning cloud PCs
by /u/JustOneMoreMile (Microsoft Azure) on April 24, 2024 at 7:10 pm
I have a client that is leveraging cloud PCs in M365. They received a letter saying that they need to ensure hm-iot-in-4-prod-prna01.azure-devices.net is reachable. I've looked all over the 365 tenant and I cannot for the life of me find where to add the exceptions. The ANC health checks are passing, but the client is getting this in an email every day. Summary: Azure network connection checks have failed and is potentially impacting over 1 Azure network connections and blocking the provisioning of new Cloud PCs.
External tables not visible in SQL Pool
by /u/Co2Mtl (Microsoft Azure) on April 24, 2024 at 6:28 pm
Hello, I'm trying to give a user access, through the SQL pool, to the external tables that are in the data lake. But it doesn't work: the user cannot see them in SSMS. He can see the database itself (Silver), but the external tables folder remains empty. We are working with Azure Synapse. Access to the Bronze layer is OK, but that is different since it is a SQL database. For the enriched database (Silver), I gave his group access to the files in the container (Storage/Container in the Azure portal) with Read and Execute permissions. I applied these permissions to the right folder only, so nothing at the root level (not even on the container itself; is that correct?). But I added his group as a Contributor on the storage. Nothing was applied (AFAIK) at the SQL level (GRANT permissions). Any help would be appreciated, thanks!
General availability: Application Gateway Web Application Firewall (WAF) inspection limit & size enforcement
by Azure service updates on April 24, 2024 at 6:00 pm
Azure’s regional Web Application Firewall (WAF) running on Application Gateway now supports greater control over request body inspection, and maximum size limits for request bodies and file uploads.




March 17, 2022 / May 19, 2022

AWS Machine Learning Certification Specialty Exam Prep

AWS Machine Learning Specialty Certification Prep (Android)


The AWS Certified Machine Learning Specialty validates expertise in building, training, tuning, and deploying machine learning (ML) models on AWS.

Use this App to learn about Machine Learning on AWS and prepare for the AWS Machine Learning Specialty Certification MLS-C01.

Download AWS Machine Learning Specialty Exam Prep App on iOS

Download AWS Machine Learning Specialty Exam Prep App on Android/Web/Amazon

AWS MLS-C01 Machine Learning Specialty Exam Prep PRO


The App provides hundreds of quizzes and practice exams on:

– Machine Learning Operations on AWS

– Modelling

– Data Engineering

– Computer Vision

– Exploratory Data Analysis

– ML Implementation & Operations

– Machine Learning Basics Questions and Answers

– Machine Learning Advanced Questions and Answers

– Scorecard

– Countdown timer

– Machine Learning Cheat Sheets

– Machine Learning Interview Questions and Answers

– Machine Learning Latest News

The App covers Machine Learning Basics and Advanced topics including: NLP, Computer Vision, Python, linear regression, logistic regression, Sampling, dataset, statistical interaction, selection bias, non-Gaussian distribution, bias-variance trade-off, Normal Distribution, correlation and covariance, Point Estimates and Confidence Interval, A/B Testing, p-value, statistical power of sensitivity, over-fitting and under-fitting, regularization, Law of Large Numbers, Confounding Variables, Survivorship Bias, univariate, bivariate and multivariate, Resampling, ROC curve, TF/IDF vectorization, Cluster Sampling, etc.

Domain 1: Data Engineering

Create data repositories for machine learning.

Identify data sources (e.g., content and location, primary sources such as user data)

Determine storage mediums (e.g., DB, Data Lake, S3, EFS, EBS)

Identify and implement a data ingestion solution.

Data job styles/types (batch load, streaming)

Data ingestion pipelines (Batch-based ML workloads and streaming-based ML workloads), etc.

Domain 2: Exploratory Data Analysis

Sanitize and prepare data for modeling.

Perform feature engineering.

Analyze and visualize data for machine learning.

Domain 3: Modeling

Frame business problems as machine learning problems.

Select the appropriate model(s) for a given machine learning problem.

Train machine learning models.

Perform hyperparameter optimization.

Evaluate machine learning models.

Domain 4: Machine Learning Implementation and Operations

Build machine learning solutions for performance, availability, scalability, resiliency, and fault tolerance.

Recommend and implement the appropriate machine learning services and features for a given problem.

Apply basic AWS security practices to machine learning solutions.

Deploy and operationalize machine learning solutions.

Machine Learning Services covered:

Amazon Comprehend

AWS Deep Learning AMIs (DLAMI)

AWS DeepLens

Amazon Forecast

Amazon Fraud Detector

Amazon Lex

Amazon Polly

Amazon Rekognition

Amazon SageMaker

Amazon Textract

Amazon Transcribe

Amazon Translate

Other Services and topics covered are:

Ingestion/Collection

Processing/ETL

Data analysis/visualization

Model training

Model deployment/inference

Operational

AWS ML application services

Languages relevant to ML (for example, Python, Java, Scala, R, SQL)

Notebooks and integrated development environments (IDEs)

S3, SageMaker, Kinesis, Lake Formation, Athena, Kibana, Redshift, Textract, EMR, Glue; CSV, JSON, IMG, Parquet, or databases

Amazon EC2, Amazon Elastic Container Registry (Amazon ECR), Amazon Elastic Container Service, Amazon Elastic Kubernetes Service, Amazon Redshift

Important: To succeed with the real exam, do not memorize the answers in this app. It is very important that you understand why a question is right or wrong and the concepts behind it by carefully reading the reference documents in the answers.

Note and disclaimer: We are not affiliated with Microsoft or Azure or Google or Amazon. The questions are put together based on the certification study guide and materials available online. The questions in this app should help you pass the exam but it is not guaranteed. We are not responsible for any exam you did not pass.


[D] What is the best TTS model for my case?
by /u/hwk06023 (Machine Learning) on April 25, 2024 at 8:07 am
Hi, here is a newbie's question. My biggest concern is generation speed: I want to generate about 5 seconds of voice in about 100 ms. I want to know which model performs best (SOTA) under those conditions. Which model is best for me? I think "styletts2" is the best. If you have any relevant experience or know of any other information, I would really appreciate your help. Thank you!
[D] Exploring Complex Number Representations for Word Vectors: A New Approach
by /u/_mayuk (Machine Learning) on April 25, 2024 at 3:21 am
Word embeddings like Word2Vec and GloVe have revolutionized natural language processing, offering compact and dense representations of word meanings. However, these embeddings typically represent words as real-valued vectors, potentially limiting their ability to capture complex semantic relationships. In this proposal, we explore an alternative approach: representing word vectors as complex numbers. We propose converting Word2Vec or GloVe vectors into complex numbers, where the real part captures magnitude and the imaginary part encodes additional semantic information. For instance, consider the word vector Vecword = [0.2, −0.3, 0.5, 0.1, −0.2]. We can convert this vector into a complex number z as follows: z = Vecword[0] + i × Vecword[1]. Here, i is the imaginary unit. The real part of the complex number represents the magnitude of the word's meaning (0.2), while the imaginary part (−0.3i) captures additional semantic nuances. This approach offers several potential advantages: Enhanced Semantic Representation: Complex numbers can capture both magnitude and phase, allowing for richer semantic representations compared to real-valued vectors. Contextual Information: By encoding semantic information in the imaginary part, we can capture contextual nuances that may be missed by traditional embeddings. Compatibility: Complex number representations can be seamlessly integrated into existing models and frameworks, offering a straightforward extension to current NLP pipelines. Exploring complex number representations for word vectors presents an exciting avenue for enhancing semantic understanding in natural language processing tasks. By leveraging the unique properties of complex numbers, we can potentially unlock deeper insights into the structure and meaning of language. This proposal aims to spark further research and experimentation in this promising direction. Join us as we delve into the fascinating world of complex semantics!
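The formula in the post above can be worked through in a few lines of Python. This is only an illustrative sketch: the names vec_word and paired are made up here, and pairing consecutive components is one hypothetical way to extend the post's single-number formula to the whole vector. Note that in the usual polar sense the magnitude of z is |z| (about 0.36 for this example), not the real part alone, and the phase is the angle of z.

```python
import cmath

# Example vector from the post (illustrative values, not real embeddings).
vec_word = [0.2, -0.3, 0.5, 0.1, -0.2]

# The post's formula: z = Vecword[0] + i * Vecword[1]
z = complex(vec_word[0], vec_word[1])

# Standard polar quantities of z.
magnitude = abs(z)        # |z| = sqrt(re^2 + im^2)
phase = cmath.phase(z)    # angle in radians, in (-pi, pi]

# Hypothetical extension (an assumption, not from the post): pair up
# consecutive components. With an odd-length vector the last component
# has no partner and is dropped by zip().
paired = [complex(re, im) for re, im in zip(vec_word[0::2], vec_word[1::2])]

print(z, magnitude, phase)
print(paired)
```

Seen this way, the conversion only regroups the existing real components; it does not add information that the original vector lacked.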
[R] French GEC dataset
by /u/R-e-v-e-r-i-e- (Machine Learning) on April 25, 2024 at 12:14 am
Hi, does anyone know of a French L2 GEC dataset (that was published at a conference)?
[D] tutorial on how to build streaming ML applications
by /u/clementruhm (Machine Learning) on April 24, 2024 at 10:16 pm
My primary expertise is audio processing, but I believe this task comes up in other domains too: running a model on chunks of an infinitely long input. While it is straightforward for some architectures, it can get tedious for convolutional nets. I put together a comprehensive tutorial on how to build streaming ML applications: https://balacoon.com/blog/streaming_inference/. I would be curious to learn whether it's a common problem and how people usually deal with it, because resources on the topic are surprisingly scarce.
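To make the chunking problem for convolutional layers concrete, here is a minimal sketch (not taken from the linked tutorial) of a single 1D convolution processed chunk by chunk. The class name StreamingConv1d and the helper conv1d_valid are hypothetical; the idea shown is simply to carry the last k-1 input samples over as state, so that the concatenated chunk outputs match what the layer would produce on the whole signal.

```python
def conv1d_valid(signal, kernel):
    """Plain 'valid' 1D correlation: one output per full kernel window."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


class StreamingConv1d:
    """Chunk-by-chunk version that keeps the last k-1 samples as state."""

    def __init__(self, kernel):
        self.kernel = list(kernel)
        self.tail = []  # samples carried over from previous chunks

    def process(self, chunk):
        buffered = self.tail + list(chunk)
        out = conv1d_valid(buffered, self.kernel)  # empty if too short
        k = len(self.kernel)
        # Keep only the context that the next chunk still needs.
        self.tail = buffered[-(k - 1):] if k > 1 else []
        return out


signal = [float(x) for x in range(10)]
kernel = [0.5, 1.0, -0.5]

# Reference: process the whole signal at once.
full_output = conv1d_valid(signal, kernel)

# Streaming: feed uneven chunks (sizes 1, 4, 2, 3) and concatenate outputs.
stream = StreamingConv1d(kernel)
streamed_output = []
for start, stop in [(0, 1), (1, 5), (5, 7), (7, 10)]:
    streamed_output.extend(stream.process(signal[start:stop]))

print(streamed_output == full_output)
```

A stack of such layers would need one carried-over buffer per layer, which is where the bookkeeping starts to get tedious.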
[D] Why is R^2 so crazy?
by /u/Cloverdover1 (Machine Learning) on April 24, 2024 at 9:40 pm
https://preview.redd.it/jpiyt4b9yhwc1.png?width=1165&format=png&auto=webp&s=95d80f8f9c9241d722717ad25215be4077d541ca Based on the MSE, it looks good, right? But why does my R^2 start off so negative and approach 0? Could it be a bug in how I am calculating it? This happened after I min-max scaled the labels before training. This is an LSTM that predicts runs scored in baseball games.
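One way to reason about the question above is to compute both metrics by hand on toy numbers. The sketch below uses made-up labels and a made-up constant prediction (none of it comes from the poster's model). It illustrates two general facts: R^2 = 1 - SS_res / SS_tot is negative whenever the predictions are worse than always predicting the label mean, and min-max scaling the labels shrinks the MSE while leaving R^2 unchanged, so a small MSE on scaled labels says little on its own.

```python
def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def min_max(values, lo, hi):
    """Scale values to [0, 1] using fixed bounds lo and hi."""
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical data: true runs scored, and a poor constant prediction.
runs = [2.0, 4.0, 5.0, 7.0, 9.0]
preds = [1.0, 1.0, 1.0, 1.0, 1.0]

lo, hi = min(runs), max(runs)
runs_scaled = min_max(runs, lo, hi)
preds_scaled = min_max(preds, lo, hi)  # same transform as the labels

print("raw    MSE:", mse(runs, preds), " R^2:", r2(runs, preds))
print("scaled MSE:", mse(runs_scaled, preds_scaled),
      " R^2:", r2(runs_scaled, preds_scaled))
```

If predictions start out far from the label mean and drift toward it during training, R^2 would rise from a large negative value toward 0, which resembles the curve described in the post; whether that is what is happening there cannot be confirmed from the post alone.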
Recall Score Increase [D]
by /u/Legal_Hearing555 (Machine Learning) on April 24, 2024 at 5:38 pm
Hello everyone, I am working on a small fraud detection project and I have a heavily imbalanced dataset. I used random undersampling because the minority class is pretty small. I also tried SMOTE and combining undersampling with SMOTE, but the best recall score I got (0.95) was with random undersampling alone. I tried GridSearchCV to increase it, but recall decreases instead, even though I tried to make the search focus on the recall score. Why is this happening?
[D] Preserving spatial distribution of data during data splitting
by /u/dr_greg_mouse (Machine Learning) on April 24, 2024 at 5:14 pm
Hello, I am trying to model nitrate concentrations in the streams of Bavaria, Germany, using a Random Forest model. I am using Python, primarily sklearn. I have data from 490 water quality stations. I am following the methodology in the paper by Longzhu Q. Shen et al., which can be found here: https://www.nature.com/articles/s41597-020-0478-7 I want to split my dataset into training and testing sets such that the spatial distribution of the data in both sets is identical. The idea is that if data splitting ignores the spatial distribution, there is a risk that the training set ends up with a concentration of points from densely populated areas, leaving out sparser areas. This can skew the model's learning process, making it less accurate or generalizable across the entire area of interest. sklearn's train_test_split just randomly divides the data into training and testing sets and does not consider the spatial patterns in the data. The paper I mentioned above follows this methodology: "We split the full dataset into two sub-datasets, training and testing respectively. To consider the heterogeneity of the spatial distribution of the gauge stations, we employed the spatial density estimation technique in the data splitting step by building a density surface using Gaussian kernels with a bandwidth of 50 km (using v.kernel available in GRASS GIS [33]) for each species and season. The pixel values of the resultant density surface were used as weighting factors to split the data into training and testing subsets that possess identical spatial distributions." I want to follow the same methodology, but instead of using GRASS GIS I am building the density surface myself in Python. I have also extracted the probability density values and the weights for the stations (attached figure). Now the only problem I am facing is: how do I use these weights to split the data into training and testing sets?
I checked, and there is no keyword in the sklearn train_test_split function that takes the weights into account. I also went back and forth with ChatGPT 4, but it was not able to give me a clear answer either. Neither did I find anything concrete on the internet about this. Maybe I am missing something. Is there any other function I can use to do this? Or will I have to write my own algorithm to do the splitting? If the latter, can you please suggest an approach so I can code it myself? In the attached figure you can see the location of the stations and the probability density surface generated using the kernel density estimation method (using Gaussian kernels). I am also attaching a screenshot of my dataframe to give you some idea of the data structure. (All columns after the longitude ('lon') column are used as features. The NO3 column is used as the target variable.) I will be grateful for any answers. [Figure: Probability density surface generated using the kernel density estimation method with Gaussian kernels.] [Figure: The dataset I am using to model the nitrate concentrations.]
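One possible approach to the question above, sketched under stated assumptions: rather than reproducing the paper's weighting procedure (whose details are not given in the quoted passage), the stations can be binned by their estimated density value and each bin split in the same proportion, so that sparse and dense areas are represented alike in both subsets. The function name density_stratified_split, its parameters, and the random stand-in densities are all hypothetical.

```python
import random


def density_stratified_split(densities, test_fraction=0.2, n_bins=5, seed=0):
    """Split station indices so every density bin contributes the same
    share to the test set. Returns (train_idx, test_idx)."""
    rng = random.Random(seed)
    # Station indices ordered from sparsest to densest location.
    order = sorted(range(len(densities)), key=lambda i: densities[i])
    train_idx, test_idx = [], []
    for b in range(n_bins):
        # Equal-count (quantile) bins over the ordered stations.
        start = b * len(order) // n_bins
        stop = (b + 1) * len(order) // n_bins
        members = order[start:stop]
        rng.shuffle(members)
        n_test = round(len(members) * test_fraction)
        test_idx.extend(members[:n_test])
        train_idx.extend(members[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Stand-in data: 490 stations with made-up density values in [0, 1).
demo_rng = random.Random(42)
densities = [demo_rng.random() for _ in range(490)]

train_idx, test_idx = density_stratified_split(densities)
print(len(train_idx), len(test_idx))
```

The same idea can likely be expressed with scikit-learn by passing the bin labels to the stratify parameter of train_test_split, which avoids maintaining a custom splitter; the number of bins is a tuning choice and not something the paper specifies.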
[N] Snowflake releases open (Apache 2.0) 128x3B MoE model
by /u/topcodemangler (Machine Learning) on April 24, 2024 at 4:45 pm
Links: https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/ https://replicate.com/snowflake/snowflake-arctic-instruct submitted by /u/topcodemangler
Enhance conversational AI with advanced routing techniques with Amazon Bedrock
by Ameer Hakme (AWS Machine Learning Blog) on April 24, 2024 at 4:30 pm
Conversational artificial intelligence (AI) assistants are engineered to provide precise, real-time responses through intelligent routing of queries to the most suitable AI functions. With AWS generative AI services like Amazon Bedrock, developers can create systems that expertly manage and respond to user requests. Amazon Bedrock is a fully managed service that offers a choice of
Improve LLM performance with human and AI feedback on Amazon SageMaker for Amazon Engineering
by Yunfei Bai (AWS Machine Learning Blog) on April 24, 2024 at 4:27 pm
The Amazon EU Design and Construction (Amazon D&C) team is the engineering team designing and constructing Amazon warehouses. The team navigates a large volume of documents and locates the right information to make sure the warehouse design meets the highest standards. In the post A generative AI-powered solution on Amazon SageMaker to help Amazon EU
Improve accuracy of Amazon Rekognition Face Search with user vectors
by Arik Porat (AWS Machine Learning Blog) on April 24, 2024 at 4:13 pm
In various industries, such as financial services, telecommunications, and healthcare, customers use a digital identity process, which usually involves several steps to verify end-users during online onboarding or step-up authentication. An example of one step that can be used is face search, which can help determine whether a new end-user’s face matches those associated with
[D] Why would such a simple sentence break an LLM?
by /u/michael-relleum (Machine Learning) on April 24, 2024 at 3:59 pm
This is a prompt I entered into MS Copilot (GPT-4 Turbo). It's in German, but it just means "Would there be any disadvantages if I took the full bath first?", so this can't be another SolidGoldMagikarp or similar, because the words were clearly in both the tokenizer and the training vocabulary. Why would such a simple sentence cause this? Any guesses? (I also tried with Claude Opus and Llama 3 70B, which worked fine.) https://preview.redd.it/9x6mva7b6gwc1.png?width=1129&format=png&auto=webp&s=bb6ac52d1c52d981161e8a864c5d1dd3794ca392 submitted by /u/michael-relleum
[R] Speaker diarization
by /u/anuragrawall (Machine Learning) on April 24, 2024 at 3:01 pm
Hi All, I am working on a project where I want to create speaker-aware transcripts from audio/video files, preferably using open-source solutions. I have tried many approaches, but nothing seems to work well enough out of the box. I have tried:
– whisperX: https://github.com/m-bain/whisperX (uses pyannote)
– whisper-diarization: https://github.com/MahmoudAshraf97/whisper-diarization (uses Nemo)
– AWS Transcribe
– AssemblyAI API
– Picovoice API
I'll need to dig deeper and understand what's causing the incorrect diarization, but I am looking for suggestions to improve speaker diarization. Please reach out if you have worked in this area and have had any success. Thanks! submitted by /u/anuragrawall
[R] I made an app to predict ICML paper acceptance from reviews
by /u/Lavishness-Mission (Machine Learning) on April 24, 2024 at 12:23 pm
https://www.norange.io/projects/paper_scorer/ A couple of years ago, u/programmerChilli analyzed ICLR 2019 review data and trained a model that rather accurately predicted acceptance results for NeurIPS. I decided to continue this analysis and trained a model (~6,000 parameters in total) on newer NeurIPS reviews, a dataset with twice as many reviews as ICLR 2019. The review scoring system for NeurIPS has also changed since 2019. Here is what I've learned:
1) Both conferences consistently reject nearly all submissions scoring <5 and accept those scoring >6. The most common score among accepted papers is 6. An average rating around 5.3 typically results in decisions that could go either way for both ICML and NeurIPS, suggesting that ~5.3 might be considered a soft threshold for acceptance.
2) Confidence scores are less impactful for borderline ratings such as 4 (borderline reject), 5 (borderline accept), and 6 (weak accept), but they can significantly affect the outcome for stronger reject or accept cases. For instance, with ratings of [3, 5, 6] and confidences of [*, 4, 4], lowering the confidence of the "Reject" review from 5 to 1 moves the acceptance probability through 26.2%, 31.3%, 52.4%, 54.5%, and 60.4%, indicating that lower confidence in this case increases your chances. Conversely, for ratings [3, 5, 7] with confidences [4, 4, 4], the acceptance probability is 31.3%, but it drops to 28.1% when the confidences change to [4, 4, 5]. Although it might seem counterintuitive, a confidence score of 5 actually decreases your chances. One possible explanation is that many low-quality reviews with a confidence of 5 are discounted by the Area Chairs (ACs).
Hope this will be useful, and thanks to u/programmerChilli for the inspiration! I also discussed this topic in a series of tweets. submitted by /u/Lavishness-Mission
[R] SpaceByte: Towards Deleting Tokenization from Large Language Modeling - Rice University 2024 - Practically the same performance as subword tokenizers without their many downsides!
by /u/Singularian2501 (Machine Learning) on April 24, 2024 at 11:42 am
Paper: https://arxiv.org/abs/2404.14408 GitHub: https://github.com/kjslag/spacebyte Abstract: Tokenization is widely used in large language models because it significantly improves performance. However, tokenization imposes several disadvantages, such as performance biases, increased adversarial vulnerability, decreased character-level modeling performance, and increased modeling complexity. To address these disadvantages without sacrificing performance, we propose SpaceByte, a novel byte-level decoder architecture that closes the performance gap between byte-level and subword autoregressive language modeling. SpaceByte consists of a byte-level Transformer model, but with extra larger transformer blocks inserted in the middle of the layers. We find that performance is significantly improved by applying these larger blocks only after certain bytes, such as space characters, which typically denote word boundaries. Our experiments show that for a fixed training and inference compute budget, SpaceByte outperforms other byte-level architectures and roughly matches the performance of tokenized Transformer architectures. https://preview.redd.it/v1xo6g1gzewc1.jpg?width=1507&format=pjpg&auto=webp&s=f9d415307b60639fa67e8a54c8769fa5a6c10f04 https://preview.redd.it/edvqos1gzewc1.jpg?width=1654&format=pjpg&auto=webp&s=f91c8727017e1a1bc7b80bb77a8627ff99182607 https://preview.redd.it/fe6z6i1gzewc1.jpg?width=1181&format=pjpg&auto=webp&s=24d955f30b8ca3eaa7c527f3f40545ed493f789c submitted by /u/Singularian2501
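To make the abstract's central idea concrete, here is a deliberately simplified sketch of choosing where the larger blocks would run. It assumes the rule "apply the larger block at every byte position that directly follows an ASCII space"; the paper's actual selection rule may be broader, and the function name `global_block_positions` is hypothetical.

```python
def global_block_positions(text):
    """Return byte indices that directly follow a space byte (simplified rule)."""
    data = text.encode("utf-8")
    # 0x20 is the ASCII space; position i is selected when byte i-1 is a space.
    return [i for i in range(1, len(data)) if data[i - 1] == 0x20]


sample = "byte level models"
positions = global_block_positions(sample)
# Share of byte positions where the expensive block would run in this toy case.
fraction = len(positions) / len(sample.encode("utf-8"))
```

Under this simplified rule the larger blocks run roughly once per word rather than once per byte, which is where the compute saving described in the abstract would come from.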
[D] Keeping track of models and their associated metadata.
by /u/ClearlyCylindrical (Machine Learning) on April 24, 2024 at 10:20 am
I am starting to accumulate a large number of models for a project I am working on. Many of these models are old ones that I am keeping for archival purposes, and many are fine-tuned from other models. I am wondering if there is an industry-standard way of dealing with this. In particular, I am looking for the following:
– Information about parameters used to train the model
– Datasets used to train the model
– Other metadata about the model (e.g. what objects an object detection model was trained for)
– Model performance
– Model lineage (what model it was fine-tuned from)
– Model progression (is this model a direct upgrade from some other model, such as being fine-tuned from the same model but using better hyperparameters)
– Model source (not sure about this, but I'm thinking of some way of linking the model to the Python script that was used to train it; not crucial, but something like this would be nice)
Are there any tools or services that could help me achieve some of this functionality? Also, if this is not the sub for this question, could I get some pointers in the correct direction? Thanks! submitted by /u/ClearlyCylindrical
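The fields requested above map fairly directly onto a small record type. The following is a minimal sketch using only the standard library; the class `ModelRecord`, the helper `lineage`, and every model name and metric value are made up for illustration and are not part of any existing tool.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import Optional


@dataclass
class ModelRecord:
    name: str
    hyperparameters: dict
    datasets: list
    metrics: dict
    parent: Optional[str] = None           # lineage: model this was fine-tuned from
    supersedes: Optional[str] = None       # progression: model this one replaces
    training_script: Optional[str] = None  # source: script used for training
    notes: dict = field(default_factory=dict)


registry = {}


def register(record):
    """Store a record under its name (in memory only in this sketch)."""
    registry[record.name] = record


def lineage(name):
    """Follow parent links from a model back to its root."""
    chain = []
    while name is not None:
        chain.append(name)
        name = registry[name].parent
    return chain


# Hypothetical entries with invented values.
register(ModelRecord("detector-base", {"lr": 1e-3}, ["dataset-a"], {"mAP": 0.61}))
register(ModelRecord("detector-ft-v1", {"lr": 1e-4}, ["dataset-b"], {"mAP": 0.68},
                     parent="detector-base", training_script="train.py"))
register(ModelRecord("detector-ft-v2", {"lr": 5e-5}, ["dataset-b"], {"mAP": 0.71},
                     parent="detector-base", supersedes="detector-ft-v1"))

# A JSON manifest that could be stored next to the model files.
manifest = json.dumps({k: asdict(v) for k, v in registry.items()}, indent=2)
```

Experiment trackers such as MLflow cover much of the same ground with a UI and storage backend; the sketch only shows that the requested metadata fits in a flat, serialisable structure.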
[D] Deploy the fine-tuned Mistral 7B model using the Hugging Face library
by /u/Future-Outcome3167 (Machine Learning) on April 24, 2024 at 9:31 am
I followed the tutorial provided at https://www.datacamp.com/tutorial/mistral-7b-tutorial and now seek methods to deploy the model for faster inference using Hugging Face and Gradio. Could anyone please share a guide notebook or article for reference? Any help would be appreciated. submitted by /u/Future-Outcome3167
[D] Transkribus vs Tesseract for Handwritten Text Recognition (HTR)
by /u/Pretty_Instance4483 (Machine Learning) on April 24, 2024 at 6:15 am
I am looking for an HTR tool with the best accuracy that is preferably not pricey (obviously). From my research, it seems that Transkribus is the most frequently mentioned platform with good reviews. As I would need to convert images to text regularly, I would need to pay for the subscription. So I am wondering if I could use the Tesseract and/or TensorFlow Python libraries to achieve the same result for free. Would using Tesseract/TensorFlow be less accurate than using Transkribus? I have learned only the basics of machine learning (TensorFlow, scikit-learn, Keras), so I might not have enough knowledge to see the difference between the two solutions. Or would training Tesseract/TensorFlow be challenging? submitted by /u/Pretty_Instance4483
[D] How do researchers think about inductive bias when creating new or improving existing foundational models?
by /u/binny_sarita (Machine Learning) on April 24, 2024 at 2:36 am
I am an undergraduate student learning machine learning. From reading a few papers, I understand that we try to reduce the search space by imposing an inductive bias on machine learning models, and that useful models result when the inductive bias matches the underlying data. In hierarchical models like NVAE, how is the inductive bias instilled by specifying the way the data gets computed? (I think it's called algorithmic bias, not sure though.) But how do people know that such an inductive bias will be helpful, and what step-by-step procedure do they go through to arrive at it? I have taken a lot of classes in machine learning and statistics but never had any lectures explaining such things. Did I miss a course or lecture? Please point me to related papers/lectures/talks if possible. Thank you. submitted by /u/binny_sarita
[R] Generalized Contrastive Learning for Multi-Modal Retrieval and Ranking
by /u/Jesse_marqo (Machine Learning) on April 23, 2024 at 11:07 pm
Generalization of the popular training method of CLIP to be better suited for search and recommendations. Paper: https://arxiv.org/pdf/2404.08535.pdf GitHub: https://github.com/marqo-ai/GCL
– Generalises CLIP: Use any number of text and/or images to represent documents.
– Better text understanding by having both inter- and intra-modal losses.
– Can encode rank/importance/relevance, a.k.a. "rank-tune".
– Works with pretrained, text, CLIP models.
– Can learn uni- or multi-vector representations for documents.
– Works with binary and Matryoshka methods.
– Open source 10M row multi-modal dataset with 100k queries and ~5M products.
Why?
The prevailing methods for training embedding models are largely disconnected from the end use-case (like search), the vector database, the requirements of users, and a lack of representative datasets for development and evaluation, particularly when multiple modalities and ranking are involved.
Limitations of current embedding models for vector search
Although vector search is very powerful and enables searching across just about any data, the current methods have some limitations. The prevailing methods for training embedding models are largely disconnected from the end use-case (like search), the vector database, and the requirements of users. This means that a lot of the potential of vector search is being unmet. Some of the current challenges are described below.
Restricted to using a single piece of information to represent a document
Current models encode and represent one piece of information with one vector. The reality is that often there are multiple pieces of pertinent information for a document that may span multiple modalities. For example, in product search there may be a title, description, reviews, and multiple images, each with its own caption. GCL generalises embedding model training to use as many pieces of information as is desired.
No notion of rank when dealing with degenerate queries
When there are degenerate queries (multiple results that satisfy some criteria of relevance), the ordering of the results is only ever learned indirectly from the many binary relationships. In reality, the ordering of results matters, even for first stage retrieval. GCL allows for the magnitude of query-document specific relevance to be encoded in the embeddings and improves ranking of candidate documents.
Poor text understanding when using CLIP-like methods
Multi-modal models like CLIP are trained to work only from image to text (and vice versa). Their text-text understanding is not as good as that of text-only models because the text-text relationships are learned indirectly through images. For many applications, having both inter- and intra-modality understanding is required. GCL allows for any combination of inter- and intra-modal understanding by directly optimizing for this.
Lack of representative datasets to develop methods for vector search
In developing GCL, it became apparent there was a disconnect between publicly available datasets for embedding model training and evaluation and real-world use cases. Existing benchmarks are typically text only or inter-modal only and focus on the 1-1 query-result paradigm. Additionally, existing datasets have limited notions of relevance, with the majority encoding it as a binary relationship while several use (up to) a handful of discrete categorizations, often on the test set only. This differs from typical real-world use cases, where relevance can be either a hard binary relationship or derived from continuous variables. To help with this we compiled a dataset of 10M (ranked) product-query pairs, across ~100k queries, nearly 5M products, and four evaluation splits (available here). submitted by /u/Jesse_marqo
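As a rough illustration of what "encoding the magnitude of relevance" in a contrastive objective could look like, here is a relevance-weighted InfoNCE loss in plain Python. This is an assumption made for illustration only, not the loss defined in the GCL paper; the function name, the `weights` argument, and the toy similarity matrix are hypothetical.

```python
import math


def weighted_contrastive_loss(sim, weights, temperature=0.1):
    """Relevance-weighted InfoNCE over a square similarity matrix.

    sim[i][j]: similarity between query i and document j; the matching
               pair for query i sits on the diagonal (j == i).
    weights[i]: relevance score for pair i (hypothetical, e.g. rank-derived).
    """
    total = 0.0
    for i, row in enumerate(sim):
        logits = [s / temperature for s in row]
        m = max(logits)  # subtract the max for numerical stability
        log_norm = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy of the matching document, scaled by its relevance.
        total += weights[i] * (log_norm - logits[i])
    return total / sum(weights)


# Toy case: query 0 matches its document well, query 1 does not.
toy_sim = [[1.0, 0.0],
           [1.0, 0.0]]
loss_uniform = weighted_contrastive_loss(toy_sim, [1.0, 1.0])
loss_weighted = weighted_contrastive_loss(toy_sim, [1.0, 3.0])
```

Giving the poorly matched pair a higher weight raises the loss, so training would push harder on the pairs marked as most relevant. How the paper actually combines inter- and intra-modal terms is not shown here.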
[D] Practical uses of AI inside companies
by /u/CJSF (Machine Learning) on April 23, 2024 at 10:25 pm
How are people using AI inside companies (startups -> FAANG) to improve operations and processes? There is so much talk about leveraging LLMs and GenAI, but I'm struggling to find real, concrete examples that are successful, beyond what comes up in a Google search. The following areas come to mind first, but this list isn't exhaustive of course: Design (and handoff), Engineering, Customer Support, Sales, Documentation, Marketing. What's worked or shown promise? What hasn't worked? submitted by /u/CJSF
Meta does everything OpenAI should be [D]
by /u/ReputationMindless32 (Machine Learning) on April 23, 2024 at 10:03 pm
I'm surprised (or maybe not) to say this, but Meta (or Facebook) democratises AI/ML much more than OpenAI, which was originally founded and primarily funded for this purpose. OpenAI has largely become a purely commercial, for-profit project. The Llama models don't yet reach GPT-4 capabilities for me, but I believe it's only a matter of time. What do you guys think about this? submitted by /u/ReputationMindless32
Accelerate ML workflows with Amazon SageMaker Studio Local Mode and Docker support
by Shweta Singh (AWS Machine Learning Blog) on April 23, 2024 at 7:20 pm
We are excited to announce two new capabilities in Amazon SageMaker Studio that will accelerate iterative development for machine learning (ML) practitioners: Local Mode and Docker support. ML model development often involves slow iteration cycles as developers switch between coding, training, and deployment. Each step requires waiting for remote compute resources to start up, which
[D] Speech to Text Word Level Timestamps Accuracy Issue
by /u/Mindless-Ordinary485 (Machine Learning) on April 23, 2024 at 7:18 pm
I've had a lot of success with Whisper when it comes to transcriptions, but word-level timestamps seem to be slightly inaccurate. From my understanding, this is expected: "Whisper cannot provide reliable word timestamps, because the END-TO-END models like Transformer using cross-entropy training criterion are not designed for reliably estimating word timestamps." (https://www.youtube.com/watch?v=H576iCWt1Co&t=192s) For my use case, I need precise word-level timestamps, because I'm doing audio insertion after specific words. This becomes problematic when I do an insertion and the tail of a word ends up on the other side. Example: given an original audio file with speech that has been transcribed, if I want to insert a clip at the end of the word "France", and according to the timestamp the word "France" starts at 19.26 and ends at 19.85, I will insert the clip at 19.85. However, if the actual end of "France" is at 19.92, then when I insert the clip at 19.85, I will hear the remainder of "France", likely "ce" (0.07 s), after the clip. I'm curious whether anyone has faced a similar problem and what they did to get around it. I've experimented with a few open-source variations of Whisper, but I am still running into this issue. submitted by /u/Mindless-Ordinary485
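One workaround that might help with the boundary problem described above, sketched under the assumption that a short pause follows the word: treat the reported end time as a rough estimate and move the cut to the quietest short frame in a small window after it. The function name, the 150 ms search window, the 10 ms frame length, and the synthetic signal are all hypothetical choices, not something Whisper provides.

```python
import math


def snap_to_quietest_frame(samples, sample_rate, t_end, search_s=0.15, frame_s=0.01):
    """Return a time in [t_end, t_end + search_s] where short-time energy is lowest."""
    frame = max(1, round(frame_s * sample_rate))
    start = round(t_end * sample_rate)
    stop = min(len(samples) - frame, round((t_end + search_s) * sample_rate))
    best_pos, best_energy = start, float("inf")
    for pos in range(start, stop + 1, frame):
        # Mean squared amplitude of the frame starting at this sample.
        energy = sum(x * x for x in samples[pos:pos + frame]) / frame
        if energy < best_energy:
            best_pos, best_energy = pos, energy
    return best_pos / sample_rate


# Synthetic stand-in for the example: a tone (the "word") that really ends at
# 0.92 s, followed by silence, while the reported end time is 0.85 s.
rate = 1000
signal = [math.sin(2 * math.pi * 100 * i / rate) if i < 920 else 0.0
          for i in range(1200)]
cut_time = snap_to_quietest_frame(signal, rate, t_end=0.85)
```

On real speech the quietest frame can fall inside a word (for example during a stop consonant), so the window should stay short, and a forced-alignment tool may still be needed when the pause is very brief.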
[R] Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry
by /u/SeawaterFlows (Machine Learning) on April 23, 2024 at 7:11 pm
Paper: https://arxiv.org/abs/2404.06405 Code: https://huggingface.co/datasets/bethgelab/simplegeometry Abstract: Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 25 of 30 International Mathematical Olympiad (IMO) problems whereas the reported baseline based on Wu's method solved only ten. In this note, we revisit the IMO-AG-30 Challenge introduced with AlphaGeometry, and find that Wu's method is surprisingly strong. Wu's method alone can solve 15 problems, and some of them are not solved by any of the other methods. This leads to two key findings: (i) Combining Wu's method with the classic synthetic methods of deductive databases and angle, ratio, and distance chasing solves 21 out of 30 problems by just using a CPU-only laptop with a time limit of 5 minutes per problem. Essentially, this classic method solves just 4 problems fewer than AlphaGeometry and establishes the first fully symbolic baseline strong enough to rival the performance of an IMO silver medalist. (ii) Wu's method even solves 2 of the 5 problems that AlphaGeometry failed to solve. Thus, by combining AlphaGeometry with Wu's method we set a new state-of-the-art for automated theorem proving on IMO-AG-30, solving 27 out of 30 problems, the first AI method which outperforms an IMO gold medalist. submitted by /u/SeawaterFlows
[N] Phi-3-mini released on HuggingFace
by /u/topcodemangler (Machine Learning) on April 23, 2024 at 3:26 pm
https://huggingface.co/microsoft/Phi-3-mini-128k-instruct The numbers in the technical report look really great, though I guess they need to be verified by third parties. submitted by /u/topcodemangler
[D] How to Install and Deploy LLaMA 3 Into Production, and Hardware Requirements
by /u/juliensalinas (Machine Learning) on April 23, 2024 at 12:33 pm
Many are trying to install and deploy their own LLaMA 3 model, so here is a tutorial I just made showing how to deploy LLaMA 3 on an AWS EC2 instance: https://nlpcloud.com/how-to-install-and-deploy-llama-3-into-production.html Deploying LLaMA 3 8B is fairly easy, but LLaMA 3 70B is another beast. Given the amount of VRAM needed, you might want to provision more than one GPU and use a dedicated inference server like vLLM in order to split your model across several GPUs. LLaMA 3 8B requires around 16GB of disk space and 20GB of VRAM (GPU memory) in FP16. As for LLaMA 3 70B, it requires around 140GB of disk space and 160GB of VRAM in FP16. I hope it is useful, and if you have questions please don't hesitate to ask! Julien submitted by /u/juliensalinas
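The disk figures quoted above follow from simple arithmetic: FP16 stores each parameter in 2 bytes. The short check below uses nominal parameter counts (8 billion and 70 billion) and decimal gigabytes, both of which are simplifying assumptions; the helper name `fp16_weight_gb` is hypothetical.

```python
def fp16_weight_gb(n_params, bytes_per_param=2):
    """Approximate size of the model weights in decimal gigabytes."""
    return n_params * bytes_per_param / 1e9


weights_8b = fp16_weight_gb(8e9)    # nominal 8B parameters
weights_70b = fp16_weight_gb(70e9)  # nominal 70B parameters

# Headroom implied by the post's VRAM figures (20GB and 160GB).
overhead_8b = 20 - weights_8b
overhead_70b = 160 - weights_70b
```

The gap between weight size and the quoted VRAM (about 4GB and 20GB here) would be the room left for activations, the KV cache, and runtime buffers; the real overhead depends on batch size and context length.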
Significant new capabilities make it easier to use Amazon Bedrock to build and scale generative AI applications – and achieve impressive results
by Swami Sivasubramanian (AWS Machine Learning Blog) on April 23, 2024 at 11:50 am
We introduced Amazon Bedrock to the world a little over a year ago, delivering an entirely new way to build generative artificial intelligence (AI) applications. With the broadest selection of first- and third-party foundation models (FMs) as well as user-friendly capabilities, Amazon Bedrock is the fastest and easiest way to build and scale secure generative
Building scalable, secure, and reliable RAG applications using Knowledge Bases for Amazon Bedrock
by Mani Khanuja (AWS Machine Learning Blog) on April 23, 2024 at 11:40 am
This post explores the new enterprise-grade features for Knowledge Bases on Amazon Bedrock and how they align with the AWS Well-Architected Framework. With Knowledge Bases for Amazon Bedrock, you can quickly build applications using Retrieval Augmented Generation (RAG) for use cases like question answering, contextual chatbots, and personalized search.
[D] What best practices and workflows should those working solo as DS/MLE keep in mind?
by /u/Melodic_Reality_646 (Machine Learning) on April 23, 2024 at 9:40 am
I'm wondering what technical recruiters or seasoned DS/MLEs have to say about people with profiles like mine: good theoretical and decent technical background, but working solo for too long. Summary of my career for context: I've been working for 8 years now as a DS, the first 3 in medium-sized R&D and consulting teams (for a big tech company), then the past 5 as a solo DS for relatively successful non-AI-focused start-ups, mostly developing ML/NLP stuff to address specific issues or improve one specific feature of their product (i.e. never a whole product). In 5 years I designed, developed and deployed, say, 4 models (but experimented with many, of course), along with a few dashboards and simple Streamlit POCs. Recently, attending meetups and seeing how people who are part of actual teams work, discuss and exchange knowledge, it suddenly struck me: I'm missing out, I'm becoming obsolete. I don't feel sharp enough for technical interviews, and I'm not sure the way I develop and maintain my projects follows good standards/best practices (heck, I hardly follow a kanban; I mostly use my planner to report progress to my boss). I do some version control and document what I put into prod, but I'm not even sure I'm doing that the way it would be expected within a team. submitted by /u/Melodic_Reality_646
Integrate HyperPod clusters with Active Directory for seamless multi-user login
by Tomonori Shimomura (AWS Machine Learning Blog) on April 22, 2024 at 5:50 pm
Amazon SageMaker HyperPod is purpose-built to accelerate foundation model (FM) training, removing the undifferentiated heavy lifting involved in managing and optimizing a large training compute cluster. With SageMaker HyperPod, you can train FMs for weeks and months without disruption. Typically, HyperPod clusters are used by multiple users: machine learning (ML) researchers, software engineers, data scientists,
The executive’s guide to generative AI for sustainability
by Wafae Bakkali (AWS Machine Learning Blog) on April 22, 2024 at 5:40 pm
Organizations are facing ever-increasing requirements for sustainability goals alongside environmental, social, and governance (ESG) practices. A Gartner, Inc. survey revealed that 87 percent of business leaders expect to increase their organization’s investment in sustainability over the next years. This post serves as a starting point for any executive seeking to navigate the intersection of generative
[D] Llama-3 may have just killed proprietary AI models
by /u/madredditscientist (Machine Learning) on April 22, 2024 at 3:08 pm
Meta released Llama-3 only three days ago, and it already feels like the inflection point when open source models finally closed the gap with proprietary models. The initial benchmarks show that Llama-3 70B comes pretty close to GPT-4 in many tasks:
– The official Meta page only shows that Llama-3 outperforms Gemini 1.5 and Claude Sonnet.
– Artificial Analysis shows that Llama-3 is in between Gemini-1.5 and Opus/GPT-4 for quality.
– On the LMSYS Chatbot Arena Leaderboard, Llama-3 is ranked #5 while current GPT-4 models and Claude Opus are still tied at #1.
The even more powerful Llama-3 400B+ model is still in training and is likely to surpass GPT-4 and Opus once released.
Meta vs OpenAI
Some speculate that Meta's goal from the start was to target OpenAI with a "scorched earth" approach by releasing powerful open models to disrupt the competitive landscape and avoid being left behind in the AI race. Meta can likely outspend OpenAI on compute and talent:
– OpenAI makes an estimated revenue of $2B and is likely unprofitable. Meta generated a revenue of $134B and profits of $39B in 2023.
– Meta's compute resources likely outrank OpenAI's by now.
– Open source likely attracts better talent and researchers.
One possible outcome could be the acquisition of OpenAI by Microsoft to catch up with Meta. Google is also making moves into the open model space and has similar capabilities to Meta. It will be interesting to see where they fit in.
The Winners: Developers and AI Product Startups
I recently wrote about the excitement of building an AI startup right now, as your product automatically improves with each major model advancement. With the release of Llama-3, the opportunities for developers are even greater:
– No more vendor lock-in. Instead of just wrapping proprietary API endpoints, developers can now integrate AI deeply into their products in a very cost-effective and performant way. There are already over 800 Llama-3 model variations on Hugging Face, and it looks like everyone will be able to fine-tune for their use cases, languages, or industry.
– Faster, cheaper hardware: Groq can now generate 800 Llama-3 tokens per second at a small fraction of the GPT costs. Near-instant LLM responses at low prices are on the horizon.
Open source multimodal models for vision and video still have to catch up, but I expect this to happen very soon. The release of Llama-3 marks a significant milestone in the democratization of AI, but it's probably too early to declare the death of proprietary models. Who knows, maybe GPT-5 will surprise us all and surpass our imaginations of what transformer models can do. These are definitely super exciting times to build in the AI space! submitted by /u/madredditscientist
[D] Simple Questions Thread
by /u/AutoModerator (Machine Learning) on April 21, 2024 at 3:00 pm
Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after the date in the title. Thanks to everyone for answering questions in the previous thread! submitted by /u/AutoModerator
Introducing automatic training for solutions in Amazon Personalize
by Ba'Carri Johnson (AWS Machine Learning Blog) on April 20, 2024 at 12:38 am
Amazon Personalize is excited to announce automatic training for solutions. Solution training is fundamental to maintain the effectiveness of a model and make sure recommendations align with users’ evolving behaviors and preferences. As data patterns and trends change over time, retraining the solution with the latest relevant data enables the model to learn and adapt,
Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average
by Rajesh Ramchander (AWS Machine Learning Blog) on April 19, 2024 at 4:55 pm
We are excited to announce a new version of the Amazon SageMaker Operators for Kubernetes using the AWS Controllers for Kubernetes (ACK). ACK is a framework for building Kubernetes custom controllers, where each controller communicates with an AWS service API. These controllers allow Kubernetes users to provision AWS resources like buckets, databases, or message queues
Talk to your slide deck using multimodal foundation models hosted on Amazon Bedrock – Part 2
by Archana Inapudi (AWS Machine Learning Blog) on April 19, 2024 at 3:15 pm
In Part 1 of this series, we presented a solution that used the Amazon Titan Multimodal Embeddings model to convert individual slides from a slide deck into embeddings. We stored the embeddings in a vector database and then used the Large Language-and-Vision Assistant (LLaVA 1.5-7b) model to generate text responses to user questions based on
Scale AI training and inference for drug discovery through Amazon EKS and Karpenter
by Matthew Welborn (AWS Machine Learning Blog) on April 19, 2024 at 3:07 pm
This is a guest post co-written with the leadership team of Iambic Therapeutics. Iambic Therapeutics is a drug discovery startup with a mission to create innovative AI-driven technologies to bring better medicines to cancer patients, faster. Our advanced generative and predictive artificial intelligence (AI) tools enable us to search the vast space of possible drug
Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock
by Ebbey Thomas (AWS Machine Learning Blog) on April 18, 2024 at 5:57 pm
As you navigate the complexities of cloud migration, the need for a structured, secure, and compliant environment is paramount. AWS Landing Zone addresses this need by offering a standardized approach to deploying AWS resources. This makes sure your cloud foundation is built according to AWS best practices from the start. With AWS Landing Zone, you eliminate the guesswork in security configurations, resource provisioning, and account management. It’s particularly beneficial for organizations looking to scale without compromising on governance or control, providing a clear path to a robust and efficient cloud setup. In this post, we show you how to generate customized, compliant IaC scripts for AWS Landing Zone using Amazon Bedrock.
Live Meeting Assistant with Amazon Transcribe, Amazon Bedrock, and Knowledge Bases for Amazon Bedrock
by Bob Strahan (AWS Machine Learning Blog) on April 18, 2024 at 5:08 pm
You’ve likely experienced the challenge of taking notes during a meeting while trying to pay attention to the conversation. You’ve probably also experienced the need to quickly fact-check something that’s been said, or look up information to answer a question that’s just been asked in the call. Or maybe you have a team member who always joins meetings late and expects you to send them a quick summary over chat to catch them up. Then there are the times when others are talking in a language that’s not your first language, and you’d love to have a live translation of what people are saying to make sure you understand correctly. And after the call is over, you usually want to capture a summary for your records, or to send to the participants, with a list of all the action items, owners, and due dates. All of this, and more, is now possible with our newest sample solution, Live Meeting Assistant (LMA).
Meta Llama 3 models are now available in Amazon SageMaker JumpStart
by Kyle Ulrich (AWS Machine Learning Blog) on April 18, 2024 at 4:31 pm
Today, we are excited to announce that Meta Llama 3 foundation models are available through Amazon SageMaker JumpStart to deploy and run inference. The Llama 3 models are a collection of pre-trained and fine-tuned generative text models. In this post, we walk through how to discover and deploy Llama 3 models via SageMaker JumpStart. What is
Slack delivers native and secure generative AI powered by Amazon SageMaker JumpStart
by Jackie Rocca (AWS Machine Learning Blog) on April 18, 2024 at 12:00 pm
We are excited to announce that Slack, a Salesforce company, has collaborated with Amazon SageMaker JumpStart to power Slack AI’s initial search and summarization features and provide safeguards for Slack to use large language models (LLMs) more securely. Slack worked with SageMaker JumpStart to host industry-leading third-party LLMs so that data is not shared with infrastructure owned by third-party model providers. This keeps customer data in Slack at all times and upholds the same security practices and compliance standards that customers expect from Slack itself.
Uncover hidden connections in unstructured financial data with Amazon Bedrock and Amazon Neptune
by Xan Huang (AWS Machine Learning Blog) on April 17, 2024 at 3:00 pm
In asset management, portfolio managers need to closely monitor companies in their investment universe to identify risks and opportunities, and guide investment decisions. Tracking direct events like earnings reports or credit downgrades is straightforward—you can set up alerts to notify managers of news containing company names. However, detecting second- and third-order impacts arising from events
Open source observability for AWS Inferentia nodes within Amazon EKS clusters
by Riccardo Freschi (AWS Machine Learning Blog) on April 17, 2024 at 2:54 pm
This post walks you through the Open Source Observability pattern for AWS Inferentia, which shows you how to monitor the performance of ML chips, used in an Amazon Elastic Kubernetes Service (Amazon EKS) cluster, with data plane nodes based on Amazon Elastic Compute Cloud (Amazon EC2) instances of type Inf1 and Inf2.
Explore data with ease: Use SQL and Text-to-SQL in Amazon SageMaker Studio JupyterLab notebooks
by Pranav Murthy (AWS Machine Learning Blog) on April 16, 2024 at 11:00 pm
Amazon SageMaker Studio provides a fully managed solution for data scientists to interactively build, train, and deploy machine learning (ML) models. In the process of working on their ML tasks, data scientists typically start their workflow by discovering relevant data sources and connecting to them. They then use SQL to explore, analyze, visualize, and integrate
Distributed training and efficient scaling with the Amazon SageMaker Model Parallel and Data Parallel Libraries
by Xinle Sheila Liu (AWS Machine Learning Blog) on April 16, 2024 at 4:18 pm
In this post, we explore the performance benefits of Amazon SageMaker (including SMP and SMDDP), and how you can use the library to train large models efficiently on SageMaker. We demonstrate the performance of SageMaker with benchmarks on ml.p4d.24xlarge clusters up to 128 instances, and FSDP mixed precision with bfloat16 for the Llama 2 model.

Download AWS Machine Learning Specialty Exam Prep App on iOS

Download AWS Machine Learning Specialty Exam Prep App on Android/Web/Amazon
