Add GenAI packages #7169
Comments
Can you guys publish a preview for Microsoft.ML.GenAI.LLaMA package? |
@lostmsu You should be able to consume it from the daily build below. Oh, I just noticed that the GenAI package hasn't been set to |
Can you please publish a preview for the Microsoft.ML.GenAI.Core package? https://pkgs.dev.azure.com/dnceng/public/_packaging/dotnet-libraries/nuget/v3/index.json The sample Microsoft.ML.GenAI.Samples/Llama/LLaMA3_1.cs is broken without it. Furthermore, the sample has a hard-coded weight folder |
Oh, sorry, I'll make the fix |
I am getting System.IO.FileNotFoundException |
@aforoughi1 Which Llama? I suppose you are running Llama 3.2 1B? |
Llama3.1-8B |
The error basically says it can't find the |
// issue 7169
TiktokenTokenizer tokenizer = LlamaTokenizerHelper.FromPretrained(weightFolder, modelFile);
var pipeline = new CausalLMPipeline<TiktokenTokenizer, LlamaForCausalLM>(tokenizer, model, "cpu");
System.IO.FileNotFoundException |
@aforoughi1 The
LlamaForCausalLM model = LlamaForCausalLM.FromPretrained("Meta-Llama-3.1-8B-Instruct", configName, layersOnTargetDevice: -1, targetDevice: "cpu"); |
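A FileNotFoundException like the one reported above usually means that a file the loader expects is absent from the weight folder. The following is a minimal sketch of a pre-flight check that uses only System.IO. `WeightFolderCheck` and `FindMissingFiles` are hypothetical helpers, not part of Microsoft.ML.GenAI, and the caller has to supply the expected file names because the thread does not list which files were missing.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Linq;

public static class WeightFolderCheck
{
    // Returns the names from expectedFiles that do not exist in weightFolder.
    // Hypothetical helper: the expected names are an assumption made by the caller.
    public static IReadOnlyList<string> FindMissingFiles(
        string weightFolder,
        IEnumerable<string> expectedFiles)
    {
        if (!Directory.Exists(weightFolder))
        {
            throw new DirectoryNotFoundException(
                $"Weight folder not found: {Path.GetFullPath(weightFolder)}");
        }

        return expectedFiles
            .Where(name => !File.Exists(Path.Combine(weightFolder, name)))
            .ToList();
    }
}
```

Calling this before `FromPretrained` and printing the returned names should make it clearer which file the exception refers to.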
I sorted out the following missing files and the directory structure: The model loads successfully ONLY if I use the defaults. Setting layersOnTargetDevice: 26, quantizeToInt8: true causes a memory corruption exception. The example is also missing stopWatch.Stop(). I also don't see RegisterPrintMessage() print any messages to the console. |
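On the missing stopWatch.Stop(): a Stopwatch keeps accumulating elapsed time until it is stopped, so a reading taken later can include unrelated work. The following is a sketch of one way to guarantee the stop, assuming the timed work can be passed as a delegate. `TimedRun.Measure` is a hypothetical helper, and the action stands in for the sample's generation call.

```csharp
using System;
using System.Diagnostics;

public static class TimedRun
{
    // Runs the action and returns how long it took.
    // The stopwatch is stopped in finally, so it stops even if the action throws.
    public static TimeSpan Measure(Action action)
    {
        var stopWatch = Stopwatch.StartNew();
        try
        {
            action();
        }
        finally
        {
            stopWatch.Stop();
        }

        return stopWatch.Elapsed;
    }
}
```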
@aforoughi1 Are you using the nightly build or trying the example from the main branch? |
Nightly build
|
@aforoughi1 And your GPU device/platform? |
Device is set to CPU:
torch.InitializeDeviceType(DeviceType.CPU);
microsoft.ml.genai.llama\0.22.0-preview.24477.3\
microsoft.ml.torchsharp\0.21.1\
torchsharp-cpu\0.103.0\
Processor: 12th Gen Intel(R) Core(TM) i5-1235U 2.50 GHz
Installed RAM: 16.0 GB (15.8 GB usable)
System type: 64-bit operating system, x64-based processor
Edition: Windows 11 Home
Version: 23H2
OS build: 22631.4249
Experience: Windows Feature Experience Pack 1000.22700.1041.0
|
The |
Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
Describe the solution you'd like
The GenAI packages will provide TorchSharp implementations for a series of popular GenAI models. The goal is to load the same weights as the corresponding regular Python model.
Microsoft.ML.GenAI.Core (Add GenAI core package #7177)
The following models will be added in the first wave
Microsoft.ML.GenAI.Phi (Add Microsoft.ML.GenAI.Phi, test package and sample project. #7184)
Microsoft.ML.GenAI.Phi project ([GenAI] Add readme to Microsoft.ML.GenAI.Phi #7206)
Microsoft.ML.GenAI.LLaMA ([GenAI] Add LLaMA support #7220)
Microsoft.ML.GenAI.Mistral
Microsoft.ML.GenAI.StableDiffusion
MEAI integration
Fine-tuning #7287
Along with the benchmark
Benchmark for Phi-3
Flash Attention support ([GenAI] Add FlashAttention #7238)
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.