Request for the Dataset used in this project. #1

night-fury-me · 2018-05-31T19:26:24Z

Dear Sir,

From the readme section, I came to know that you are interested to share this dataset for research purpose. My group and I, are doing a project related to NLP, for which we are in need of a dataset of Bangla sentences. If you can share the dataset with us, it will be a great help.

Thanks in advance.

firojalam · 2018-06-01T20:02:31Z

Hi,
Thanks for your interest. Can you please write your details? Which institute are you from? We typically engage with the project that people are interested in. So please do let us know.

Firoj

night-fury-me · 2018-06-02T10:31:16Z

I am from North South University, Dept. of ECE. And It will be a course ( Neural Networks ) project. I have also emailed you regarding this issue.

night-fury-me · 2018-06-04T16:43:33Z

May I have your email? I would like to discuss about our research with you.

firojalam · 2018-06-04T23:58:40Z

Hi,
Sorry for the delayed reply. You can write us at [email protected] and [email protected].

Thank you

Nesarul-Hoque · 2022-12-20T12:39:03Z

How can I reduce the dimension from 300 to 100?

firojalam · 2022-12-20T12:44:23Z

There are several ways you can do that:

Naive approach: just get the first 100,
You can use PCA to reduce the dimension. Get top 100 principle dimensions,
Train another model with 100 dimension

Nesarul-Hoque · 2022-12-21T05:22:17Z

Thank you for the quick response. I thought there might be a built-in function to reduce the dimensions like the fastText pre-trained model. Actually, I know about the three approaches to reducing the dimensions. However, the best way is to train the corpus again with 100 dimensions. So, can I get the link to the corpus which you used? [image: Mailtrack] <https://mailtrack.io?utm_source=gmail&utm_medium=signature&utm_campaign=signaturevirality11&> Sender notified by Mailtrack <https://mailtrack.io?utm_source=gmail&utm_medium=signature&utm_campaign=signaturevirality11&> 12/21/22, 11:14:08 AM

…

On Tue, Dec 20, 2022 at 6:44 PM Firoj Alam, Scientist, QCRI < ***@***.***> wrote: There are several ways you can do that: - Naive approach: just get the first 100, - You can use PCA to reduce the dimension. Get top 100 principle dimensions, - Train another model with 100 dimension — Reply to this email directly, view it on GitHub <#1 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A436L2JHVNLGVPBPTVFBDH3WOGS3FANCNFSM4FCWNEJA> . You are receiving this because you commented.Message ID: ***@***.*** com>

-- Md. Nesarul Hoque Assistant Professor Dept. of CSE Bangabandhu Sheikh Mujibur Rahman Science and Technology University Gopalganj 8100, Dhaka, Bangladesh Cell No: +8801712-820727

firojalam · 2022-12-26T15:51:43Z

Dear Nesarul-Hoque,
We do not have oroginal dataset now. And unfortunately, we could not share it before.

Firoj

Nesarul-Hoque · 2022-12-27T06:58:08Z

Thank you for your clarification. [image: Mailtrack] <https://mailtrack.io?utm_source=gmail&utm_medium=signature&utm_campaign=signaturevirality11&> Sender notified by Mailtrack <https://mailtrack.io?utm_source=gmail&utm_medium=signature&utm_campaign=signaturevirality11&> 12/27/22, 12:56:25 PM

…

On Mon, Dec 26, 2022 at 9:51 PM Firoj Alam, Scientist, QCRI < ***@***.***> wrote: Dear Nesarul-Hoque, We do not have oroginal dataset now. And unfortunately, we could not share it before. Firoj — Reply to this email directly, view it on GitHub <#1 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A436L2JXZIOCBDCZ4ZGKOBLWPG5JTANCNFSM4FCWNEJA> . You are receiving this because you commented.Message ID: ***@***.*** com>

-- Md. Nesarul Hoque Assistant Professor Dept. of CSE Bangabandhu Sheikh Mujibur Rahman Science and Technology University Gopalganj 8100, Dhaka, Bangladesh Cell No: +8801712-820727

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request for the Dataset used in this project. #1

Request for the Dataset used in this project. #1

night-fury-me commented May 31, 2018

firojalam commented Jun 1, 2018

night-fury-me commented Jun 2, 2018 •

edited

Loading

night-fury-me commented Jun 4, 2018

firojalam commented Jun 4, 2018

Nesarul-Hoque commented Dec 20, 2022

firojalam commented Dec 20, 2022

Nesarul-Hoque commented Dec 21, 2022 via email

firojalam commented Dec 26, 2022

Nesarul-Hoque commented Dec 27, 2022 via email

Request for the Dataset used in this project. #1

Request for the Dataset used in this project. #1

Comments

night-fury-me commented May 31, 2018

firojalam commented Jun 1, 2018

night-fury-me commented Jun 2, 2018 • edited Loading

night-fury-me commented Jun 4, 2018

firojalam commented Jun 4, 2018

Nesarul-Hoque commented Dec 20, 2022

firojalam commented Dec 20, 2022

Nesarul-Hoque commented Dec 21, 2022 via email

firojalam commented Dec 26, 2022

Nesarul-Hoque commented Dec 27, 2022 via email

night-fury-me commented Jun 2, 2018 •

edited

Loading