Sohom Ghosh

Website

Address

sohom1ghosh@gmail.com

Alternative Email Address

sohomg.cse.rs@jadavpuruniversity.in

Field of Specialization

Natural Language Processing,
Computational Linguistics,
Large Language Models,
Generative AI,
Financial NLP
B.Tech (Computer Science & Engineering), M.Tech (Software Systems), PhD (Thesis submitted)
1022204003 of 2022-2023
https://orcid.org/0000-0002-4113-0958
https://scholar.google.com/citations?user=7Jm4_McAAAAJ&hl=en
https://linkedin.com/in/sohomghosh
https://dblp.org/pid/184/3634.html

Namaste (নমস্কার) 🙏, I am Sohom.



Here's my updated website: https://sohomghosh.github.io/ . I have submitted my PhD Thesis on Financial Natural Language Processing at Jadavpur University, Kolkata, India.



I like developing innovative solutions to real-life challenges. Over the last 9+ years, I have worked to improve the digital experience and financial well-being of millions of users across industries such as Internet and Financial Services. Presently, I work as a Senior Data Scientist in the industry. Before this, I worked for Times Internet (the digital wing of The Times Group) and MathLogic (an AI consulting startup). My research interests include industrial applications of Natural Language Processing and Deep Learning. My current mission is to demystify financial texts for social good.



In addition to being a US patent holder and a co-author of the books Natural Language Processing Fundamentals and The Natural Language Processing Workshop, I have several publications at reputed international venues such as TheWebConf (WWW), CIKM, COLING, LREC, IEEE BigData, and CODS-COMAD. I hold a Master’s degree in Software Systems (with a specialization in Data Analytics) from BITS Pilani, India, and a Bachelor’s degree in Computer Science and Engineering from HIT-K.

Education

PhD in Engineering (2022-present), Jadavpur University, Kolkata
M.Tech in Software Systems (2019), BITS, Pilani
B.Tech in Computer Science & Engineering (2016), WBUT (MAKAUT-WB)

Teaching Experience

Novel Institute of Vocational Training, Big Data Faculty (Sep 2015 - Jan 2016)

Industrial Experience

Fidelity Investments, Senior Data Scientist (Jun 2019 - present)
Times Internet, Data Scientist (Jan 2017 - Jun 2019)
Fn MathLogic, Analyst (July 2016 - Jan 2017)

Publication

# Publication
1 Ghosh, Sohom and Chen, Chung-Chi and Naskar, Sudip Kumar, Generator-Guided Crowd Reaction Assessment, Companion Proceedings of the ACM Web Conference 2024, 2024, https://doi.org/10.1145/3589335.3651512
2 Ghosh, Sohom and Majhi, Arnab and Narayana, Aswartha and Naskar, Sudip Kumar, IndicFinNLP: Financial Natural Language Processing for Indian, LREC-COLING 2024, 2024, https://doi.org/TBD
3 Sohom Ghosh and Ankush Chopra and Sudip Kumar Naskar, Learning to Rank Hypernyms of Financial Terms Using Semantic Textual Similarity, SN Comput. Science, 2023, https://doi.org/10.1007/s42979-023-02134-z
4 Ghosh, Sohom and Naskar, Sudip Kumar, Using Natural Language Processing to Enhance Understandability of Financial Texts, Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023, https://doi.org/10.1145/3570991.3571051
5 Sarkar, Anubhav and Chakraborty, Swagata and Ghosh, Sohom and Naskar, Sudip Kumar, Evaluating Impact of Social Media Posts by Executives on Stock Prices, Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation, 2023, https://doi.org/10.1145/3574318.3574339
6 Ghosh, Sohom and Naskar, Sudip Kumar, LIPI at the FinNLP-2022 ERAI Task: Ensembling Sentence Transformers for Assessing Maximum Possible Profit and Loss from Online Financial Posts, Proceedings of the Fourth Workshop on Financial Technology and Natural Language Processing (FinNLP), 2022, https://doi.org/10.18653/v1/2022.finnlp-1.13
7 Ghosh, Sohom and Sengupta, Shovon and Naskar, Sudip and Singh, Sunny Kumar, FinRAD: Financial Readability Assessment Dataset - 13,000+ Definitions of Financial Terms for Measuring Readability, Proceedings of the 4th Financial Narrative Processing Workshop @LREC2022, 2022, https://aclanthology.org/2022.fnp-1.1
8 Ghosh, Sohom and Naskar, Sudip Kumar, FiNCAT: Financial Numeral Claim Analysis Tool, Companion Proceedings of the Web Conference 2022, 2022, https://doi.org/10.1145/3487553.3524635
9 Sohom Ghosh and Sudip Kumar Naskar, FiNCAT-2: An enhanced Financial Numeral Claim Analysis Tool, Software Impacts, 2022, https://doi.org/10.1016/j.simpa.2022.100288
10 Ghosh, Sohom and Naskar, Sudip Kumar, Detecting context-based in-claim numerals in Financial Earnings Conference Calls, International Journal of Information Technology, 2022, https://doi.org/10.1007/s41870-022-00952-7
Awards

CODS-COMAD 2023, Honourable Mention at the Young Researchers’ Symposium Track (2023)
CODS-COMAD 2024, Travel Grant (2024)