Citizen science on Hive - exploring a neutrino mass signal at CERN’s Large Hadron Collider
The end of the year is always a catastrophe in terms of availability. University duties are heavy, with exams and a decent amount of administration. Moreover, my health is still not great, and issues are piling up on top of each other. Having got COVID at a scientific conference a month ago changed my life quite badly.
For that reason, my on-chain presence is kind of reduced, both for STEMsocial, my own blog and all projects I run here. However, I am never very far and things get eventually done!
Today, I decided to go back to the citizen science project that we started a couple of months ago. This project focuses on a particle physics study of a neutrino mass model at CERN’s Large Hadron Collider (aka the LHC), and the key point is that non-scientific actors from the Hive community will conduct it (under my guidance). I recently wrote a blog on the general context behind this study, that can be found here.
The topic of the present episode concerns the simulation of the neutrino signal that we aim to study. We will first make sure that every participant is comfortable in using the tools we installed a while ago. For this purpose, we plan to reproduce some results from a scientific publication from 2020. In a second stage, those older results will be extended to a situation that has never been considered so far.
In other words, we plan to produce brand new scientific results on Hive!
[Credits: geralt (Pixabay)]
Let’s start with a recap of the previous episodes of our adventure. This should allow anyone motivated in joining us to catch up. In terms on involvement, a couple of hours by episode are in principle sufficient.
- Episode 1 was about the installation of the MG5aMC software dedicated to particle collider simulations. We got seven reports from the participants (agreste, eniolw, gentleshaid, mengene, metabs, servelle and travelingmercies), among which that of @metabs consists of an excellent documentation on how to get started with a virtual machine running on Windows.
- In episode 2 we made use of MG5aMC to generate 10,000 simulated LHC collisions relative to the production of a top-antitop pair at the LHC. We got eight reports from the participants (agreste, eniolw, gentleshaid, isnochys, mengene, metabs, servelle and travelingmercies).
- Episode 3 focused on the installation of MadAnalysis5, a piece of software allowing us to simulate detector effects, reconstruct the output of complex simulations, and the analysis of produced events. We got seven contributions (agreste, eniolw, gentleshaid, isnochys, metabs, servelle and travelingmercies).
- Episode 4 was a study of top-antitop production at CERN’s Large Hadron Collider. We got five contributions (agreste, eniolw, gentleshaid, servelle and travelingmercies). The solutions to the proposed assignments are available here.
As usual, I finish this short introduction with a list of acknowledgements to all participants to this project, and supporters from our community: @agmoore, @agreste, @aiovo, @alexanderalexis, @amestyj, @darlingtonoperez, @eniolw, @firstborn.pob, @gentleshaid, @gtg, @isnochys, @ivarbjorn, @linlove, @mengene, @mintrawa, @robotics101, @servelle, @travelingmercies and @yaziris. Please let me know if you want to be added or removed from this list.
Signal and background at the LHC
In the previous episodes of our citizen science project on Hive, we have used quite a bit the simulation software MG5aMC. All the simulations done so far however relies on the Standard Model of particle physics. This is not what we aim to do in our project targeting the simulation of a signal of a new phenomenon. Of course, we will have to deal with Standard Model simulations too, as they are relevant for the background to our signal.
In practice, our signal will be well hidden in the background, and our simulated collisions will comprise both signal and background events (I remind that one event is simply one simulated collision). It is then up to us to design an appropriate analysis, or a selection strategy, in which various event properties are calculated. We next use them to decide whether a given event is kept or rejected, depending on their numerical value.
The grand goal is thus to choose the properties well, so that as many background events as possible are rejected, and the largest possible part of the signal is selected.
In the blog of today, we focus on the signal only.
Installation of a particle physics model in MG5aMC - task 1
The first step necessary to allow MG5aMC to simulate our signal is to add the model of physics considered in its model database. This model is an extension of the Standard Model that includes new massive neutrinos that mix with the massless neutrinos of the Standard Model to provide them a mass.
This model is available from this page and has been documented in this scientific publication. In order to add it to our local MG5aMC installation, it is sufficient to extract this tarball in the MG5aMC model directory. This is achieved by typing the following commands in a standard shell.
cd MG5_aMC_v2_9_9/models; curl https://feynrules.irmp.ucl.ac.be/raw-attachment/wiki/HeavyN/SM_HeavyN_NLO_UFO.tgz > model.tgz; tar xf model.tgz; rm -f model.tgz
Note that you way have to adapt the path in the first line to correctly refer to the location of your local MG5aMC installation. We can check then that everything is fine by listing the content of the models
subdirectory of the MG5aMC folder (through the command ls
). A folder named SM_HeavyN_NLO
should be there.
This folder contains a Python version of the neutrino mass model considered, called a UFO model. This name is not a joke (see here). UFOs are usual model libraries in particle physics since the time they have been invented in 2010. This is an interesting story about which I may write, one day. It includes a monastery, a few physicists and a lot of beer (I repeat: it is not a joke!).
Here is a screenshot of what I got. Can you reproduce it?
[Credits: @lemouth]
If yes, then you are ready to simulate our signal beyond the Standard Model of particle physics.
About the signal considered
The process considered involves two colliding protons. The reason is simple to get: at CERN’s LHC protons are collided. In reality, however, what matters is not really the protons themselves, but their constituents. In fact we must consider collisions between two of the quarks making the protons, as described in the left part of the figure below (see the symbols q1 and q2 on the left).
This is how it works at very high energies. Protons are composite particles and their content plays the most important role.
[Credits: CMS-EXO-21-003 (CMS @ CERN)]
Our two quarks carry a lot of energy, as their associated protons are accelerated close to the speed of light. Moreover, these quarks are sensitive to weak interactions. As a consequence, each of them has a given probability to emit a W boson, one of the mediators of the weak interactions. Those two W bosons are given in purple in the figure.
What comes next is the interesting part of our new physics process. The two W bosons exchange a heavy neutrino N, in red in the figure, and this leads to the production of two leptons of the same electric charge (see the symbols l1 and l2 on the right part of the figure).
The process described above is precisely the signal that we want to simulate. It depends on one of the new particles of the model N (one of the included heavy neutrinos), and on its properties (how it couples to electrons, muons and taus).
In the reference scientific publication, my collaborators and I studied the production of two muons or two antimuons. In the study that is planned to be conducted on Hive, we will consider all other possibilities (pairs of electrons, pairs of taus, mixed pairs of different leptons, etc.).
For today, however, we want to make sure everybody uses MG5aMC for new physics simulations correctly. For this reason, we consider the production of muons as in the 2020 publication, and aim at reproducing some of the results shown in that article.
Preparing the simulation of a heavy neutrino signal - task 2
We now move on by considering the production rate of the signal described above.
We aim to reproduce the ‘WW purple line’ shown in figure 2 in the reference publication. This is done by starting MG5aMC in a shell, of course after moving back to the main MG5aMC folder as we are still in principle in the models
subdirectory (this is the reason of the cd ..
command below).
cd ..; ./bin/mg5_aMC
Then, we need to convert the Python2 UFO model we downloaded into a Python3 version of it (this can be ignored if you use Python2), import the model and automatically generate a working directory containing a Fortran code that embeds all the physics details of our process.
This is achieved by typing a few commands in the MG5aMC command line interface. I emphasise that the first line of the commands below has to be omitted if you use Python2.
MG5_aMC>set auto_convert_model T MG5_aMC>import model SM_HeavyN_NLO MG5_aMC>define p = g u c d s u~ c~ d~ s~ MG5_aMC>define j = p MG5_aMC>generate p p > mu+ mu+ j j QED=4 QCD=0 $$ w+ w- / n2 n3 MG5_aMC>add process p p > mu- mu- j j QED=4 QCD=0 $$ w+ w- / n2 n3 MG5_aMC>output test_signal
In the above command, we have removed the possibility of having a bottom quark in the proton, as this quark is taken massive and not considered massless as all other quarks. Technically, all quarks are massive, but their masses are just negligibly small compared with the LHC energy. This is achieved with the commands given in the third and fourth line above.
Then, let’s investigate the form of the generate
command (fifth line above). On the left-hand side of the arrow >
, we indicate that two protons are collided (p p
). On the right-hand side of the arrow, we indicate that two antimuons (mu+ mu+
) are produced, which corresponds to the two leptons present in the diagram above, as well as two jets (j j
) that represent the final state quarks. The links between a jet and a quark are detailed here.
The sixth line above (the add process
one) is similar, although we produce this time a pair of muons instead of a pair of antimuons. What matters is the fact that the produced leptons have the same electric charge. This means that we have either a pair of muons, or a pair of antimuons. In the Standard Model, the rate to produce two muons or two antimuons is quite small, so that we expect a small background to our signal. This contrast with a process in which one muon and one antimuon would be produced.
At the end of these commands, a folder named test_signal
is created. This name can be changed to anything you want. The folder contains the Fortran code that will allow us to handle all relevant quantum field theory calculations associated with the process of interest, without knowing much about what is going on in the inside.
Please now type:
open index.html
You can check the diagrams associated with the process considered by following the link ‘Process Information’. You should get to a page similar to the one shown below.
[Credits: @lemouth]
Feel free to click on a few html
links to check the Feynman diagrams associated with the process (they are the same as the one shown above).
Feynman diagrams - assignment 1
The assignments of this week start with an easy question. Please explore the different diagrams that have been generated by MG5aMC (by clicking on the various ‘html’ links, and figure out what are the difference between them? Why are there so many diagrams?
Note that the answer is not expected to be long.
Calculation of the signal production rate - task 3
We are now ready to use the generated code for some computations. We will calculate the rate at which the process of interest occurs at the LHC Run 2. The LHC Run 2 recorded 140 fb-1 of data, so that the rate will tell us how many signal events should be present in data.
The computation is triggered by typing, in the MG5aMC interpreter,
launchAs we are only interested in rates, there is no need to turn on any of the option proposed by the code. Therefore, we only have to type
0
(or press enter) as an answer to the question raised by MG5aMC:
[Credits: @lemouth]
Next, MG5aMC asks us whether we want to modify the parameters of the model (the param_card.dat
file), and whether we want to change the configuration of the process (the run_card.dat
file). We will do both.
Let’s start with the parameters and type 1
in the MG5aMC command line interface.
- Lines 18, 19 and 20 control the masses of the heavy neutrinos. There are three heavy neutrinos in the model, and their identifiers correspond to the numbers
9900012
,9900014
and9900016
that appear in the card.
Here, we want a scenario in which only one heavy neutrino is active. We will thus set the mass of the first neutrino to something small, like 1000 GeV (1 GeV is equal to the proton mass), and the other two to something large, like 1,000,000,000 GeV. This gives (with the line numbers in yellow):[Credits: @lemouth] - Lines 48-56 allow us to control the strength of the couplings of the heavy neutrino with the Standard Model electron, muon and tau. Here, we decoupled the neutrinos N2 and N3 so that only lines 48, 51 and 54 are relevant. Moreover, in our signal, only the interaction with muons matters.
Therefore, the parameter on line 51 is fixed to 1, and the other entries are all set to zero. This gives (still with the line numbers in yellow):[Credits: @lemouth] - Finally, we must recalculate a property of our new neutrino called its width (this tells how fast it decays, and how it decays). This is achieved by modifying line 79 and set the parameter to
auto
. This gives (again with the line numbers in yellow):[Credits: @lemouth]
We are now done for the parameter card. We can save the file and quit (:wq
in the VI editor), which brings us back in the MG5aMC command line interface. We then type 2
to access the editing of the run card.
- We first go to line 42-43 and modify the ‘PDF’ choice for our calculation. We choose to use
lhapdf
, with the PDF set number262000
. Those PDF functions allow us to relate the colliding protons to their constituents. This should give (as usual, the yellow numbers are the line numbers in the file):[Credits: @lemouth] It is interesting to note the two6,500
values that have been set (by default) for the energy of the two colliding beams. These numbers are given in GeV. Their sum yields 13,000 GeV, which corresponds to 13 TeV. This was the collision energy at the LHC Run 2. - On line 96, we replace
10.0 = ptl
by0.0 = ptl
. We do not want to impose any energy threshold on the produced muons. - On line 185, we replace
True = use_syst
byFalse = use_syst
. We do not want to calculate the systematic error associated to our computation.
That’s all, We can save the file (:wq
) and start the run (by pressing enter).
If everything goes well, we should obtain a cross section of 0.012 pb in about 10 minutes (it took 6-7 minutes on my machine), which is equal to 12 fb. Multiplying by 140 fb-1, we predict a number of about 1680 signal events potentially hidden in LHC run 2 data.
Signal rate dependence on the neutrino mass - assignment 2
Let’s now repeat the exercise above, but for heavy neutrino masses varying from 50 GeV to 20,000 GeV. The idea is, to reproduce the purple line in figure 2 of our reference publication.
In other words, we want a plot with the heavy neutrino mass on the X axis and the value of the cross section, either in pb or in fb, on the Y axis. Please report both your plot and the associated numerical values in your next #citizenscience report. I leave you the choice to decide how many mass points you need and how to plot the results.
Although we can do the exercise manually for each mass value considered, but we can also use MG5aMC in its scanning mode (which is good for people who do not want to babysit their computer).
To this end, we can fix the mass parameter as given in the image below (for a choice of masses in the [50, 500, 2500,10000] GeV set, which is not sufficient to get a smooth curve). As usual, the numbers in yellow are the line numbers in the param_card.dat
file.
[Credits: @lemouth]
Please share the obtained plot with us, and let us know whether it matches that of the reference paper. If there are differences, do you know where they come from?
Signal rate at LHC run 3 - final assignment
Let’s now produce new predictions never computed before!
We repeat the above exercise, but in the light of LHC Run 3 that will start on July 5th. The difference is that the collision energy will increase from 13,000 GeV to 13,600 GeV. This should be fixed in the run_card.dat
file.
As above, generate a figure depicting the dependence of the rate on the heavy neutrino mass (heavy neutrino mass on the X axis and the value of the cross section, either in pb or in fb, on the Y axis), share it with us in your report, as well as the numbers you obtained.
In addition, please let me know how long it took you to get results for one mass point, and to which extent you are able to share files of a few MB with us. This is important so that I could assess how to proceed for the next steps of this project, and to design a file sharing strategy with @gtg (who will help us in managing our data; thanks to him in advance).
Summary: a neutrino mass signal at the LHC
In this fifth episode of our citizen science project on Hive, we focused on an LHC signal relevant for a neutrino mass model.
The idea was to reproduce results from this reference publication, namely the production rate of a signal in which the production of two muons or two antimuons is induced by the exchange of a new heavy neutrino in an LHC collision. The motivation for such a signal is that the heavy neutrino is a potential candidate to explain why the neutrinos of the Standard Model are massive.
However, whereas reproducing older results is good to verify we understand what we are doing, producing new predictions never made before is also something great. For that reason, we updated a figure of the reference publication to the collider energy expected in the LHC Run 3, that is supposed to start in 5 days.
When this task will be accomplished by the participants to this project, for the first time ever Hive will have been used to produce new particle physics results never calculated before!
I hope you will all enjoy the activities of this week. I am planning to release the solution to the assignments in two weeks, and then go with another episode in 3 weeks just before I will leave for vacation.
I am looking forward to read the reports of all interested participants. The #citizenscience tag is waiting for you all! Good luck and have a nice week-end!
https://twitter.com/BenjaminFuks/status/1542478103146029059
The rewards earned on this comment will go directly to the people( @lemouth ) sharing the post on Twitter as long as they are registered with @poshtoken. Sign up at https://hiveposh.com.
Amazing, unique, historic....and you are battling COVID!
I am so sorry the effects of this 21st century plague is taking such a toll on you. Please take care of yourself.
It is not yet amazing and historic. This will be when the first reports will be made available (hopefully we will have at least a few of them :).
Health-wise, every day comes with its sets of little annoying things. But every day is different in the sense that at least, I never know what to expect :D
🙁
Thanks for your contribution to the STEMsocial community. Feel free to join us on discord to get to know the rest of us!
Please consider delegating to the @stemsocial account (85% of the curation rewards are returned).
You may also include @stemsocial as a beneficiary of the rewards of this post to get a stronger support.
Hello dear friend @lemouth, I am pleased to see another report on citizen science in hive, with the introduction that you made is visualized the dedication that the participants have had with each of the programmed activities. The project is enrumbado and will be a success when they finish it, also as I said above is a precedent for other areas of knowledge for the execution of science projects in hive.
Thank you for keeping us always informed, until later, that you have an excellent week and above all a lot of health.
The pleasure is for me, and I truly hope that this project will go to the end, in a few month: a real scientific publication by Hive community members :)
The initiative goes on. Will be there!
Great! I am looking forward to read your upcoming report! And please be ready for the next 4 episodes, that are well formed in my head ;)
I am still trying find some time to participate on that!! Thanks again
!1UP
Feel free to. It only requires a few hours of your time per episode, and I am totally available (on chain and on the STEMsocial Discord server) to answer any question and to provide help if needed. I recommend however to start with episode 1 that is needed for the present episode ;)
Cheers!
You have received a 1UP from @gwajnberg!
@stem-curator, @vyb-curator, @pob-curator, @neoxag-curator
And they will bring !PIZZA 🍕.
Learn more about our delegation service to earn daily rewards. Join the Cartel on Discord.
Congratulations @lemouth! You have completed the following achievement on the Hive blockchain and have been rewarded with new badge(s):
Your next target is to reach 97000 upvotes.
You can view your badges on your board and compare yourself to others in the Ranking
If you no longer want to receive notifications, reply to this comment with the word
STOP
Check out the last post from @hivebuzz:
excellent
Thanks for passing by!
Congratulations @lemouth! You received a personal badge!
Wait until the end of Power Up Day to find out the size of your Power-Bee.
May the Hive Power be with you!
You can view your badges on your board and compare yourself to others in the Ranking
Check out the last post from @hivebuzz:
Oh, this is exciting, something to keep me busy this week-end! 😀
I hope you will enjoy this episode, and I am looking forward to read your report! ^^
I started working on it, there was a prompt about the new version of MG5_aMC and I installed the updated version. Is there any chance that it is related to these problems I have encountered:
(1) I couldn't access the diagrams as it says that there were no corresponding html for the Feynman diagrams in the SubProcesses folder of my output folder
(2) After modifying the parameters in the param_card.dat and run_card.dat, I encountered an error upon running the code.
Is the output folder has to be the same folder for generating the simulation events of top-antitop collisions from before?
Thank you. :)
Mmmh interesting issues. Let's try to fix them together.
(1) I have not updated MG5aMC on my side and it still works like a charm. Can you please share the full MG5aMC history of commands you typed?
(2) Can you share your param_card and run_card with me, as well as details with respect to the error (for instance a copy of the screen output)?
Thanks in advance!
Here's the output upon launching the MG5aMC after installing the new version:
I was able to load the model and was able to generate the subprocesses (although, I still can't access the diagrams when I try to click the html link)
I launched MG5_aMC and modified the card.dat files:
And, here's the compilation error log after trying to run the code:
Thank you ! :)
The fact that you don't have the diagrams is weird, as everything seems to have worked at ths stage. Is there any warning message printed to the screen related to the diagrams? If not, maybe you can check manually where they are.
To this aim, please go to the
test_signal/SubProcesses
folder (I don’t know iftest_signal
is the name you choose; if not, please adapt it). There, you should see a few folders starting withP1_
andP2_
. There are jpg files in these folders. They include the diagrams.Does it work?
There is an apparent mistake with the run_card. The PDF-related lines should be
lhapdf = pdlabel
and262000 = lhaid
(check my version of the run card in the post). This is what causes the compilation failure of MG5aMC.Un-relately, you have an extra non zero parameter in the paramcard (in the mixing parameters). Also, please don’t forget to set the calculation of the decay width of the heavy neutrino to automatic (
auto
). I don't know whether you have done it.Good luck, and please let me know if the comments above help you to fix all issues!
Cheers!
Congratulations @lemouth! You received a personal badge!
Participate in the next Power Up Day and try to power-up more HIVE to get a bigger Power-Bee.
May the Hive Power be with you!
You can view your badges on your board and compare yourself to others in the Ranking
Check out the last post from @hivebuzz:
Just getting to read this. I will try to work on it over the weekend and see if I can drop the report as well or sometime next week. Not sure about the assignment though. However, I will try my best to see if I can answer them.
Did you manage? How did it go?
I'm on it. If the electricity supply stays stable, my reports should be tomorrow
I am very excited to see them ^^
Currently on it.